## Abstract

A common feature of ecological data sets is their tendency to contain many zero values. Statistical inference based on such data are likely to be inefficient or wrong unless careful thought is given to how these zeros arose and how best to model them. In this paper, we propose a framework for understanding how zero-inflated data sets originate and deciding how best to model them. We define and classify the different kinds of zeros that occur in ecological data and describe how they arise: either from 'true zero' or 'false zero' observations. After reviewing recent developments in modelling zero-inflated data sets, we use practical examples to demonstrate how failing to account for the source of zero inflation can reduce our ability to detect relationships in ecological data and at worst lead to incorrect inference. The adoption of methods that explicitly model the sources of zero observations will sharpen insights and improve the robustness of ecological analyses.

Original language | English (US) |
---|---|

Pages (from-to) | 1235-1246 |

Number of pages | 12 |

Journal | Ecology Letters |

Volume | 8 |

Issue number | 11 |

DOIs | |

State | Published - Nov 2005 |

Externally published | Yes |

## Keywords

- Bayesian inference
- Detectability
- Excess zeros
- False negative
- Mixture model
- Observation error
- Sampling error
- Zero inflation
- Zero-inflated Poisson
- Zero-inflated binomial

## ASJC Scopus subject areas

- Ecology, Evolution, Behavior and Systematics