A case study in handling over-dispersion in nematode count data

Date

2010-07-01T12:53:45Z

Journal Title

Journal ISSN

Volume Title

Publisher

Kansas State University

Abstract

Traditionally the Poisson process is used to model count response variables. However, a problem arises when the particular response variable contains an inordinate number of both zeros and large observations, relative to the mean, for a typical Poisson process. In cases such as these, the variance of the data is greater than the mean and as such the data are over-dispersed with respect to the Poisson distribution due to the fact that the mean equals the variance for the Poisson distribution. This case study looks at several common and uncommon ways to attempt to properly account for this over-dispersion in a specific set of nematode count data using various procedures in SAS 9.2. These methods include but are not limited to a basic linear regression model, a generalized linear (log-linear) model, a zero-inflated Poisson model, a generalized Poisson model, and a Poisson hurdle model. Based on the AIC statistics the generalized log-linear models with the Pearson-scale and deviance-scale corrections perform the best. However, based on residual plots, none of the models appear to fit the data adequately. Further work with non-parametric methods or the negative binomial distribution may yield more ideal results.

Description

Keywords

Overdispersion, Poisson Hurdle, Zero Inflated Poisson, Generalized Poisson, Nematode

Graduation Month

August

Degree

Master of Science

Department

Department of Statistics

Major Professor

Leigh W. Murray

Date

2010

Type

Report

Citation