Tweedie distributions

Tweedie distributions

In probability and statistics, the Tweedie distributions are a family of probability distributions which include continuous distributions such as the normal and gamma, the purely discrete scaled Poisson distribution, and the class of mixed compound Poisson-Gamma distributions which have positive mass at zero, but are otherwise continuous.Tweedie MCK (1984). An index which distinguishes between some important exponential families. In ‘Statistics Applications and New Directions’, Proceedings of the Indian Statistical Institute Golden Jubilee International Conference. (Ed. JK Ghosh and J Roy) pp. 579-604. (Indian Statistical Institute: Calcutta)] Tweedie distributions belong to the exponential dispersion model family of distributions, a generalization of the exponential family, which are the response distributions for generalized linear models.

Tweedie distributions have a mean mu and a variance phi mu^p, where phi>0 is a "dispersion parameter", and p, called the index parameter, (uniquely) determines the distribution in the Tweedie family. Special cases include:
* p=0 is the normal distribution
* p=1 with phi=1 is the Poisson distribution
* p=2 is the gamma distribution
* p=3 is the inverse Gaussian distribution. Tweedie distributions exist for all real values of p except for 0.Jørgensen, B. 1987. Exponential dispersion models (with discussion). J. R. Stat. Soc. Ser. B Stat. Methodol. 49: 127--162] Apart from the four special cases identified above, their probability density function have no closed form. However, software is available that enables the accurate computation of the Tweedie densities (and probability distribution functions). [Dunn, P. K. and Smyth, G. K. 2005.Series evaluation of Tweedie exponential dispersion models densities. Statistics and Computing, 15: 267--280] [Dunn, P. The tweedie Package, (2007) http://cran.r-project.org/web/packages/tweedie/index.html]

The Tweedie distributions were so named by Bent Jørgensen after M.C.K. Tweedie, a medical statistician at the University of Liverpool, UK, who presented the first thorough study of these distributions in 1984.

The index parameter p defines the type of distribution:
* For p<0, the data y are supported on the whole real line (but, interestingly, mu>0). Applications for these distribution are unknown.
* For p=0 (the normal distribution), the data y and the mean mu are supported on the whole real line.
* For 0, no distributions exist
* For p=1, the distribution exist on the non-negative integers
* For 1, the distribution is continuous on the positive reals, plus an added mass (exact zero) at Y=0. For example, consider monthly rainfallDunn, P. K. 2004.Occurrence and quantity of precipitation can be modelled simultaneously. International Journal of Climatology. 24: 1231--1239.] . When no rain is recorded, an exact zero is recorded. If rain is recorded, a continuous amount results. These distributions are also called the Poisson-gamma distributions, since they can be represented as the Poisson sum of gamma distributions. [Dunn, Peter; Smyth, Gordon. (2005). Series evaluation of Tweedie exponential dispersion model densities Statistics and Computing, Volume 15, Number 4, October 2005 , pp. 267-280(14)http://portal.acm.org/citation.cfm?id=1093724.1093748&coll=&dl=] They are therefore a type of compound Poisson distribution.
* For p>2, the data y are supported on the non-negative reals, and mu>0. These distribution are like the gamma distribution (which corresponds to p=2), but are progressively more right-skewed as p gets larger.

Applications

Applications of Tweedie distributions (apart from the four special cases identified) include:
* actuarial studies [Haberman, S. and Renshaw, A. E. 1996. Generalized linear models and actuarial science. The Statistician, 45: 407--436.] [Renshaw, A. E. 1994.Modelling the claims process in the presence of covariates. ASTIN Bulletin 24: 265--286.] [Jørgensen, B. and Paes de Souza, M. C. 1994. Fitting Tweedie's compound Poisson model to insurance claimsdata. Scand. Actuar. J. 1: 69--93.] [Haberman, S., and Renshaw, A. E. 1998.Actuarial applications of generalized linear models. In Statistics in Finance, D. J. Hand and S. D. Jacka (eds), Arnold, London.] [Millenhall, S. J. 1999. A systematic relationship between minimum bias and generalized linear models. 1999 Proceedings of the Casualty Actuarial Society 86: 393--487.] [Murphy, K. P., Brockman, M. J., and Lee, P. K. W. (2000). Using generalized linear models to build dynamic pricing systems. Casualty Actuarial Forum, Winter 2000.] [Smyth, G. K., and Jørgensen, B. 2002. Fitting Tweedie's compound Poisson model to insurance claims data: dispersion modelling. ASTIN Bulletin 32: 143--157.]

* assay analysis [Davidian, M. 1990. Estimation of variance functions in assays with possible unequalreplication and nonnormal data. Biometrika 77: 43--54.] [Davidian, M., Carroll, R. J. and Smith, W. 1988. Variance functions and the minimum detectable concentration in assays. Biometrika 75: 549--556.]

* survival analysis [Aalen, O. O. 1992. Modelling heterogeneity in survival analysis by the compound Poisson distribution. Ann. Appl. Probab. 2: 951--972.] [Hougaard, P. , Harvald, B. and Holm, N. V. 1992.Measuring the similarities between the lifetimes of adult Danish twins born between 1881--1930. J. Amer. Statist. Assoc. 87: 17--24.] [Hougaard, P. 1986. Survival models for heterogeneous populations derived from stable distributions. Biometrika, 73: 387--396.]

* ecology [Perry, J. N. 1981. Taylor's power law for dependence of variance on mean in animal populations.J. Roy. Statist. Soc. Ser. C 30: 254--263.]

* analysis of alcohol consumption in British teenagers [Gilchrist, R. and Drinkwater, D. 1999.Fitting Tweedie models to data with probability of zero responses. Proceedings of the 14th InternationalWorkshop on Statistical Modelling, Graz, pp. 207--214.]

* medical applications Smyth, G. K. 1996. Regression analysis of quantity data with exact zeros.Proceedings of the Second Australia--Japan Workshop on Stochastic Models in Engineering, Technology and Management. Technology Management Centre, University of Queensland, 572--580.]

* meteorology and climatology

* fisheries [Candy, S. G. 2004. Modelling catch and effort data using generalized linear models,the Tweedie distribution, random vessel effects and random stratum-by-year effects.CCAMLR Science. 11: 59--80.]

References

Further reading

* Kaas, R. (2005). Compound Poisson distribution and GLM’s – Tweedie’s distribution. Handelingen van het contactforum 3rd Actuarial and Financial Mathematics Day (4 February 2005), 3-12. http://ucs.kuleuven.be/seminars_events/other/files/3afmd/Kaas.PDF

* Ohlsson, E and Johansson, B. Exact Credibility and Tweedie Models, University of Stockholm, Research report , October 2003. http://www.math.su.se/matstat/reports/seriea/2003/rep15/report.pdf

* Smith, CAB. (1997). Obituary: Maurice Charles Kenneth Tweedie, 1991-96 Journal of the Royal Statistical Society: Series A (Statistics in Society) 160 (1), 151–154. doi:10.1111/1467-985X.00052

* Smyth, G. K., and Jørgensen, B. (2002). Fitting Tweedie's compound Poisson model to insurance claims data: dispersion modelling. ASTIN Bulletin 32, 143-157. 6/2002 http://www.statsci.org/smyth/pubs/insuranc.pdf

* Tweedie, M. C. K. (1956) Some statistical properties of inverse Gaussian distributions. Virginia J. Sci. (N.S.) 7 (1956), 160--165.

* Tweedie distributions. http://www.statsci.org/s/tweedie.html

* Tweedie generalized linear model family. http://www.statsci.org/s/tweedief.html

* Examples of use of the model. http://www.sci.usq.edu.au/staff/dunn/Datasets/tech-glms.html#Tweedie


Wikimedia Foundation. 2010.

Игры ⚽ Поможем сделать НИР

Look at other dictionaries:

  • Tweedie — is a surname of Scottish origin. The name is a habitational name from Tweedie, located in the parish of Stonehouse, south of Glasgow. The origin and meaning of the name is unknown. [ [http://www.ancestry.com/facts/Tweedie family history.ashx… …   Wikipedia

  • Generalized linear model — In statistics, the generalized linear model (GLM) is a flexible generalization of ordinary least squares regression. It relates the random distribution of the measured variable of the experiment (the distribution function ) to the systematic (non …   Wikipedia

  • List of statistics topics — Please add any Wikipedia articles related to statistics that are not already on this list.The Related changes link in the margin of this page (below search) leads to a list of the most recent changes to the articles listed below. To see the most… …   Wikipedia

  • List of mathematics articles (T) — NOTOC T T duality T group T group (mathematics) T integration T norm T norm fuzzy logics T schema T square (fractal) T symmetry T table T theory T.C. Mits T1 space Table of bases Table of Clebsch Gordan coefficients Table of divisors Table of Lie …   Wikipedia

  • Inverse Gaussian distribution — Probability distribution name =Inverse Gaussian type =density pdf | cdf parameters =lambda > 0 mu > 0 support = x in (0,infty) pdf = left [frac{lambda}{2 pi x^3} ight] ^{1/2} exp{frac{ lambda (x mu)^2}{2 mu^2 x cdf = Phileft(sqrt{frac{lambda}{x… …   Wikipedia

  • Markov chain — A simple two state Markov chain. A Markov chain, named for Andrey Markov, is a mathematical system that undergoes transitions from one state to another, between a finite or countable number of possible states. It is a random process characterized …   Wikipedia

  • Ext3 — infobox filesystem name = ext3 full name = Third extended file system developer = Stephen Tweedie introduction os = Linux 2.4.15 introduction date = November 2001 partition id = 0x83 (MBR) EBD0A0A2 B9E5 4433 87C0 68B6B72699C7 (GPT) directory… …   Wikipedia

  • Normal distribution — This article is about the univariate normal distribution. For normally distributed vectors, see Multivariate normal distribution. Probability density function The red line is the standard normal distribution Cumulative distribution function …   Wikipedia

  • Probability distribution — This article is about probability distribution. For generalized functions in mathematical analysis, see Distribution (mathematics). For other uses, see Distribution (disambiguation). In probability theory, a probability mass, probability density …   Wikipedia

  • Natural exponential family — In probability and statistics, the natural exponential family (NEF) is a class of probability distributions that is a special case of an exponential family (EF). Many common distributions are members of a natural exponential family, and the use… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”