F-test

An F-test is any statistical test in which the test statistic has an F-distribution if the null hypothesis is true. The name was coined by George W. Snedecor, in honour of Sir Ronald A. Fisher, who initially developed the statistic as the variance ratio in the 1920s [Lomax, Richard G. (2007) "Statistical Concepts: A Second Course", p. 10, ISBN 0-8058-5850-4]. Examples include:

* The hypothesis that the means of multiple normally distributed populations, all having the same standard deviation, are equal. This is perhaps the most well-known of hypotheses tested by means of an F-test, and the simplest problem in the analysis of variance (ANOVA).

* The hypothesis that a proposed regression model fits well. See Lack-of-fit sum of squares.

* The hypothesis that the standard deviations of two normally distributed populations are equal, and thus that they are of comparable origin.

Note that when equality of variances (or standard deviations) is being tested, the F-test is extremely sensitive to non-normality. Even if the data display only modest departures from the normal distribution, the test is unreliable and should not be used.

Formula and calculation

The formula for the F-test statistic in ANOVA problems comparing multiple groups is: "F" = (between-group variability) / (within-group variability)

(Note: when there are only two groups, the F-ratio equals "t"², where "t" is the Student's "t" statistic.)
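
The two-group identity can be checked numerically. The following is a minimal sketch, assuming a pooled-variance (equal-variance) Student's t statistic; the function names and both data sets are illustrative choices, not part of the original text.

```python
from statistics import mean


def one_way_f(groups):
    """Return F = (between-group mean square) / (within-group mean square)."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    return (ss_between / (k - 1)) / (ss_within / (n_total - k))


def pooled_t(x, y):
    """Return Student's t statistic using a pooled variance estimate."""
    nx, ny = len(x), len(y)
    ss = sum((v - mean(x)) ** 2 for v in x) + sum((v - mean(y)) ** 2 for v in y)
    pooled_var = ss / (nx + ny - 2)
    return (mean(x) - mean(y)) / (pooled_var * (1 / nx + 1 / ny)) ** 0.5


# Illustrative (hypothetical) scores for two groups.
group_1 = [6, 8, 4, 5, 3, 4]
group_2 = [8, 12, 9, 11, 6, 8]

f_stat = one_way_f([group_1, group_2])
t_stat = pooled_t(group_1, group_2)
print(f_stat, t_stat ** 2)  # the two values agree up to rounding error
```

The identity holds for the pooled-variance t statistic; a t statistic computed with unequal-variance corrections would not in general satisfy it.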

In many cases, the F-test statistic can be calculated through a straightforward process. In the case of regression: consider two models, 1 and 2, where model 1 is nested within model 2. That is, model 1 has "p"₁ parameters, and model 2 has "p"₂ parameters, where "p"₂ > "p"₁. (Any constant parameter in the model is included when counting the parameters. For instance, the simple linear model "y" = "mx" + "b" has "p" = 2 under this convention.) If both models are fitted to the same "n" data points, then calculate "F" as

: $F = \frac{\left(\frac{\mathrm{RSS}_1 - \mathrm{RSS}_2}{p_2 - p_1}\right)}{\left(\frac{\mathrm{RSS}_2}{n - p_2}\right)}$ [GraphPad Software Inc (2007-10-11). "How the F test works to compare models". http://www.graphpad.com/help/Prism5/prism5help.html?howtheftestworks.htm]

where RSS_"i" is the residual sum of squares of model "i". If the regression model has been calculated with weights, replace RSS_"i" with χ², the weighted sum of squared residuals. Under the null hypothesis that none of the additional "p"₂ − "p"₁ parameters differs from zero, "F" has an F-distribution with ("p"₂ − "p"₁, "n" − "p"₂) degrees of freedom. The probability of seeing a decrease in χ² at least this large purely by chance, if the null hypothesis were true, is the upper-tail probability of that distribution at the calculated "F". The null hypothesis is rejected if the calculated "F" is greater than the critical value of the F-distribution for the desired false-rejection probability (e.g. 0.05).
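
As a rough illustration of the nested-model formula, the sketch below computes "F" and its degrees of freedom from two residual sums of squares. The function name and the numbers in the example call are hypothetical and chosen only to make the arithmetic easy to follow.

```python
def nested_f_statistic(rss_1, rss_2, p_1, p_2, n):
    """Return (F, df_numerator, df_denominator) for nested models 1 and 2.

    rss_1, rss_2: residual sums of squares of the smaller and larger model.
    p_1, p_2:     parameter counts (constant term included), with p_2 > p_1.
    n:            number of data points used to fit both models.
    """
    if p_2 <= p_1:
        raise ValueError("model 2 must have more parameters than model 1")
    if n <= p_2:
        raise ValueError("n must exceed the number of parameters in model 2")
    df_num = p_2 - p_1
    df_den = n - p_2
    f_value = ((rss_1 - rss_2) / df_num) / (rss_2 / df_den)
    return f_value, df_num, df_den


# Hypothetical example: 24 points, a 2-parameter and a 4-parameter model.
f_value, df_num, df_den = nested_f_statistic(
    rss_1=120.0, rss_2=80.0, p_1=2, p_2=4, n=24
)
print(f_value, df_num, df_den)  # 5.0 2 20
```

The resulting "F" would then be compared with the critical value of the F-distribution with (df_num, df_den) degrees of freedom.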

Table of F-test critical values

A table of F-test critical values can be found [http://www.itl.nist.gov/div898/handbook/eda/section3/eda3673.htm here] and is included in most statistical texts.

One-way ANOVA example

a₁, a₂, and a₃ are the three levels of the factor being studied. To calculate the F-ratio:

First, calculate the A_i values, where i refers to the number of the condition:

A₁ = Σa₁ = 6 + 8 + 4 + 5 + 3 + 4 = 30
A₂ = 54
A₃ = 60

Second, calculate Ȳ_Ai, the average of the values in condition a_i:

Ȳ_A1 = (6 + 8 + 4 + 5 + 3 + 4) / 6 = 5
Ȳ_A2 = 9
Ȳ_A3 = 10

Third, calculate these values:

T = Total = ΣA_i = A₁ + A₂ + A₃ = 30 + 54 + 60 = 144

Ȳ_T = Average overall score = T / (an) = 144 / (3 × 6) = 8
Note that a = the number of conditions and n = the number of participants in each condition.

[Y] = ΣY² = 1304
This is every score in every condition squared and then summed.

[A] = (ΣA_i²) / n = (30² + 54² + 60²) / 6 = 1236

[T] = T² / (an) = 144² / 18 = 1152

Now we calculate the sums of squares:

SS_A = [A] - [T] = 84

SS_S/A = [Y] - [A] = 68
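
The bracket terms and sums of squares can be reproduced from the summary figures quoted in the text. This is a small sketch that takes the condition totals, ΣY² and n as given; the variable names are illustrative.

```python
# Summary figures taken from the worked example.
condition_totals = [30, 54, 60]   # A_1, A_2, A_3
sum_of_squared_scores = 1304      # [Y]: every score squared, then summed
n = 6                             # participants per condition
a = len(condition_totals)         # number of conditions

total = sum(condition_totals)                            # T
bracket_a = sum(A ** 2 for A in condition_totals) / n    # [A]
bracket_t = total ** 2 / (a * n)                         # [T]

ss_a = bracket_a - bracket_t                  # between-conditions sum of squares
ss_s_a = sum_of_squared_scores - bracket_a    # within-conditions sum of squares

print(total, bracket_a, bracket_t)  # 144 1236.0 1152.0
print(ss_a, ss_s_a)                 # 84.0 68.0
```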

The degrees of freedom are now calculated:

df_A = a − 1 = 2

df_S/A = a(n − 1) = 15

Finally, the mean squares are calculated:

MS_A = SS_A / df_A = 84 / 2 = 42

MS_S/A = SS_S/A / df_S/A = 68 / 15 ≈ 4.53

The F-ratio is then:

F = MS_A / MS_S/A = 42 / 4.53 ≈ 9.26

Now look up the critical value F_crit for the problem:

F_crit(2, 15) = 3.68 at α = 0.05. Since the calculated F of 9.26 exceeds 3.68, the result is significant at the 5% level and the null hypothesis can be rejected.
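
The final steps can be checked without a printed table. The sketch below relies on a closed form for the upper tail of the F-distribution that is valid only when the numerator has 2 degrees of freedom, which happens to be the case in this example: P(F > x) = (1 + 2x / df₂)^(−df₂ / 2). For other degrees of freedom a statistics library would be needed instead. The function names are illustrative.

```python
def f_upper_tail_df1_2(x, df2):
    """P(F > x) for an F(2, df2) variable (numerator df must be 2)."""
    return (1.0 + 2.0 * x / df2) ** (-df2 / 2.0)


def f_critical_df1_2(alpha, df2):
    """Critical value x with P(F > x) = alpha for F(2, df2)."""
    return (df2 / 2.0) * (alpha ** (-2.0 / df2) - 1.0)


# Sums of squares and design sizes from the worked example.
ss_a, ss_s_a = 84, 68
a, n = 3, 6

df_a = a - 1            # numerator degrees of freedom (2)
df_s_a = a * (n - 1)    # denominator degrees of freedom (15)

f_ratio = (ss_a / df_a) / (ss_s_a / df_s_a)
f_crit = f_critical_df1_2(0.05, df_s_a)
p_value = f_upper_tail_df1_2(f_ratio, df_s_a)

print(round(f_ratio, 2), round(f_crit, 2))  # 9.26 3.68
print(f_ratio > f_crit)                     # True
```

Under these assumptions the p-value comes out well below 0.05, which agrees with the table-based conclusion.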

Note that the notation F(x, y) means that there are x degrees of freedom in the numerator and y degrees of freedom in the denominator.


Wikimedia Foundation. 2010.
