Fraction of variance unexplained

Fraction of variance unexplained

In statistics, the fraction of variance unexplained (or FVU) in the context of a regression task is the amount of variance of the regressand "Y" which cannot be explained, i.e., which is not correctly predicted, by the explanatory variable "X".

For a more general definition of explained/unexplained variation/randomness/variance, see the article explained variation.

Formal definition

Given a regression function "f"(·) yielding for each "yi", 1leq ileq N, an estimate widehat{y}_i = f(x_i), we have:

:egin{align}FVU &= {SS_E over SS_T} = 1-{SS_R over SS_T}. \ &= 1 - R^2,end{align}

where "R"2 is the coefficient of determination and

:egin{align} SS_E&=sum{}_{i=1}^N;(y_i - widehat{y_i})^2\SS_T&=sum{}_{i=1}^N;(y_i-ar{y})^2 \SS_R&=sum{}_{i=1}^N;(widehat{y_i}-ar{y})^2 ext{ and } \ar{y}&=frac{1}{N}sum{}_{i=1}^N;y_i.end{align}

Alternatively, the fraction of variance unexplained can be defined as:

:FVU = frac{MSE(f)}{mathrm{var} [Y] } = frac{mathrm{var} [Y - f(X)] }{mathrm{var} [Y] },

where "MSE"("f") is the mean squared error of the regression function "f"(·).

Explanation

It is useful to consider the second definition to get the idea behind FVU. When trying to predict "Y", the most naïve regression function that we can think of is the constant function predicting the mean of "Y", i.e., f(x_i)=ar{y}. It follows that the MSE of this function equals the variance of "Y"; that is, "SSE" = "SST", and "SSR" = 0. In this case, the variations in "Y" cannot be accounted for, and the FVU then has its maximum value of 1.

The FVU will also be 1 if the explanatory variable "X" tells us nothing about "Y" in the sense that the predicted values of "Y" do not covary with "Y". But as prediction gets better and the MSE can be reduced, the FVU goes down. In the case of perfect prediction where hat{y}_i = y_i, the MSE is 0, "SSE" = 0, "SST = SSE", and the FVU is 0.

ee also

* Coefficient of determination
* Correlation
* Regression analysis
* Linear regression


Wikimedia Foundation. 2010.

Игры ⚽ Поможем написать курсовую

Look at other dictionaries:

  • Unexplained — means not explained and may refer to:* Unexplained , a 1992 rock music album by EMFTelevision * The Unexplained , a 1990s documentary television series * Unexplained Mysteries , a 2003 documentary television series * Unexplained Canada , a 2006… …   Wikipedia

  • Coefficient of determination — In statistics, the coefficient of determination R2 is used in the context of statistical models whose main purpose is the prediction of future outcomes on the basis of other related information. It is the proportion of variability in a data set… …   Wikipedia

  • Explained variation — In statistics, explained variation or explained randomness measures the proportion to which a mathematical model accounts for the variation (= apparent randomness) of a given data set. Often, variation is quantified as variance; then, the more… …   Wikipedia

  • List of statistics topics — Please add any Wikipedia articles related to statistics that are not already on this list.The Related changes link in the margin of this page (below search) leads to a list of the most recent changes to the articles listed below. To see the most… …   Wikipedia

  • Correlation — In probability theory and statistics, correlation, (often measured as a correlation coefficient), indicates the strength and direction of a linear relationship between two random variables. In general statistical usage, correlation or co relation …   Wikipedia

  • Correlation and dependence — This article is about correlation and dependence in statistical data. For other uses, see correlation (disambiguation). In statistics, dependence refers to any statistical relationship between two random variables or two sets of data. Correlation …   Wikipedia

  • List of mathematics articles (F) — NOTOC F F₄ F algebra F coalgebra F distribution F divergence Fσ set F space F test F theory F. and M. Riesz theorem F1 Score Faà di Bruno s formula Face (geometry) Face configuration Face diagonal Facet (mathematics) Facetting… …   Wikipedia

  • statistics — /steuh tis tiks/, n. 1. (used with a sing. v.) the science that deals with the collection, classification, analysis, and interpretation of numerical facts or data, and that, by use of mathematical theories of probability, imposes order and… …   Universalium

  • Cosmic microwave background radiation — CMB and Cosmic background radiation redirect here. For other uses see CMB (disambiguation) and Cosmic background (disambiguation). Physical cosmology …   Wikipedia

  • KABBALAH — This entry is arranged according to the following outline: introduction general notes terms used for kabbalah the historical development of the kabbalah the early beginnings of mysticism and esotericism apocalyptic esotericism and merkabah… …   Encyclopedia of Judaism

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”