Linear Regression

In statistics, linear regression is an approach to modeling the relationship between a scalar dependent variable y and one or more explanatory variables denoted X. With a single explanatory variable the model is called simple linear regression; with more than one it is called multiple linear regression. (Multiple linear regression should in turn be distinguished from multivariate linear regression, where several correlated dependent variables are predicted rather than a single scalar variable.)

In linear regression, data are modeled using linear predictor functions, and the unknown model parameters are estimated from the data. Such models are called linear models. Most commonly, linear regression refers to a model in which the conditional mean of y given the value of X is an affine function of X. Less commonly, linear regression can refer to a model in which the median, or some other quantile, of the conditional distribution of y given X is expressed as a linear function of X. Like all forms of regression analysis, linear regression focuses on the conditional probability distribution of y given X, rather than on the joint probability distribution of y and X, which is the domain of multivariate analysis.
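
As an illustration of the most common case, the statement that the conditional mean of y is an affine function of X can be written out as below. The coefficient symbols (an intercept and one slope per explanatory variable) are notation assumed here for the sketch; they do not appear in the text above.

```latex
% Assumed notation: \beta_0 is the intercept, \beta_1, \dots, \beta_p are the
% slopes attached to the p explanatory variables x_1, \dots, x_p.
\operatorname{E}[\, y \mid X = x \,] = \beta_0 + \beta_1 x_1 + \cdots + \beta_p x_p
```

The model is linear in the unknown parameters, which is what makes it a linear model in the sense used here.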

Linear regression was the first type of regression analysis to be studied rigorously, and to be used extensively in practical applications. This is because models which depend linearly on their unknown parameters are easier to fit than models which are non-linearly related to their parameters and because the statistical properties of the resulting estimators are easier to determine.

Linear regression has many practical uses. Most applications of linear regression fall into one of the following two broad categories:

  • If the goal is prediction, or forecasting, linear regression can be used to fit a predictive model to an observed data set of y and X values. After developing such a model, if an additional value of X is then given without its accompanying value of y, the fitted model can be used to make a prediction of the value of y.
  • Given a variable y and a number of variables X1, ..., Xp that may be related to y, linear regression analysis can be applied to quantify the strength of the relationship between y and the Xj, to assess which Xj may have no relationship with y at all, and to identify which subsets of the Xj contain redundant information about y.
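
Both uses can be sketched for the simple one-variable case. The following is a minimal illustration in plain Python, assuming an ordinary least squares fit; the helper names and the small data set are hypothetical and chosen only for the example.

```python
# Minimal sketch: simple linear regression fitted by ordinary least squares.
# The function names and the data below are hypothetical illustrations.

def fit_simple_regression(xs, ys):
    """Return (intercept, slope) of the least-squares line through the data."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def r_squared(xs, ys, intercept, slope):
    """Fraction of the variance in ys explained by the fitted line."""
    mean_y = sum(ys) / len(ys)
    ss_res = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1 - ss_res / ss_tot


xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]

intercept, slope = fit_simple_regression(xs, ys)

# Use 1 (prediction): estimate y for a new x that has no observed y.
prediction = intercept + slope * 6.0

# Use 2 (strength of relationship): how much of y's variation x accounts for.
strength = r_squared(xs, ys, intercept, slope)

print(intercept, slope, prediction, strength)
```

With several explanatory variables the same idea applies, but the fit is usually computed with a linear algebra routine rather than the closed-form sums used here.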

Linear regression models are often fitted using the least squares approach, but they may also be fitted in other ways, such as by minimizing the "lack of fit" in some other norm (as with least absolute deviations regression), or by minimizing a penalized version of the least squares loss function as in ridge regression. Conversely, the least squares approach can be used to fit models that are not linear models. Thus, while the terms "least squares" and "linear model" are closely linked, they are not synonymous.
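
To show that the same linear model can be fitted in more than one way, the sketch below compares an ordinary least squares fit with a ridge fit for a single explanatory variable. It assumes the common convention that the intercept is not penalized; the function name, the penalty value and the data are hypothetical.

```python
# Minimal sketch: one linear model, two fitting criteria.
# penalty = 0 gives ordinary least squares; penalty > 0 gives ridge regression
# (assumption: the intercept is left unpenalized). Names and data are hypothetical.

def fit_line(xs, ys, penalty=0.0):
    """Minimize sum((y - a - b*x)**2) + penalty * b**2 over (a, b)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / (sxx + penalty)       # the penalty shrinks the slope toward 0
    intercept = mean_y - slope * mean_x
    return intercept, slope


xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]

ols_intercept, ols_slope = fit_line(xs, ys)
ridge_intercept, ridge_slope = fit_line(xs, ys, penalty=10.0)

print("least squares:", ols_intercept, ols_slope)
print("ridge:        ", ridge_intercept, ridge_slope)
```

Both fits produce a line of the same form; only the criterion used to choose the coefficients differs, which is why "linear model" and "least squares" are not interchangeable terms.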
