Econometric Theory/Assumptions of Classical Linear Regression Model

From Wikibooks, open books for an open world
< Econometric Theory
Jump to: navigation, search

The estimators that we create through linear regression give us a relationship between the variables. However, performing a regression does not automatically give us a reliable relationship between the variables. In order to create reliable relationships, we must know the properties of the estimators  \hat{\alpha}, \hat{\beta} and show that some basic assumptions about the data are true.one must understand that having a good dataset is of enormous importance for applied economic research.

Contents

[edit] Unbiasedness

Under the following four assumptions, OLS is unbiased. This means that:
E(\hat \alpha) = \alpha
E(\hat \beta) = \beta

[edit] Linearity

The model must be linear in the parameters.
The parameters are the coefficients on the independent variables, like α and β. These must be linear, so having β2 or eβ would violate this assumption.

[edit] Sample Variation

The xis cannot all have the same value.

[edit] Random Sampling

The xi values must be randomly selected. In other words, there is no correlation between two different x values: Cov(xi,xj) = 0 for i \neq j.

[edit] Zero Conditional Mean

The mean of the error terms, given a specific value of the independent variable xi, is zero. E(\epsilon_i | X_i) = 0.

[edit] Efficiency of OLS

Given the following two assumptions, OLS is the Best Linear Unbiased Estimator (BLUE). This means that out of all possible linear unbiased estimators, OLS gives the precise estimates of α and β.

With the third assumption, OLS is the Best Unbiased Estimator (BUE), so it even beats non-linear estimators. Also given this assumption, \hat \alpha is distributed according to the Student's t-distribution about α, and \hat \beta is distributed in such a way about β.

[edit] No Heteroskedasticity

The variance of the Error terms are constant. Var(ui | xi) = σ2. This means that the variance of the error term ui does not depend on the value of xi. If this is the case, the error terms are called homoskedastic. This is not always the case in economic data, for example the variation in a person's wage will vary with their level of education -- someone who is a high-school dropout will not have much variation in their wage, where people with Ph.D.s may see very different wages.

[edit] No Serial Correlation

The error terms are independently distributed so that their covariance is 0. Cov(u_i, u_j | x_i, x_j) = 0 \forall i \ne j.

[edit] Normally Distributed Errors

The error terms are normally distributed. u_i ~ N(0,\sigma^2)

Personal tools
Namespaces
Variants
Actions
Navigation
Community
Toolbox
Sister projects
Print/export