03.05.2024 OLS Distribution and Tests#
Sampling Distribution of OLS-Estimator#
Assumption:
error \(\epsilon\) = independent of explanatory variables
normally distributed with mean = 0 and sd = \(\sigma^2\)
=> depends on empirical topic
e.g Wage regression
wage > 0 \(\neq\) normal distributed
minimum wage => population eran exaclty minimum
=> solve with log
Theorem
t-statistic#
Normal Model with CLM Assumptions
Distribution of Beta => follows T-distribution
Test the Null Hypothesis (\(H_0 : \beta_j \le 0\))
Example: hourly wage equation for n=526
\(H_0\): \(\beta_{exper} \le 0\) vs \(H_1: \beta_{exper} > 0\)
Test Statistic: \(t_{\beta_j} = \hat{ \beta_j } / se(\beta_j)\)
Rejection Rule:
calculate distribution value
use normal distribution, because df>120
choose significance level = 5%
\(N(0.95) = 1.645\)
Calculate \(t_{exper} = \frac{ 0.0041 }{0.0017}= 2.41\)
2.41 > 1.645 => \(H_1\) accepted
=> variable is different from 0 => good
Confidence Intervals#
= interval estimates
Example:
given: degrees of freedom = 25, Confidence = 95%
c = 2.06 (look here)
\([\hat{ \beta_j }-2.06 \cdot se(\hat{ \beta_j }),\hat{ \beta_j }+2.06 \cdot se(\hat{ \beta_j })]\)
Test Hypotheses: Multiple Parameters#
Test a single hypothesis with more than one \(\beta\)
Example: Returns to Education at junior colleges vs. four year colleges
Result:
Numerator: \(\hat{ \beta_1 }-\hat{ \beta_2 } = 0.0067-0.0769 = -0.0102\)
Denominator: fucking complicated?!!!!!
F-Test#
probably (hopefully) not relevant
to test mutliple hypothesis at the same time
Example: Salary of a Baseball Player
Hypothesis: after controlling for years and games/yr, the other performance statistics do not have any effect
Results in unrestricted Model:
Result in restricted Model
F-Test
Question: increase in SSR large enough to consider dropping?
F = always nonnegative
q = \(df_r-df_{ur}\)
\(n-k-1 = df_{ur}\)
Rejection Rule: c is taken from F distribution
also often used: F-Statistic to test all variables (confirm that they are different from 0)