斯托克、沃森着《计量经济学》第八章
- 1、下载文档前请自行甄别文档内容的完整性,平台不提供额外的编辑、内容补充、找答案等附加服务。
- 2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
- 3、如文档侵犯您的权益,请联系客服反馈,我们会尽快为您处理(人工客服工作时间:9:00-18:30)。
Chapter 8. Nonlinear Regression Functions
8.1 A General Strategy for Modeling Nonlinear Regression Functions
•Everything so far has been linear in the X’s
•The approximation that the regression function is linear
might be good for some variables, but not for others.
•The multiple regression framework can be extended to handle regression functions that are nonlinear in one or more X.
The TestScore – STR relation looks approximately linear…
But the TestScore – average district income relation looks like it is nonlinear.
If a relation between Y and X is nonlinear:
•The effect on Y of a change in X depends on the value of X – that is, the marginal effect of X is not constant
•A linear regression is mis-specified – the functional form is wrong
•The estimator of the effect on Y of X is biased – it needn’t even be right on average. 遗漏高次项会带来遗漏变量偏
差。例如:
()
2
012
Y X X u
βββ
=+++,显然X与2
X相关。
•The solution to this is to estimate a regression function that is nonlinear in X
The General Nonlinear Population Regression Function
Y i = f (X1i, X2i,…, X ki) + u i, i = 1,…, n Assumptions
1.E(u i| X1i, X2i,…, X ki) = 0 (same); implies that f is the
conditional expectation of Y given the X’s.
2.(X1i, …, X ki, Y i) are i.i.d. (same).
3.“enough” moments exist (same idea; the precise statement
depends on specific f ).
4.No perfect multicollinearity (same idea; the precise
statement depends on the specific f ).
8.2 Nonlinear Functions of a Single Independent Variable
We’ll look at two complementary approaches:
1. Polynomials in X
The population regression function is approximated by a quadratic, cubic, or higher-degree polynomial
2. Logarithmic transformations
•Y and/or X is transformed by taking its logarithm
• This gives a “percentages” interpretation that makes sense in many applications
1. Polynomials in X
Approximate the population regression function by a polynomial:
Y i = β0 + β1X i + β22i X +…+ βr r
i X + u i
•This is just the linear multiple regression model – except that the regressors are powers of X!
•Estimation, hypothesis testing, etc. proceeds as in the multiple regression model using OLS
•The coefficients are difficult to interpret, but the regression function itself is interpretable
Example: The TestScore – Income relation
Income i = average district income in the i th district
(thousand dollars per capita)
Quadratic specification:
TestScore i = β0 + β1Income i + β2(Income i)2 + u i