用R语言做非参数和半参数回归笔记docx文档格式.docx

资源描述

用R语言做非参数和半参数回归笔记docx文档格式.docx

《用R语言做非参数和半参数回归笔记docx文档格式.docx》由会员分享，可在线阅读，更多相关《用R语言做非参数和半参数回归笔记docx文档格式.docx（18页珍藏版）》请在冰豆网上搜索。

用R语言做非参数和半参数回归笔记docx文档格式.docx

6、高铁梅（2009）.计量经济分析方法与建模：

EVIEWS应用及实例（第二版）.清华大学出版社.（P127/143）

7、李雪松（2008）.高级计量经济学.中国社会科学出版社.（P45ch3）

8、陈强（2010）.高级计量经济学及Stata应用.高教出版社.（ch23/24）

【其他参看原ppt第一章】

二、内容简介

方法：

——移动平均（movingaverage）

——核光滑（Kernelsmoothing）

——K近邻光滑（K-NN）

——局部多项式回归（LocalPolynormal）

——LoesssandLowess

——样条光滑（SmoothingSpline）

——B-spline

——FriedmanSupersmoother

模型：

——非参数密度估计

——非参数回归模型

——时间序列的半参数模型

——Paneldata的半参数模型

——QuantileRegression

三、不同的模型形式

1、线性模型linearmodels

2、Nonlinearinvariables

3、Nonlinearinparameters

四、数据转换Powertransformation（对参数方法）

IntheGLMframework,modelsareequallyprone（倾向于）tosomemisspecification（不规范）fromanincorrectfunctionalform.

Itwouldbeprudent（谨慎的）totestthattheeffectofanyindependentvariableofamodeldoesnothaveanonlineareffect.Ifitdoeshaveanonlineareffect,analystsinthesocialscienceusuallyrelyonPowerTransformationstoaddressnonlinearity.

[ADD:

检验方法见SanfordWeisberg.AppliedLinearRegression（ThirdEdition）.AJohnWiley&

Sons,Inc.,Publication.（本科的应用回归分析课教材）]

----------------------------------------------------------------------------

第二章

NonparametricDensityEstimation

非参数密度估计

一、三种方法

1、直方图Hiatogram

2、Kerneldensityestimate

3、Knearest-neighborsestimate

二、Histogram对直方图的一个数值解释

Supposex1,…xN–f（x）,thedensityfunctionf（x）isunknown.

Onecanusethefollowingfunctiontoestimatef（x）

【与x的距离小于h的所有点的个数】

三、Kerneldensityestimate

Bandwidth:

Windowwidth:

2h.

1、Kernelfunction的条件

ThekernelfunctionK（.）isacontinuousfunction,symmetric（对称的）aroundzero,thatintegrates（积分）tounityandsatisfiesadditionalboundedconditions:

（1）K（）issymmetricaround0andiscontinuous;

（2）

（3）Either

（a）K（z）=0if|z|>

=z0forz0

（b）|z|K（z）à

0as

;

（4）

where

isaconstant.

2、主要函数形式

3、置信区间

其中,

4、窗宽的选择

实际应用中,。

其中,s是样本标准差,iqr是样本分位数级差（interquartilerange）

四、Knearest-neighborsestimate

五、R语言部分

da<

-read.table（"

PSID.txt"

header=TRUE）

lhwage<

-da$lhwage

#***bandwidth相等,核函数不同***

den1<

density（lhwage,bw=0.45,kernel="

epan"

）

den2<

-density（lhwage,bw=0.45,kernel="

gauss"

den3<

biwe"

den4<

rect"

plot（den4,lty=4,main="

xlab="

LogHourlyWage"

ylab="

Kerneldensityestimates"

lines（den3,lty=3,col="

red"

lines（den2,lty=2,col="

green"

lines（den1,lty=1,col="

blue"

#***bandwidth不相等,核函数也不同***

den5<

-density（lhwage,bw=0.545,kernel="

den6<

-density（lhwage,bw=0.246,kernel="

den7<

-density（lhwage,bw=0.646,kernel="

den8<

-density（lhwage,bw=0.214,kernel="

plot（den8,lty=4,main="

lines（den7,lty=3,col="

lines（den6,lty=2,col="

lines（den5,lty=1,col="

----------------------------------------------------------------------------

第三章

smoothingandlocalregression

一、简单光滑估计法

SimpleSmoothing

1、LocalAveraging局部均值

按照x排序,将样本分成若干部分（intervalsor“bins”）；

将每部分x对应的y值的均值作为f（x）的估计。

三种不同方法：

（1）相同的宽度（equalwidthbins）：

uniformlydistributed.

（2）相同的观察值个数（equalno.ofobservationsbins）：

k-nearestneighbor.

（3）移动平均（movingaverage）

K-NN：

等窗宽：

2、kernelsmoothing核光滑

二、局部多项式估计LocalPolynomialRegression

1、主要结构

局部多项式估计是核光滑的扩展,也是基于局部加权均值构造。

——localconstantregression

——locallinearregression

——lowess（Cleveland,1979）

——loess（Cleveland,1988）

【本部分可参考：

Takezana（2006）.IntroductiontoNonparametricRegression.（P1853.7andP1953.9）

ChambersandHastie（1993）.StatisticalmodelsinS.（P312ch8）】

2、方法思路

（1）对于每个xi,以该点为中心,按照预定宽度构造一个区间；

（2）在每个结点区域内,采用加权最小二乘法（WLS）估计其参数,并用得到的模型估计该结点对应的x值对应y值,作为y|xi的估计值（只要这一个点的估计值）；

（3）估计下一个点xj；

（4）将每个y|xi的估计值连接起来。

【R操作

library（KernSmooth）#函数locpoly（）

library（locpol）#locpol（）;

locCteSmootherC（）

library（locfit）#locfit（）

#weightfunciton:

kernel=”tcub”.And“rect”,“trwt”,“tria”,“epan”,“bisq”,“gauss”

】

3、每个方法对应的估计形式

（1）变量个数p=0,localconstantregression（kernelsmoothing）

min

（2）变量个数p=1,locallinearregression

（3）Lowess（LocalWeightedscatterplotsmoothing）

p=1:

【还有个加权修正的过程,这里略,详见原书或者PPT】

（4）Loess（Localregression）

p=1,2

展开阅读全文