利用STATA创建空间权重矩阵及空间杜宾模型计算----命令

合集下载

1、下载文档前请自行甄别文档内容的完整性，平台不提供额外的编辑、内容补充、找答案等附加服务。
2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
3、如文档侵犯您的权益，请联系客服反馈,我们会尽快为您处理(人工客服工作时间：9:00-18:30)。

利用STATA创建空间权重矩阵及空间杜宾模型计算----命令
** 创建空间权重矩阵介绍
*设置默认路径
cd C:\Users\xiubo\Desktop\F182013.v4\F101994\sheng
**创建新文件
*shp2dta:reads a shape (.shp) and dbase (.dbf) file from disk and converts them into Stata datasets.
*shp2dta:读取CHN_adm1文件
*CHN_adm1：为已有的地图文件
*database (chinaprovince)：表示创建一个名称为“chinaprovince”的dBase数据集
*database(filename)：Specifies filename of new dBase dataset
*coordinates(coord)：创建一个名称为“coord”的坐标系数据集
*coordinates(filename)：Specifies filename of new coordinates dataset
*gencentroids(stub)：Creates centroid variables
*genid(newvarname)：Creates unique id variable for database.dta
shp2dta using CHN_adm1,database (chinaprovince) coordinates(coord) genid(id) gencentroids(c)
**绘制2016年中國GDP分布圖
*spmap:Visualization of spatial data
*clnumber(#):number of classes
*id(idvar):base map polygon identifier(识别符，声明变量名，一般以字母或下划线开头，包含数字、字母、下划线)
*_2016GDP：变量
*coord:之前创建的坐标系数据集
spmap _2016GDP using coord, id(id) clnumber(5)
*更改变量名
rename x_c longitude
rename y_c latitude
**生成距离矩阵
*spmat:用于定义与管理空间权重矩阵
*Spatial-weighting matrices are stored in spatial-weighting matrix objects (spmat objects).
*spmat objects contain additional information about the data used in constructing spatial-weighting matrices.
*spmat objects are used in fitting spatial models; see spreg (if installed) and spivreg (if installed).
*idistance:(产生距离矩阵)create an spmat object containing an inverse-distance matrix W
*或contiguity:create an spmat object containing a contiguity matrix W
*idistance_jingdu:命名名称为“idistance_jingdu”的距離矩陣*longitude:使用经度
*latitude:使用纬度
*id(id):使用id
*dfunction(function[, miles]):(设置计算距离方法)specify the distance function.
*function may be one of euclidean (default), dhaversine, rhaversine, or the Minkowski distance of order p, where p is an integer greater than or equal to 1.
*normalize(row):（行标准化）specifies one of the three available normalization techniques: row, minmax, and spectral.
*In a row-normalized matrix, each element in row i is divided by the sum of row i's elements.
*In a minmax-normalized matrix, each element is divided by
the minimum of the largest row sum and column sum of the matrix.
*In a spectral-normalized matrix, each element is divided by the modulus of the largest eigenvalue of the matrix.
spmat idistance idistance_jingdu longitude latitude, id(id) dfunction(euclidean) normalize(row)
**保存stata可读文件idistance_jingdu.spmat
spmat save idistance_jingdu using idistance_jingdu.spmat
**将刚刚保存的idistance_jingdu.spmat文件转化为txt文件
spmat export idistance_jingdu using idistance_jingdu.txt
**生成相邻矩阵
spmat contiguity contiguity_jingdu using coord, id(id) normalize(row)
spmat save contiguity_jingdu using contiguity_jingdu.spmat spmat export contiguity_jingdu using contiguity_jingdu.txt **计算Moran’s I
*安装spatwmat
*spatwmat:用于定义空间权重矩阵
*spatwmat:imports or generates the spatial weights matrices required by spatgsa, spatlsa, spatdiag, and spatreg.
*As an option, spatwmat also generates the eigenvalues matrix required by spatreg.
*name(W):读取空间权重矩阵W
*name(W):使用生成的空间权重矩阵W
*xcoord:x坐标
*ycoord:y坐标
*band(0 8):宽窗介绍
*band(numlist) is required if option using filename is not specified.
*It specifies the lower and upper bounds of the distance
band within which location pairs must be considered "neighbors"
(i.e., spatially contiguous)
*and, therefore, assigned a nonzero spatial weight.
*binary:requests that a binary weights matrix be generated. To this aim, all nonzero spatial weights are set to 1.
spatwmat, name(W) xcoord(longitude) ycoord(latitude) band(0 8)
*安装绘制Mora n’s I工具:splagvar
*splagvar --- Generates spatially lagged variables, constructs the Moran scatter plot,
*and calculates global Moran's I statistics.
*_2016GDP：使用变量_2016GDP
*wname(W):使用空间权重矩阵W
*indicate the name of the spatial weights matrix to be used *wfrom(Stata):indicate source of the spatial weights matrix *wfrom(Stata | Mata) indicates whether the spatial weights matrix is a Stata matrix loaded in memory or a Mata file located in the working directory.
*If the spatial weights matrix had been created using spwmatrix it should exist as a Stata matrix or as a Mata file.
*moran(_2016GDP):计算变量_2016GDP的Moran's I值
*plot(_2016GDP):构建变量_2016GDPMoran散点图
splagvar _2016GDP, wname(W) wfrom(Stata) moran(_2016GDP) plot(_2016GDP)
====================================== ========================================= **使用距离矩阵计算空间计量模型
*设置默认路径
cd D:\软件学习软件资料\stata\stata指导书籍命令\陈强高级计量经济学及stata应用（第二版）全部数据
*使用product.dta数据集（陈强的高级计量经济学及其stata应用P594）
*将数据集product.dta存入当前工作路径
use product.dta , clear
*创建新变量，对原有部分变量取对数
gen lngsp=log(gsp)
gen lnpcap=log(pcap)
gen lnpc=log(pc)
gen lnemp=log(emp)
*将空间权重矩阵usaww.spat存入当前工作路径
spmat use usaww using usaww.spmat
*使用聚类稳健的标准误估计随机效应的SDM模型
xsmle lngsp lnpcap lnpc lnemp unemp,wmat(usaww) model(sdm)robust nolog
*使用选择项durbin(lnemp)，不选择不显著的变量，使用聚类稳健的标准误估计随机效应的SDM模型
xsmle lngsp lnpcap lnpc lnemp unemp,wmat(usaww) model(sdm) durbin(lnemp) robust nolog noeffects
*使用选择项durbin(lnemp)，不选择不显著的变量，使用聚类稳健的标准误估计固定效应的SDM模型
xsmle lngsp lnpcap lnpc lnemp unemp,wmat(usaww) model(sdm) durbin(lnemp) robust nolog noeffects fe *存储随机效应和固定效应结果
qui xsmle lngsp lnpcap lnpc lnemp unemp,wmat(usaww) model(sdm) durbin(lnemp) r2 nolog noeffects re
est sto re
qui xsmle lngsp lnpcap lnpc lnemp unemp,wmat(usaww) model(sdm) durbin(lnemp) r2 nolog noeffects fe
est sto fe
*esttab:将保存的结果汇总到一张表格中
*b(fmt):specify format for point estimates
*beta[(fmt)]:display beta coefficients instead of point est's
*se[(fmt)]:display standard errors instead of t statistics
*star( * 0.1 ** 0.05 *** 0.01):标记不同显著性水平对应的P值
*r2|ar2|pr2[(fmt)]:display (adjusted, pseudo) R-squared
*p[(fmt)]:display p-values instead of t statistics
*label:make use of variable labels
*title(string):specify a title for the table
esttab fe re , b se r2 star( * 0.1 ** 0.05 *** 0.01)
*hausman检验
*进行hausman检验前，回归中没有使用稳健标准误(没用“r”), *是因为传统的豪斯曼检验建立在同方差的前提下
*constant:include estimated intercepts in comparison; default is to exclude
*df(#):use # degrees of freedom
*sigmamore:base both (co)variance matrices on disturbance variance estimate from efficient estimator
*sigmaless:base both (co)variance matrices on disturbance variance estimate from consistent estimator
hausman fe re
**有时我们还会得到负的chi2值,即chi2<0，表明模型不能满足Hausman检验的渐近假设。

产生这些情况的原因可能有多种，*但我认为一个主要的原因是我们的模型设定有问题，导致Hausman 检验的基本假设得不到满足。

*这时，我们最好先对模型的设定进行分析，看看是否有遗漏变量的问题，或者某些变量是非平稳的等等。

*在确定模型的设定没有问题的情况下再进行Hausman 检验，如果仍然拒绝原假设或是出现上面的问题，
*那么我们就认为随机效应模型的基本假设（个体效应与解释变量不相关）得不到满足。

*此时，需要采用工具变量法或是使用固定效应
模型。

*连玉君（论文（2014）：Hausman检验统计量有效性的Monte Carlo模拟分析）
*研究了hausman检验统计量的小样本性质，结果表明，
*内生性问题（解释变量与个体效应相关）是导致hausman统计量出现负值的主要原因，*进一步分析表明，修正后的hausman统计量，以及过度识别检验方法能够很好地克服上述缺陷，
*且具有很好的有限样本性质。

*陈强（高级计量经济学及其stata应用P153）介绍工具变量法与豪斯曼的stata命令及实例
qui reg lw iq s expr tenure rns smsa
est sto ols
qui ivregress 2sls lw s expr tenure rns smsa (iq=med kww) est sto iv
hausman iv ols, constant sigmamore
*由于传统的豪斯曼检验在异方差的情形下不成立，故进行异方差稳健的DWH检验，estat endogenous
*使用ivreg2检验选择的工具变量是否为内生解释变量
ivreg2 lw s expr tenure rns smsa (iq=med kww),r endog(iq) *endog(iq)表示检验变量iq是否为内生变量
*若果存在异方差，则GMM比2SLS更有效率，故进行最优GMM 估计
ivregress gmm lw s expr tenure rns smsa (iq=med kww)
*进行过度识别检验
estat overid
*若P值不显著，则认为所有工具变量均为外生。

*接下来考虑迭代GMM
ivregress gmm lw s expr tenure rns smsa (iq=med kww),igmm
*(迭代GMM与两步GMM的系数估计值相差无几)。