统计建模与R软件（薛毅）第九章答案Word文档下载推荐.doc

资源描述

统计建模与R软件（薛毅）第九章答案Word文档下载推荐.doc

《统计建模与R软件（薛毅）第九章答案Word文档下载推荐.doc》由会员分享，可在线阅读，更多相关《统计建模与R软件（薛毅）第九章答案Word文档下载推荐.doc（12页珍藏版）》请在冰豆网上搜索。

统计建模与R软件（薛毅）第九章答案Word文档下载推荐.doc

第一主成分：

第二主成分：

第三主成分：

第四主成分：

表（26）各行业按主成分得分进行排序结果

图（21）主成分碎石图

图（22）第一主成分与第二主成分下的散点图

习题程序与结论：

industry<

-data.frame（

+X1=c（90342,4903,6735,49454,139190,12215,2372,11062,17111,1206,2150,5251,14341），

+X2=c（52455,1973,21139,36241,203505,16219,6572,23078,23907,3930,5704,6155,13203），

+X3=c（101091,2035,3767,81557,215898,10351,8103,54935,52108,6126,6200,10383,19396）,

+X4=c（19272,10313,1780,22504,10609,6382,12329,23804,21796,15586,10870,16875,14691）,

+X5=c（82.0,34.2,36.1,98.1,93.2,62.5,184.4,370.4,221.5,330.4,184.2,146.4,94.6）,

+X6=c（16.1,7.1,8.2,25.9,12.6,8.7,22.2,41.0,21.5,29.5,12.0,27.5,17.8）,

+X7=c（197435,592077,726396,348226,139572,145818,20921,65486,63806,1840,8913,78796,6354）,

+X8=c（0.172,0.003,0.003,0.985,0.628,0.066,0.152,0.263,0.276,0.437,0.274,0.151,1.574））

industry.pr<

-princomp（industry,cor=T）

summary（industry.pr）####做主成分分析，得到4个主成分，累积贡献率达94.68%

Importanceofcomponents:

Comp.1Comp.2Comp.3Comp.4Comp.5

Standarddeviation1.76207621.70218730.96447680.801325320.55143824

ProportionofVariance0.38811410.36218020.11627690.080265280.03801052

CumulativeProportion0.38811410.75029430.86657120.946836490.98484701

Comp.6Comp.7Comp.8

Standarddeviation0.294274970.1794000620.0494143207

ProportionofVariance0.010824720.0040230480.0003052219

CumulativeProportion0.995671730.9996947781.0000000000

load<

-loadings（industry.pr）####求出载荷矩阵

load

Loadings:

Comp.1Comp.2Comp.3Comp.4Comp.5Comp.6Comp.7Comp.8

X1-0.477-0.296-0.1040.1840.7580.245

X2-0.473-0.278-0.163-0.174-0.305-0.5180.527

X3-0.424-0.378-0.156-0.174-0.781

X40.213-0.4510.5160.5390.288-0.2490.220

X50.388-0.331-0.321-0.199-0.4500.5820.233

X60.352-0.403-0.1450.279-0.317-0.714

X7-0.2150.377-0.1400.758-0.4180.194

X8-0.2730.891-0.3220.122

Comp.1Comp.2Comp.3Comp.4Comp.5Comp.6Comp.7Comp.8

SSloadings1.0001.0001.0001.0001.0001.0001.0001.000

ProportionVar0.1250.1250.1250.1250.1250.1250.1250.125

CumulativeVar0.1250.2500.3750.5000.6250.7500.8751.000

plot（load[,1:

2]）

text（load[,1],load[,2],adj=c（-0.4,-0.3））

screeplot（industry.pr,npcs=4,type="

lines"

）####得出主成分的碎石图

biplot（industry.pr）####得出在第一，第二主成分之下的散点图

-predict（industry.pr）####预测数据，讲预测值放入p中

order（p[,1]）;

order（p[,2]）;

order（p[,3]）;

order（p[,4]）;

####将预测值分别以第一，第二，第三，第四主成分进行排序

[1]51324613119712108

[1]58491011312711623

[1]81539127102611413

[1]11657101312918324

kmeans（scale（p）,4）####将预测值进行标准化，并分为4类

K-meansclusteringwith4clustersofsizes5,1,4,3

Clustermeans:

Comp.1Comp.2Comp.3Comp.4Comp.5Comp.6

10.5132590-0.03438438-0.3405983-0.51300310.23551510.22441040

2-2.5699693-1.32913757-0.4848689-0.9460127-0.9000187-0.06497950

30.23815810.72871986-0.29959180.3126036-0.4744091-0.19709710

4-0.3163193-0.471273331.12874260.75353800.5400265-0.08956137

Comp.7Comp.8

1-0.38197798-0.7474855

2-0.675002090.4569548

30.090630690.9826915

40.74078975-0.2167643

Clusteringvector:

[1]4334211113134

Withinclustersumofsquaresbycluster:

[1]19.411370.0000024.4950416.61172

（between_SS/total_SS=37.0%）

Availablecomponents:

[1]"

cluster"

centers"

totss"

withinss"

tot.withinss"

[6]"

betweenss"

size"

9.2

####用数据框的形式输入数据

sale<

X1=c（82.9,88.0,99.9,105.3,117.7,131.0,148.2,161.8,174.2,184.7）,

X2=c（92,93,96,94,100,101,105,112,112,112）,

X3=c（17.1,21.3,25.1,29.0,34.0,40.0,44.0,49.0,51.0,53.0）,

X4=c（94,96,97,97,100,101,104,109,111,111）,

Y=c（8.4,9.6,10.4,11.4,12.2,14.2,15.8,17.9,19.6,20.8）

）

####作线性回归

lm.sol<

-lm（Y~X1+X2+X3+X4,data=sale）

summary（lm.sol）

显示结果

Call:

lm（formula=Y~X1+X2+X3+X4,data=sale）

Residuals:

1234567

0.0248030.0794760.012381-0.007025-0.2883450.216090-0.142085

8910

0.158360-0.1359640.082310

Coefficients:

EstimateStd.ErrortvaluePr（>

|t|）

（Intercept）-17.667685.94360-2.9730.03107*

展开阅读全文