数学建模作业实验6数理统计实验

合集下载

相关主题

1、下载文档前请自行甄别文档内容的完整性，平台不提供额外的编辑、内容补充、找答案等附加服务。
2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
3、如文档侵犯您的权益，请联系客服反馈,我们会尽快为您处理(人工客服工作时间：9:00-18:30)。

实验 6 数理统计实验 1. 区间估计解：（1）根据题意，用 R 软件求解 > X<-c(1067,919,1196,785,1126,936,918,1156,920,948) > t.test(X,al="g") 得到如下结果： One Sample t-test data: X t = 23.9693, df = 9, p-value = 9.148e-10 alternative hypothesis: true mean is greater than 0 95 percent confidence interval: 920.8443 Inf sample estimates: mean of x 997.1 由此知道这批灯泡中大约 95%的灯泡至少使用 920.8443 小时。（2） > x<-c(1067,919,1196,785,1126,936,918,1156,920,948) >x [1] 1067 919 1196 785 1126 936 918 1156 920 948 > pnorm(1000,mean(x),sd(x)) [1] 0.5087941 x<=1000 的概率为 0.5087941，故 x 大于 1000 的概率为 0.4912059 即这个星期生产出的灯泡能使用 1000h 以上的概率为 0.4912059。
4. 假设检验 III 解：根据题意提出假设：建立检验假设，确定检验水准：H0：p=p0=2% 即患病率相符；H1： p≠p0 即患病率不符；=0.05。使用 R 软件进行校验： > binom.test(400,10000,p=0.002) 得到如下结果： Exact binomial test data: 400 and 10000
2. 假设检验 I 解：根据题意，用 R 软件求解 > x<-c(220, 188, 162, 230, 145, 160, 238, 188, 247, 113, 126, 245, 164, 231, 256, 183, 19 0, 158, 224, 175) > t.test(x,mu=225,alternative="less") One Sample t-test data: x t = -3.4783, df = 19, p-value = 0.001258 alternative hypothesis: true mean is less than 225 95 percent confidence interval: -Inf 208.4806 sample estimates: mean of x
警告信息： In chisq.test(Y, p = p) : Chi-squared 近似算法有可能不准得到警告，因为 Pearson χ2 检验要求在分组后，至少要大于等于 5，而后两组中出现的顾客数是 1，0，均小于 5，重新分组，合并频数小于 5 的组： > Z<-c(92,68,28,12) > n<-length(Z); p<-p[1:n-1]; p[n]<-1-q[n-1] > chisq.test(Z,p=p) 得到如下结果： Chi-squared test for given probabilities data: Z X-squared = 0.9113, df = 3, p-value = 0.8227 分析结果， p-值为 0.8227>0.1，因此，接受原假设，即每分钟顾客数 X 服从 Poisson 分布。
8. 列联表检验 I 解：根据题意，利用 R 软件输入数据，使用 chisq.test()作检验。 > compare<-matrix(c(28,42,34,31,23,9,7,11,13,7,15,20), nr=2, dimnames=list(c("女性",
"男性"), c("配偶", "父母", "子女", "兄弟姐妹", "姻亲", "其他亲属"))) > chisq.test(compare, correct=TRUE) 得到如下结果： Pearson's Chi-squared test data: compare X-squared = 12.4666, df = 5, p-value = 0.02892 由于 p-值为 0.02892<0.05，因此拒绝原假设，认为女性和男性在关于给谁买节日礼物最难的看法上是有显著差异的。 9. 列联表检验 II 解：根据题意，其所给数据不满足χ 2 检验条件，固使用 Fisher 精确检验。 > x<-matrix(c(9,1,3,3), nc=2) > fisher.test(x) 得到如下结果： Fisher's Exact Test for Count Data data: x p-value = 0.1181 alternative hypothesis: true odds ratio is not equal to 1 95 percent confidence interval: 0.4313171 521.0928115 sample estimates: odds ratio 7.63506 由此计算出的 p-值=0.1181>0.05,并且区间估计得到的区间包含有 1，因此说明两个变量是独立的，即认为左右半球恶性肿瘤的发病率并无显著差异。
number of successes = 400, number of trials = 10000, p-value < 2.2e-16 alternative hypothesis: true probability of success is not equal to 0.002 95 percent confidence interval: 0.03624378 0.04402702 sample estimates: probability of success 0.04 检验出 P-值<0.05，因此不符合原假设，即这组数据不能说明乳腺癌的患病率与家族遗传有关。 5. 分布检验 I 解：根据题意提出假设：建立检验假设，确定检验水准：H0：结果符合自由组合规律；H1：结果不符合自由组合规律；=0.05。使用 R 软件进行校验，利用 pearson 卡方检验是否符合特定分布： > chisq.test(c(315,101,108,32),p=c(9,3,3,1)/16) 得到如下结果： Chi-squared test for given probabilities data: c(315, 101, 108, 32) X-squared = 0.47, df = 3, p-value = 0.9254 分析结果结果 p-值为 0.9254>0.05，所以接受原假设，即此结果符合自由组合规律。
6. 分布检验 II 解：根据题意提出假设：建立检验假设，确定检验水准：H0：每分钟顾客数 X 服从 Poisson 分布；H1：每分钟顾客数 X 不服从 Poisson 分布；=0.1。使用 R 软件进行校验：首先利用 pearson 卡方检验是否符合泊松分布： > X<-0:5;Y<-c(92,68,28,11,1,0) > q<-ppois(X, mean(rep(X,Y))); > n<-length(Y) > p<-numeric(n); > p[1]<-q[1]; > p[n]<-1-q[n-1]; > for (i in 2:(n-1)) + p[i]<-q[i]-q[i-1] > chisq.test(Y,p=p) 得到如下结果： Chi-squared test for given probabilities data: Y X-squared = 2.1596, df = 5, p-value = 0.8267
10 1
件进行检验： x<-c(6,2,8,6,1,11,10,3,4,6) ks.test(x,"pexp",0.1) 得到如下结果： One-sample Kolmogorov-Smirnov test data: x D = 0.3329, p-value = 0.2178 alternative hypothesis: two-sided 因此 P-值为 0.2178>0.05，因此接受原假设，即打进电话的时间间隔是否服从指数分布。
因此试验组与对照组的没有显著差异。进行成对 t 检验： > t.test(x,y,paired=TRUE) 得到如下结果： Paired t-test data: x and y t = -0.6464, df = 19, p-value = 0.5257 alternative hypothesis: true difference in means is not equal to 0 95 percent confidence interval: -2.373146 1.253146 sample estimates: mean of the differences -0.56 即试验组与对照组的结果也没有显著差异。故三中检验的结果都显示两组数据均值无差异。对比三种检验方式，如果两个样本是成对的，应该使用成对的 t 检验，如果不使用成对 t 检验，t 值会变小，p 值会变大，准确性差了很多。（2）方差检验： > var.test(x,y) 得到如下结果： F test to compare two variances data: x and y F = 1.5984, num df = 19, denom df = 19, p-value = 0.3153 alternative hypothesis: true ratio of variances is not equal to 1 95 percent confidence interval: 0.6326505 4.0381795 sample estimates: ratio of variances 1.598361 故两组数据方差相同。
7. 分布检验 III 解：根据打进的电话时间算出时间间隔： 1 :0 0 :0 6 6 1 :0 8 2 1 :1 6 8 1 :2 2 6 1 :2 3 1 1 1 :3 4 1 0 1 :4 4 1 1 :4 7 3 1 :5 1 4 1 :5 7 6 1
建立检验假设，确定检验水准：H0：打进电话的时间间隔服从指数分布；H1：打进电话的时间间隔不服从指数分布；=0.05。假设指数分布的参数 λ 为 =0.1，利用 R 软
192.15 油漆工人的血小板计数与正常成年男子有差异。 p 值为 0.002516 小于 0.05，原假设不成立，因此认为油漆作业对男子血小板含量有影响。
ຫໍສະໝຸດ Baidu
3. 假设检验 II 解：根据题意：建立检验假设，确定检验水准：H0：1=2 即阿卡波糖胶囊组与拜糖平胶囊组空腹血糖下降值总体均数相等；H1：1≠2 即阿卡波糖胶囊组与拜糖平胶囊组空腹血糖下降值总体均数不相等；=0.05。使用 t 检验，若两组数据方差相同时，编辑 R 软件程序如下： >x<-c(-0.70,-5.60,2.00,2.80,0.70,3.50,4.00,5.80,7.10,-0.50,2.50,-1.60,1.70,3.00,0.40,4.50,4. 60,2.50,6.00,-1.40) >y<-c(3.70,6.50,5.00,5.20,0.80,0.20,0.60,3.40,6.60,-1.10,6.00,3.80,2.00,1.60,2.00,2.20,1.20 ,3.10,1.70,-2.00) > t.test(x,y,var.equal=TRUE) 得到如下结果： Two Sample t-test data: x and y t = -0.6419, df = 38, p-value = 0.5248 alternative hypothesis: true difference in means is not equal to 0 95 percent confidence interval: -2.326179 1.206179 sample estimates: mean of x mean of y 2.065 2.625 分析结果，p-value=0.5248>0.05，所以接受原假设 H0，即试验组与对照组没有显著差异。根据题意，若两组数据方差不同时，利用 R 软件进行 t 检验： > t.test(x,y) 得到如下结果 Welch Two Sample t-test data: x and y t = -0.6419, df = 36.086, p-value = 0.525 alternative hypothesis: true difference in means is not equal to 0 95 percent confidence interval: -2.32926 1.20926 sample estimates: mean of x mean of y 2.065 2.625

数学建模作业 实验6数理统计实验

数学建模作业实验6数理统计实验