n种统计方法的应用条件

合集下载
  1. 1、下载文档前请自行甄别文档内容的完整性,平台不提供额外的编辑、内容补充、找答案等附加服务。
  2. 2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
  3. 3、如文档侵犯您的权益,请联系客服反馈,我们会尽快为您处理(人工客服工作时间:9:00-18:30)。

一parameter test

numerical data

1.Z:independence,normality,homogeneity

used in variance of population is known,or sample size is large(n>50,n1+n2>60)

2.t:independence,normality,homogeneity(INH)

used in variance of population is unknown,or sample sizw is small

one sample

two independent sample

paired sample

(df=n-1)

3.ANOV A(F test):INH

used in comparison of more than two means of groups.

pletely randomized design(df of error =N-g)

b.randomized block design(df of error=(g-1)(k-1))

tin square design

(b and c can control confounding factors)

further:want to know which pairwise groups have significant difference

a.SNKq unplaned comparison

b.Dunnett t:planed comparison

c.LSDt:compare choosen two groups

categorical data

chi square test:usd in

a.how closely an observed distribution matches expected distribution(goodness of fit)

b.whether two variables are independent

c.whether the difference of proportion is significant

1.fourfold table df=1

n≥40,E≥5:pearson chi square test

1≤E<5:Yate's continuity correction

n<40:fisher's exact test

2.paired fourfold table

b+c<40:continuity correction

used in:to same objects,different methods

3.R*C table :df=(r-1)(c-1)

cell numbers of E<5 more than 1/5,use fisher's exact test

phi coefficient ,cramer's V,contigency coefficient are the measures of association for two categorical variables

二.non parameter test

1.wilcoxon signed rank test :used in paired sample and one sample ,when they do not satisfy the condition of parameter test.(statistics:T-,T+任选)

当n大于50无法查表时,T~N(n(n+1)/4,n(n+1)(2n+1)/24)

且当n不足够大时,需要continuity correction

当秩次重复超过1/5时,Zc

2.wilcoxon test for two independent sample:混合编秩,分别求和(T1,T2,选n较小者的T)

3.rank sum test for ordinal data.对于单向有序,选用行列表检验时无法得出疗效差别时

4.K-W test for :the number of groups more than two(statistics:H)

三.regression

1.simple linear regression :used in analysis of the influence of explanatory variable(independent variable) to the outcome variable(dependent variable)

a.X,Y are numerical data

b.X,Y have linear relationship

c.Y are nomal distribution for each given X

d.observations are independence

e.equal variance

(line)

几个概念:

linear regression model

coefficience of determinationR^2

residual

residual standard deviation

residual plot(residual analysis)

2.multiple linear regression:对Y的条件和simple linear regression一样,X可以是numerical data,binary data,ordinal data

几个概念:

opimum subset regression

stepwise regression

adjust R^2

dummy variable

3.logistic regression:Y is binary data,X is numerical,binary,ordinal ed to find the etiology of diseases,because the coefficient of logistic regression β has relationship with OR 概念:

maximum likelihood function

四correlation

1.linear correlation:pearson correlation analysis

要求XY服从双变量正态分布(binariate normal distribution X,Y,e~N),才能用tr进行假设检验估计总体相关系数(且tr=tb,df=n-2)

2.rank correlation:spearman correlation

XY不服从双变量正太分布时

3.association of two categorical variables

phi coefficient

cramer v

continuity coefficient

相关文档
最新文档