Statistic, Statistical Hypothesis Test(ing), Test Statistic, t-test and p-value

September 20, 2014 · 2 minute read

Summarized from:

1. Statistic

1.1 Definition

A statistic is a single measure of some attribute of a sample (e.g. the sample mean). It is calculated by applying a function to the values of the sample.

More formally, statistical theory defines a statistic as a function of a sample where the function itself is independent of the sample’s distribution; that is, the function can be stated before realization of the data. The term statistic is used both for the function and for the value of the function on a given sample.

A statistic is distinct from a statistical parameter, which usually cannot be computed because the population is often too large to examine and measure every item.

A statistic is an observable random variable, computed on a sample.
A parameter is a generally unobservable quantity describing a property of a statistical population, which can only be computed exactly if the entire population can be observed without error.

When a statistic is used to estimate a population parameter, it is called an estimator. For instance, the sample mean is a statistic that estimates the population mean, which is a parameter.
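The distinction between a parameter and a statistic can be illustrated with a small sketch. The population here is simulated (a hypothetical normal population with mean 50 and standard deviation 10, chosen only for illustration), which is the only reason its mean can be computed exactly; `sample_mean` is an illustrative helper name.

```python
import random
import statistics


def sample_mean(sample):
    """A statistic: a function that depends only on the sample values."""
    return sum(sample) / len(sample)


random.seed(0)

# Hypothetical population; in practice it is usually too large to observe.
population = [random.gauss(50, 10) for _ in range(100_000)]

# The parameter: computable here only because we built the whole population.
population_mean = statistics.fmean(population)

# The statistic, used as an estimator of the population mean.
sample = random.sample(population, 30)
estimate = sample_mean(sample)

print(f"parameter (population mean): {population_mean:.2f}")
print(f"statistic (sample mean):     {estimate:.2f}")
```

Note that `sample_mean` is defined before any data is drawn, which matches the point that the function can be stated before the data is realized.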

1.2 Types

When a statistic (a function) is being used for a specific purpose, it may be referred to by a name indicating its purpose:

in descriptive statistics, a descriptive statistic is used to describe the data;
in estimation theory, an estimator is used to estimate a parameter of the distribution (population);
in statistical hypothesis testing, a test statistic is used to test a hypothesis, e.g.
- t statistics
- chi-squared statistics
- F statistics

1.3 Statistical Properties

Important potential properties of statistics include:

completeness
consistency
sufficiency
unbiasedness
minimum mean square error
low variance
robustness
computational convenience

2. Statistical Hypothesis Test(ing)

A statistical hypothesis test is a method of statistical inference. In statistics, a result is called statistically significant if it has been predicted as unlikely to have occurred by chance alone, according to a pre-determined threshold probability, the significance level.

Statistical hypothesis testing is sometimes called confirmatory data analysis, in contrast to exploratory data analysis (EDA), which may not have pre-specified hypotheses.

Put simply, statistical hypothesis testing refers to the following procedure:

state $H_{0}$ and $H_{a}$
construct a test statistic
compute whether the hypothesis should be rejected

3. Test Statistic

A test statistic is a statistic used in statistical hypothesis testing.

4. t-test

A t-test is a statistical hypothesis test in which the test statistic follows a Student’s t distribution if the null hypothesis is true.
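As a concrete instance, here is a minimal sketch of the test statistic for a one-sample t-test, $t = (\bar{x} - \mu_0) / (s / \sqrt{n})$. The post does not say which kind of t-test it has in mind, so the one-sample form is an assumption, and the function name `one_sample_t` and the data are hypothetical.

```python
import math
import statistics


def one_sample_t(sample, mu0):
    """t statistic for H0: population mean == mu0 (one-sample t-test)."""
    n = len(sample)
    mean = statistics.fmean(sample)
    sd = statistics.stdev(sample)  # sample standard deviation (n - 1 denominator)
    return (mean - mu0) / (sd / math.sqrt(n))


# Hypothetical measurements, made up for illustration.
sample = [5.1, 4.9, 5.6, 5.8, 6.0, 5.3, 5.5, 5.7]
t_value = one_sample_t(sample, mu0=5.0)
print(f"t-value = {t_value:.3f}, degrees of freedom = {len(sample) - 1}")
```

If the data are normally distributed and $H_{0}$ is true, this statistic follows a Student’s t distribution with $n - 1$ degrees of freedom.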

5. p-value

Take the t-test as an example.

When using a t-test, we assume $H_{0}$ is true and use a t-statistic that follows a Student’s t distribution. We have a sample in hand, so we use it to compute the concrete value of this t-statistic, which is called the t-value.

The p-value then answers this question: if my null hypothesis were true, what is the probability of getting a t-value at least as extreme as mine?

That is, for a two-sided test, $p\text{-value} = P(|T| \geq |t| \mid H_{0} \text{ is true})$, where $T$ is the t-statistic and $t$ is the observed t-value. The lower this value is, the less likely it is that you would find a difference like yours by chance.

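In terms of quantiles: the smaller the p-value, the further into the tail of the distribution the t-value lies. This means that, if $H_{0}$ is true, a t-value like the one computed from this sample is less likely to occur, so we can be more confident in rejecting $H_{0}$.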
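The definition of the p-value can be checked directly by simulation: generate many samples for which $H_{0}$ really is true, compute the t-value of each, and count how often it lands at least as far in the tails as the observed one. The sketch below assumes a two-sided one-sample t-test on normally distributed data; the function names and the observed data are hypothetical.

```python
import math
import random


def one_sample_t(sample, mu0):
    """t statistic for H0: population mean == mu0."""
    n = len(sample)
    mean = sum(sample) / n
    var = sum((x - mean) ** 2 for x in sample) / (n - 1)  # sample variance
    return (mean - mu0) / math.sqrt(var / n)


def simulated_p_value(t_value, n, trials=20_000, seed=0):
    """Estimate P(|T| >= |t_value|) when H0 is true, by Monte Carlo."""
    rng = random.Random(seed)
    extreme = 0
    for _ in range(trials):
        # Under H0 the true mean equals mu0; standard normal data with
        # mu0 = 0 is enough, since the t statistic is location/scale free.
        null_sample = [rng.gauss(0.0, 1.0) for _ in range(n)]
        if abs(one_sample_t(null_sample, 0.0)) >= abs(t_value):
            extreme += 1
    return extreme / trials


# Hypothetical observed data, made up for illustration.
observed = [5.1, 4.9, 5.6, 5.8, 6.0, 5.3, 5.5, 5.7]
t_value = one_sample_t(observed, mu0=5.0)
p_value = simulated_p_value(t_value, n=len(observed))
print(f"t-value = {t_value:.3f}, simulated p-value = {p_value:.4f}")
```

A larger `|t_value|` can only lower the count of simulated values that are at least as extreme, which is the tail picture described above.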

We usually set a threshold for the p-value, commonly 0.05: when the p-value < 0.05, we reject $H_{0}$. This threshold is called the significance level.
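The decision rule can be sketched as follows. This is a minimal standard-library illustration, not a replacement for a statistics package: it assumes a two-sided test, obtains the p-value by integrating the Student’s t density numerically with Simpson’s rule, and uses the hypothetical names `t_pdf`, `two_sided_p_value` and `reject_h0`. The t-value and degrees of freedom in the example are made up.

```python
import math


def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    # math.gamma overflows for very large df; fine for small samples.
    coef = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)


def two_sided_p_value(t_value, df, steps=10_000):
    """P(|T| >= |t_value|) for T ~ t(df); steps must be even."""
    a = abs(t_value)
    if a == 0:
        return 1.0
    h = a / steps
    total = t_pdf(0.0, df) + t_pdf(a, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)  # Simpson weights
    central = 2 * (h / 3) * total  # P(-a <= T <= a), using symmetry
    return 1 - central


def reject_h0(t_value, df, alpha=0.05):
    """Reject H0 when the p-value is below the significance level alpha."""
    return two_sided_p_value(t_value, df) < alpha


# Hypothetical example: t-value 2.5 from a sample of size 10 (df = 9).
print(two_sided_p_value(2.5, 9), reject_h0(2.5, 9))
```

The significance level is meant to be fixed before looking at the data, which is why `alpha` is a parameter rather than something derived from the sample.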
