Model selection given 2 models
Also known as confirmatory data analysis (testing hypotheses), as opposed to exploratory data analysis (finding hypotheses worth testing).
Which process is more likely to have generated the data? Which model is better at explaining the observations? Model selection, with only 2 models.
Hypotheses
Null hypothesis
Alternate hypothesis
The decision
So, you decide if parameter
Experiment/ Test
Pick a sample; compute the value of the test statistic (an estimate) from it.
Errors
Type 1
Erroneously reject the null hypothesis when it is true (that is, erroneously accept the alternate hypothesis).
Type 2
Erroneously fail to reject the null hypothesis when the alternate hypothesis is true.
Tradeoff
Trying to decrease the type 1 error rate involves increasing t’, but that increases the type 2 error rate. To visualize the error zones, draw 2 bell curves with means slightly apart and shade the regions on either side of t’.
To decrease both simultaneously, one must increase the sample size.
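The tradeoff can be sketched numerically. The setup below is an assumption made for illustration, not something stated in these notes: a one-sided test on the sample mean with known standard deviation, where t’ is read as the rejection threshold, and the values of `mu0`, `mu1`, `sigma` and `n` are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

def error_rates(threshold, n, mu0=0.0, mu1=0.5, sigma=1.0):
    """Return (type1, type2) for the rule 'reject the null if sample mean > threshold'.

    Assumes the sample mean is normal with standard error sigma / sqrt(n);
    mu0 and mu1 are hypothetical means under the null and the alternate.
    """
    se = sigma / sqrt(n)
    type1 = 1.0 - NormalDist(mu0, se).cdf(threshold)  # reject although null is true
    type2 = NormalDist(mu1, se).cdf(threshold)        # fail to reject although alternate is true
    return type1, type2

for n in (10, 40):
    for t in (0.2, 0.3, 0.4):
        a, b = error_rates(t, n)
        print(f"n={n:3d}  t'={t:.1f}  type1={a:.3f}  type2={b:.3f}")
```

Within one sample size, raising the threshold trades type 1 error for type 2 error; moving to the larger sample size shrinks both at a fixed threshold between the two means.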
In case of
p-value of the statistic
Given a sample, got
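As a minimal sketch of a p-value computation, assume a one-sided z-test with known standard deviation; the function name, the sample values and the null mean `mu0` are all hypothetical. The p-value is the probability, under the null hypothesis, of a test statistic at least as extreme as the one observed.

```python
from math import sqrt
from statistics import NormalDist, fmean

def one_sided_p_value(sample, mu0=0.0, sigma=1.0):
    """P(Z >= z_observed) under the null, for a z-test with known sigma (assumed)."""
    n = len(sample)
    z = (fmean(sample) - mu0) / (sigma / sqrt(n))  # standardized test statistic
    return 1.0 - NormalDist().cdf(z)

# Hypothetical observations, made up for illustration only.
sample = [0.9, -0.2, 1.4, 0.3, 0.8, 1.1, -0.5, 0.6]
print(f"p-value = {one_sided_p_value(sample):.4f}")
```

A small p-value says the observed statistic would be unusual if the null hypothesis were true; it is not the probability that the null hypothesis is true.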
Power of a test
Take
Test design
Consider goodness of test with
Best test for given α
(Neyman-Pearson): Testing
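For reference, the standard statement of the Neyman-Pearson lemma is sketched below, assuming that is the result this note refers to. The symbols θ_0, θ_1, Λ and k are introduced here for illustration, and randomized tests (needed for some discrete distributions) are ignored.

```latex
% Simple null against simple alternate:
H_0 : \theta = \theta_0 \qquad \text{vs.} \qquad H_1 : \theta = \theta_1

% Likelihood ratio of the observed sample x:
\Lambda(x) = \frac{L(\theta_1 ; x)}{L(\theta_0 ; x)}

% Reject H_0 when \Lambda(x) > k, with k chosen so that the type 1 error rate is \alpha:
P\bigl(\Lambda(X) > k \mid H_0\bigr) = \alpha

% Claim: among all tests with type 1 error rate at most \alpha,
% this likelihood ratio test has the largest power P(\text{reject } H_0 \mid H_1).
```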
Difference in differences
Suppose that in experiment A, which compares hypotheses H1 and N (the null hypothesis), it is determined that N cannot be dismissed. In experiment B, one compares H2 and N and observes that N can be dismissed. From this, one cannot conclude that H1 can be dismissed in favor of H2: it is possible that the difference in the evidence supporting H1 and H2 is small.
One should instead conduct an experiment comparing H1 and H2 directly. This has been a common mistake in medical research as of 2011!
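The point can be illustrated with made-up numbers. The sketch below assumes two independent effect estimates with known standard errors and uses two-sided z-tests; the estimates, the standard errors and the 0.05 cutoff are hypothetical choices, not taken from any study.

```python
from math import sqrt
from statistics import NormalDist

def two_sided_p(estimate, se):
    """Two-sided p-value for 'true effect is zero', assuming a normal estimate."""
    z = estimate / se
    return 2.0 * (1.0 - NormalDist().cdf(abs(z)))

# Hypothetical effect estimates and standard errors.
est1, se1 = 0.30, 0.20  # experiment A: H1 against N
est2, se2 = 0.45, 0.20  # experiment B: H2 against N

p1 = two_sided_p(est1, se1)
p2 = two_sided_p(est2, se2)
# Direct comparison: the difference of independent estimates has
# variance equal to the sum of the two variances.
p_direct = two_sided_p(est2 - est1, sqrt(se1**2 + se2**2))

print(f"A vs N:   p = {p1:.3f}")
print(f"B vs N:   p = {p2:.3f}")
print(f"direct:   p = {p_direct:.3f}")
```

With these numbers, N is dismissed at the 0.05 level in experiment B but not in experiment A, yet the direct comparison of the two effects is far from significant.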