Statistics

Spring 2016
course
navigation

April 7

where we are

Today :
For next week :

project discussion

articles & reading from last time

Talk about briefly ... experimental design and p-values
But what is wrong with looking at the data? Answer: you *should* look at some data. But you *should not* do a numerical experiment on *that* data. Otherwise, strong risk of cherry picking the question.
Instead: if you get data from an outside source, divide into parts. Stare at one part, make plots, form conjectures. Then design experiment, and preform experiment on a *different part*.
Example of problem :
The "dark chocolate and weight loss" paper (which was doing bad science on purpose) gives an example of the problem.
The p-value is an important idea ... but not the only idea, and not the only check on whether the conclusion is valid.

practice

Textbook :
R practice with the attached data for this question :
Does drinking affect memory? The data gives experimental subjects, one line per subject, who have been given a memory test after being given a glass of fluid with or without alcohol.
The task is to set up a hypothesis test and see what conclusion can be drawn from the data.
(The memory.R source file is what I used to generate the data and do some visualization and analysis. The comments describe what it is and how it can be used. The functions illustrate some data manipulation and graphing with R.)
http://cs.marlboro.edu/ courses/ spring2016/statistics/ notes/ April_7
last modified Friday April 8 2016 2:54 pm EDT

attachments [paper clip]

     name last modified size
   alcohol.csv Apr 7 2016 1:00 am 1.27kB    memory.R Apr 7 2016 1:24 am 5.45kB