Tues Nov 3 - Statistics Lecture Notes
  - Go over homework; answer questions 
- Discuss mid-term project assignment
- This week: Student's t-Test
    
      -  Several variations; more complicated formulas.
      
-  This is the "real thing" - 
           one of the more common hypothesis testing tools.
      
- New concept: "degrees of freedom" : how many independent variables
 one sample: N - 1
 unpaired two sample: N1 + N2 - 2
 paired two sample (subtract corresponding numbers) : N - 1
 
- Basic idea is that with small N, our estimate of the standard
          deviation isn't very good - sigma is also "fuzzy".  This takes
          that into account, depending on N.
      
- Critical values of t are given in Table D-5 in textbook 
 ...
          but in practice you'd usually let a statistics tool do the work 
          for you.
- Variations described in the text :
 * one sample mean (least common)
 * unpaired two sample assuming equal variances
 (Allison Lennox, one of Jenny's student's asked how to do this just this semester)
 * paired two sample
 
 
- Aside : "p-value"; see pg 235 in text.
    
     - Minor variation on our procedure, 
         slightly different but essentially same.
     
- Instead  of critical values, 
        find probability (one or two tail) of a value at least
         as extreme as the one you found.  This probability is the p-value.
     
- Then accept or reject by comparing p directly to alpha.
   
 
   
- Complications for t-test.
    
      - we typically use a "pooled estimate" of the standard deviation
      
- if "before", "after" data, better to use a "paired" test
      and look at differences in corresponding values.
    
 
  
   
- An example done 3 ways: {10, 12, 12, 13} and {9, 10, 12, 12}, 
      two tail means tests, H0 is that these are the same.
    
      - old "normal" method. (But with N this small, this isn't quite right.)
      
 m1=11.75,s1=1.26,m2=10.75,s2=1.5, diff=1.0, sigma_diff=0.98,
 p-value=2*(1-NORMDIST(1.0,0,0.98,TRUE))=0.33
 The NORMDIST gives area to the left of x=1.0 
                  for mean=0, sigma=0.98 normal curve.
 (1-NORMDIST) is the area to the right.
                  2*(1-NORMDIST) is the two-tailed extreme probability.
- unpaired t-test (assuming nothing about list order)
      
 result: t=1.02, sigma_pooled=1.38, 6 degrees of freedom, p-value=0.346
- paired t-test (assuming elements of lists match)
        
 result: t=2.45, 3 degrees of freedom, p-value=0.092
- There's an excel version of these results 
          in the t-test/ directory.
          Or you can type numbers into the website give below for
	  the t-test results.
    
 
   
- Student's t-TestCalculation tools and references
    
      - See 
          http://www.physics.csbsju.edu/stats/
 for an online tool to do two-tailed paired/unpaired tests.
 (That site also has some good explanations.)
- Also see "Data Analysis" menu in Excel.
 * t-Test: Paired two sample for means
 * t-Test: Two-sample assuming equal variances
 * t-Test: Two-sample assuming different variances (rarer; not in text)
 
<& /home/footer.html, 'Statistics Fall 2003', '../Statistics.html' &>