STA2101: Methods of Applied Statistics I
and
STA442: Methods of Applied Statistics
University of Toronto, Fall
2011
Lecture Friday 2:105:00 p.m. Ramsay Wright 117
Office Hours Wednesday 111 and Friday 1011 in SS6025
This page is under construction, and will be updated frequently throughout the course.
Office Hours Monday Dec 12th: 10am  noon.
Information about the final exam,
including one last computer assignment.
If you are getting strange data errors trying to read data you've
found online, check and see if the problem is with the last variable
you're trying to read. If so, SAS for unix is probably choking on
DOS/Windows line breaks. These are invisible characters that signal the end
of a line, and they are not the same for the 2 operating systems. Say the
data are called something like lifetime.data. Try something like
this:
bash3.00$ dos2unix < lifetime.data > deathtime.data
You'll see a couple of lines that look like error messages, like unable to
open something, but don't worry. Are you able to read the data
from deathtime.data?
 Course Outline
 Term Paper for STA2101: No paper is required for STA442
 Computing information and links
 Homework Assignments
 Assignment
1: Do these matrix algebra review questions in preparation
for Quiz 1 on Friday Sept. 23d. If you don't have a linear
algebra textbook handy, see Section A2 in Appendix A.
 Assignment
2: Do these questions in preparation for Quiz 2 on Friday
Sept. 30th. Bring the R parts to the quiz; they may (or may
not) be handed in.
 Assignment
3: Do these questions in preparation for Quiz 3 on Friday
Oct. 7th. Bring the R parts to the quiz; they may (or may not)
be handed in.
 Assignment
4: Do these questions in preparation for Quiz 4 on Friday
Oct. 14th. Bring the R parts to the quiz; they may (or may
not) be handed in.
 Assignment
5: Do these questions in preparation for Quiz 5 on Friday
Oct. 21st. Bring the R parts to the quiz; they may (or may
not) be handed in.
 Assignment
6: Do these questions in preparation for Quiz 6 on Friday
Oct. 28th. Bring the R parts to the quiz; they may (or may
not) be handed in.
 Assignment
7: Do these questions in preparation for Quiz 7 on Friday
Nov. 4th. Bring the R parts to the quiz; they may (or may
not) be handed in.
 Assignment
8: Do these questions in preparation for Quiz 8 on Friday
Nov. 11th. Bring the R parts to the quiz; they may (or may
not) be handed in.
 Assignment
9: Do these questions in preparation for Quiz 9 on Friday
Nov. 18th. Bring your SAS log and list files to class. You know
they'll probably be handed in.
 Assignment
10: Do these questions in preparation for Quiz 10 on Friday
Nov. 25th. Bring your SAS log and list files to class. They may
be handed in.
 Assignment
11: Do these questions in preparation for Quiz 11 on Friday
Dec. 2nd. Bring your SAS log and list files to class. They may
be handed in.
 Handouts and Overheads: You may want to print these and bring to class so you can write notes on them.
 Introduction to R
 Large sample tools
 Random vectors and the multivariate normal
 Maximum Likelihood
 Wald (and Score) Tests
 Likelihood with R, Part One
 Distributionfree Waldlike tests
 Linear Regression Part 1
 Analysis of Residuals
 Regression on the Trees data
 Linear Regression Part 2: Categorical independent variables and a bit about multiple comparisons (revised).
 Analysis of the (little) tubes data
 Some unix and emacs commands. You might as well print this handout and keep it with you while you work, at first.
 Brief Introduction to SAS
 Little Tube data with SAS: Data definition, Contrasts, multiple comparisons, custom hypothesis tests with proc glm and proc reg.
 SENIC data with SAS Part One: Read and describe the data, elementary tests
 Read and Check the Math data with SAS, including how to read character data.
 Introduction to Logistic Regression (Updated)
 Birth weight data: Logistic regression with R
 Birth weight with SAS: Part 1 (Data Description)
 Birth weight with SAS: Part 2 (Logistic Regression)
 Birth weight with SAS: Part 3 (More Logistic Regression)
 Stepwise Logistic Regression on the birth weight data with R
 Birth weight with SAS: Part 4 (Yet more Logistic Regression). This is for reference, and will not be covered in class.
 Factorial ANOVA: An elementary discussion.
 Factorial ANOVA with SAS proc glm and proc reg.
 Within Cases Part
One: The humble matched ttest
 Within Cases Part
Two: Hotelling's t^{2} with R (The
Multivariate ttest)
 Within Cases Part
Three: Multivariate Linear Model
 Within Cases Part
Four: Multivariate Tests on Withincases data
 Within Cases Part
Five : The Covariance Structure Approach
 Within Cases Part
Six: SAS Examples
 Readings and resources: These are substitutes for a printed textbook.
 library.nu: If you register on this site, it will be easy to download free textbooks including the following:
 Statistical models by A. C. Davison: A strong, highlevel book. If you want to know exactly what is going on with most applied statistical topics, this is a good place to look.
 Applied linear statistical models (5th ed) by Kutner, Nachtsheim, Neter and Li: This books give a simple and thorough explanation of many standard topics in univariate linear models and experimental design.
 A course in large sample theory by T. Ferguson: More than you need for this course, but a gem.
 Applied Statistics and the SAS Programming Language by Ronald P. Cody and Jeffrey K. Smith.
 Modern Applied Statistics with S (4th ed.) by W.N. Venables and B.D. Ripley. This includes R.
 Introduction to
R: A very nice 100page PDF
document. Highly recommended if you are going to be using
R for serious work.
 Appendix A
from the text for another course. It has review material
and some material from the beginning of the course. This is
work in progress but it still may be useful.
 Introduction to
SAS and unix

SAS manuals: You probably want SAS/STAT
 Data sets used in homework and lectures