Statistics Department
Courselist for Fall 2005/Spring 2006

CourseNumberInstructorTime
Introduction to Statistics101a-106aJonathan Reuning-Scherer and staffTues, Thurs 1:00 - 2:15
Probability and Statistics for Scientists238a/538aJoseph ChangMon, Wed, Fri 2:30-3:20
Probability Theory with Applications241a/541aDavid PollardMon, Wed, Fri 9:30 - 10:20
Linear Models312a/612aHannes LeebTues, Thurs 9:00-10:15
Data analysis361a/661aJohn HartiganMon, Wed 2:30 - 3:45
Statistical Inference610aHarrison ZhouTues, Thurs 10:30-11:45
Statistical Case Studies625aJohn EmersonTues, Thurs 1:00 - 2:15
Monte Carlo Methods636aJoseph ChangTues, Thurs 4:00 - 5:15
Bayes Theory653aJohn HartiganTues, Thurs 2:30 - 3:45
Information and Probability668aAndrew Barron and Mokshay MadimanTues, Thurs 10:30 -11:45
Internship in Statistical Research695aJohn Hartigan -
Introductory Statistics 100b/500b John Emerson Mon, Wed, Fri 10:30 - 11:20
Introductory Data Analysis230b/530bHannes LeebMon, Wed 2:30 - 3:45
Theory of Statistics242b/542bHarrison ZhouMon, Wed, Fri 9:30 - 10:20
Stochastic Processes251b/551bMokshay Madiman Mon, Wed 1:00 - 2:15
Information theory364b/664bAndrew BarronTues, Thurs 9:00 - 10:15
Data Mining and Machine Learning 365b/665bHannes LeebMon, Wed 11:30 - 12:45
Applied Math Senior Seminar and Project AM490bAndrew BarronWed 1:00 - 2:50
Advanced Probability600b/330bDavid PollardTues, Thurs 2:30 - 3:45
Markov Processes and Random Fields606bDavid PollardTues, Fri 10:30-11:45
Practical Work626bJohn EmersonWed 2:30-3:45
Statistical Consulting627bJohn EmersonWed 2:30-3:45; Friday either 1-2:15 or 2:30-4:00
Statistical Methods in Genetics and Bioinformatics645bJoseph ChangTues, Thurs 2:30-3:45
Multivariate Statistics for Social Sciences660bJonathan Reuning-Scherer Tues, Thurs 1:00 - 2:15
Nonparametric Statistics680bHarrison ZhouTues, Thurs 4:00-5:15
Research Seminar in Statistics699abSekhar Tatikonda and David PollardThursday 10:30 - 12:30
Departmental Seminar700ab-Monday 4:15 - 5:30
[Stochastic Calculus]603-not taught this year

Introductory Statistics (STAT 100b / STAT 500b)
Instructor: John Emerson
Time: Mon, Wed, Fri 10:30 - 11:20
Place: Mason 211
Webpage: http://www.stat.yale.edu/Courses/QR/stat100.html
Every day we are inundated with data. How do we recognize dishonest or even unintentionally distorted representations of quantitative information? How can we reconcile two medical studies with seemingly contradictory conclusions? How many observations do we need in order to make a sound decision? This course introduces statistical reasoning, emphasizing how Statistics can help us understand the world. Topics include numerical and graphical summaries of data, data acquisition and experimental design, probability, hypothesis testing, confidence intervals, correlation and regression. Students will learn to apply statistical concepts to data using Excel and reach conclusions about real-world problems.
[back to top]

Introduction to Statistics (STAT 101a-106a/STAT 501a-506a)
Instructor: Jonathan Reuning-Scherer and staff
Time: Tues, Thurs 1:00 - 2:15
Place: OML 202
Webpage: http://www.stat.yale.edu/Courses/QR/stat101106.html
Statistics is the science and art of prediction and explanation. In most fields of study research relies on statistical analysis of data. Each of these courses, led by an expert from the field of study, introduces statistical reasoning and emphasizes how Statistics is applied to the particular discipline. Topics include numerical and graphical summaries of data, data acquisition and experimental design, probability, hypothesis testing, confidence intervals, correlation and regression. Students will learn to apply statistical concepts to data using Minitab and reach conclusions about real-world problems. Each course focuses on applications to a particular field of study and is taught jointly by two instructors, one specializing in statistics and the other in the discipline particular to the course (Life Sciences for Stat 101, Political Science for Stat 102, and so on). The courses meet together for the first seven weeks and separately for the final six weeks. The first part of the course is taught by Jonathan Reuning-Scherer and covers fundamentals of probability and statistics. Periodic examples are provided by individual course instructors. The courses separate by area of specialty for the final six weeks.
[back to top]

Introduction to Statistics: Life Sciences (STAT 101a/E&EB 210aG/MCDB 215a)
Instructor: Jonathan Reuning-Scherer and Gunter Wagner
Statistical and probabilistic analysis of biological problems presented with a unified foundation in basic statistical theory. Problems are drawn from genetics, ecology, epidemiology, and bioinformatics.
[back to top]

Introduction to Statistics: Political Science (STAT 102a/EP&E 203a/PLSC 425a)
Instructor: Jonathan Reuning-Scherer and Donald Green
Statistical analysis of politics and quantitative assessments of public policies. Problems presented with reference to a wide array of examples: public opinion, campaign finance, racially motivated crime, and health policy.
[back to top]

Introduction to Statistics: Social Sciences (STAT 103a/SOCY 119a)
Instructor: Jonathan Reuning-Scherer
Descriptive and inferential statistics applied to analysis of data from the social sciences. Introduction of concepts and skills for understanding and conducting quantitative research.
[back to top]

[Introduction to Statistics: Psychology] (STAT 104a/PSYC 201a)
Time: not taught this year
[back to top]

Introduction to Statistics: Medicine (STAT 105a)
Instructor: Jonathan Reuning-Scherer and David Salsburg
Statistical methods relied upon in medicine and medical research. Practice in reading medical literature competently and critically, as well as practical experience performing statistical analysis of medical data.
[back to top]

Introduction to Statistics: Data Analysis (STAT 106a)
Instructor: Andrew Barron
An introduction to Probability and Statistics with emphasis on data analysis.
[back to top]

Introductory Data Analysis (STAT 230b/STAT 530a/PLSC 530b)
Instructor: Hannes Leeb
Time: Mon, Wed 2:30 - 3:45
Survey of statistical methods: plots, transformations, regression, analysis of variance, clustering, principal components, contingency tables, and time series analysis. Uses SPLUS and Web data sources. After or concurrent with Statistics 101-105.
[back to top]

Probability and Statistics for Scientists (STAT 238a/STAT 538a)
Instructor: Joseph Chang
Time: Mon, Wed, Fri 2:30-3:20
Place: ML 104
Fundamental principles and techniques that help scientists think probabilistically, develop statistical models, and analyze data. Essentials of probability: conditional probability, random variables, distributions, law of large numbers, central limit theorem, Markov chains. Statistical inference with emphasis on the Bayesian approach: parameter estimation, likelihood, prior and posterior distributions, Bayesian inference using Markov chain Monte Carlo. Introduction to regression and linear models. Computers are used throughout for calculations, simulations, and analysis of data. After MATH 118a or b or 120a or b. Some acquaintance with matrix algebra and computing assumed.
[back to top]

Probability Theory with Applications (STAT 241a/STAT 541a/MATH 241a)
Instructor: David Pollard
Time: Mon, Wed, Fri 9:30 - 10:20
Place: WLH 208
Webpage: http://www.stat.yale.edu/~pollard/Courses
A first course in probability theory: probability spaces, random variables, expectations and probabilities, conditional probability, independence, some discrete and continuous distributions, central limit theorem, law of large numbers. After or concurrent with Mathematics 120a or b or equivalents.
[back to top]

Theory of Statistics (STAT 242b/STAT 542b/MATH 242b)
Instructor: Harrison Zhou
Time: Mon, Wed, Fri 9:30 - 10:20
Webpage: http://www.stat.yale.edu/~hz68/242/
Principles of statistical analysis: maximum likelihood, sampling distributions, estimation, confidence intervals, tests of significance, regression, analysis of variance, and the method of least squares. After Statistics 241a; after or concurrent with Mathematics 222.
[back to top]

Stochastic Processes (STAT 251b/STAT 551b)
Instructor: Mokshay Madiman
Time: Mon, Wed 1:00 - 2:15
Introduction to the study of random processes, including Markov chains, Markov random fields, martingales, random walks, Brownian motion and diffusions. Tecniques in probability, such as coupling and large deviations. Applications to image reconstruction, Bayesian statistics, finance, probabilistic analysis of algorithms, genetics and evolution. After Statistics 241a or equivalent.
[back to top]

Linear Models (STAT 312a/STAT 612a)
Instructor: Hannes Leeb
Time: Tues, Thurs 9:00-10:15
Place: 24 Hillhouse
The geometry of least squares; distribution theory for normal errors; regression, analysis of variance, and designed experiments; numerical algorithms (with particular reference to Splus); alternatives to least squares. Generalized linear models. Linear algebra and some acquaintance with statistics assumed.
[back to top]

Data analysis (STAT 361a/STAT661a)
Instructor: John Hartigan
Time: Mon, Wed 2:30 - 3:45
Place: Statlab, 140 Prospect Street
Through analysis of data sets using the Splus statistical computing language, study of a selection of statistical topics such as linear and nonlinear models, maximum likelihood, resampling methods, curve estimation, model selection, classification and clustering. After Statistics 242 and Mathematics 222b or 225a or b, or equivalents.
[back to top]

Information theory (STAT364b/STAT664b)
Instructor: Andrew Barron
Time: Tues, Thurs 9:00 - 10:15
Foundations of information theory in mathematical communications, statistical inference, statistical mechanics, probability, and algorithmic complexity. Quantities of information and their properties: entropy, conditional entropy, divergence, redundancy, mutual information, channel capacity. Basic theorems of data compression, data summarization, and channel coding. Applications in statistics and finance. After Statistics 241.
[back to top]

Data Mining and Machine Learning (STAT365b/STAT665b)
Instructor: Hannes Leeb
Time: Mon, Wed 11:30 - 12:45
Techniques for data mining and machine learning are covered from both a statistical and a computational perspective, including support vector machines, bagging, boosting, neural networks, and other nonlinear and nonparametric regression methods. The course will give the basic ideas and intuition behind these methods, a more formal understanding of how and why they work, and opportunities to experiment with machine learning algorithms and apply them to data. After STAT 242b.
[back to top]

Applied Math Senior Seminar and Project (AM490b)
Instructor: Andrew Barron
Time: Wed 1:00 - 2:50
Place: 24 Hillhouse
Under the supervision of a member of the faculty, each student works on an independent project. Students participate in seminar meetings at which they speak on the progress of their projects. Some meetings are devoted to talks by visiting applied mathematicians.
[back to top]

Advanced Probability (STAT 600b/STAT 330b)
Instructor: David Pollard
Time: Tues, Thurs 2:30 - 3:45
Place: WLH 113
Webpage: http://www.stat.yale.edu/~pollard/Courses
Measure theoretic probability, conditioning, laws of large numbers, convergence in distribution, characteristic functions, central limit theorems, martingales. Some knowledge of real analysis is assumed.
[back to top]

[Stochastic Calculus] (STAT 603)
Instructor: -
Time: not taught this year
[back to top]

Markov Processes and Random Fields (STAT 606b)
Instructor: David Pollard
Time: Tues, Fri 10:30-11:45
Place: 24 Hillhouse
Webpage: http://www.stat.yale.edu/~pollard/Courses/
Markov chains on general state spaces; diffusions; Markov random fields; Gibbs measures; percolation. After STAT 600.
[back to top]

Statistical Inference (STAT 610a)
Instructor: Harrison Zhou
Time: Tues, Thurs 10:30-11:45
Place: 24 Hillhouse
Webpage: http://www.stat.yale.edu/~hz68/610
A systematic development of the mathematical theory of statistical inference covering methods of estimation, hypothesis testing, and confidence intervals. An introduction to statistical decision theory. Undergraduate probability at the level of Statistics 241a assumed.
[back to top]

Statistical Case Studies (STAT 625a)
Instructor: John Emerson
Time: Tues, Thurs 1:00 - 2:15
Place: 24 Hillhouse, basement
Webpage: http://www.stat.yale.edu/~jay/625.html
Statistical analysis of a variety of problems including the value of a baseball player, the fairness of real estate taxes, how to win the Tour de France, energy consumption in Yale buildings, and interactive questionnaires for course evaluations. We will emphasize methods of choosing data, acquiring data, and assessing data quality. Graduate, professional, and undergraduate students from any department are welcome, but must seek permission (discussing their background in statistics and goals for the semester) at or before the first class meeting. At least one prior course in statistics is required, but the most important prerequisite is a willingness to get your hands dirty working with real data sets. This will entail a certain amount of "programming," which we believe can be best taught by example, trial and error.
[back to top]

Practical Work (STAT 626b)
Instructor: John Emerson
Time: Wed 2:30-3:45
Place: 24 Hillhouse
Individual one-semester projects, with students working on studies outside the Department, under the guidance of a statistician.
[back to top]

Statistical Consulting (STAT 627b)
Instructor: John Emerson
Time: Wed 2:30-3:45; Friday either 1-2:15 or 2:30-4:00
Place: 24 Hillhouse
Webpage: http://www.stat.yale.edu/~jay/627.html
Statistical consulting and collaborative research projects often require statisticians to explore new topics outside their area of expertise. This course exposes students to real problems, requiring them to draw on their expertise in probability, statistics, and data analysis. Students complete the course with individual projects supervised jointly by faculty outside the department and by one of the instructors.
[back to top]

Monte Carlo Methods (STAT 636a)
Instructor: Joseph Chang
Time: Tues, Thurs 4:00 - 5:15
Place: 24 Hillhouse
Theory and practice of Monte Carlo methods, with emphasis on Markov chain Monte Carlo and statistical applications. Generation of random variables, importance sampling, Metropolis Hastings, Gibbs sampling, variable dimension methods and model selection, multilevel and population based methods, convergence diagnostics. Markov chains in general state spaces and rates of convergence. Applications in Bayesian inference, simulation, and optimization.
[back to top]

Statistical Methods in Genetics and Bioinformatics (STAT 645b)
Instructor: Joseph Chang
Time: Tues, Thurs 2:30-3:45
Place: 24 Hillhouse
Stochastic modeling and statistical methods applied to problems such as mapping quantitative trait loci, analyzing gene expression data, sequence alignment, and reconstructing evolutionary trees. Statistical methods include maximu likelihood, Bayesian inference, Monte Carlo Markov chains, and some methods of classification and clustering. Models introduced include variance components, hidden Markov models, Bayesian networks, and coalescent. Recommended background: Stat 541, Stat 542. Prior knowledge of biology is not required.
[back to top]

Bayes Theory (STAT 653a)
Instructor: John Hartigan
Time: Tues, Thurs 2:30 - 3:45
Place: 24 Hillhouse
Axioms and interpretations of probability. Construction of probablilty distributions. Optimality of Bayes procedures. Martingales. Asymptotics. Markov sampling. Robustness against violations in the assumed distributions. Choice among models.
[back to top]

Multivariate Statistics for Social Sciences (STAT 660b)
Instructor: Jonathan Reuning-Scherer
Time: Tues, Thurs 1:00 - 2:15
A practical introduction to the analysis of multivariate data as applied to examples from the social sciences. Topics to include multivariate analysis of variance (MANOVA), principle components analysis, cluster analysis (hierarchical clustering, k-means), canonical correlation, multidimensional scaling, factor analysis, discriminant analysis, and structural equations modeling. Emphasis is placed on practical application of multivariate techniques to a variety of examples in the social sciences. There are regular homework assignments and a final project. Regular use of some statistical software package (students may choose among SAS, SPSS, and MINITAB). A complete syllabus will be available on the classes server.
[back to top]

Information and Probability (STAT 668a)
Instructor: Andrew Barron and Mokshay Madiman
Time: Tues, Thurs 10:30 -11:45
Place: 24 Hillhouse, basement
Study of several key results in probability using ideas and methods from information theory. Topics include entropy and its relationship to Fisher information, the law of large numbers, central limit theorem (normal approximation), law of small numbers (Poisson approximation), large deviations, martingales, Markov chains, and information projection. The approach we take quantitifies the increase in entropy or more generally the drop in information distance from an approximating distribution. Interpretations from statistics, physics, and finance.
[back to top]

Nonparametric Statistics (STAT 680b)
Instructor: Harrison Zhou
Time: Tues, Thurs 4:00-5:15
Place: 24 Hillhouse
Webpage: http://www.stat.yale.edu/~hz68/680
Introduction to nonparametric methods such as kernel estimation, Fourier basis estimation, wavelet estimation. Optimal minimax convergence rates and constants for function spaces, with connections to information theory. Adaptive estimators (e.g., adaptive shrinkage estimation). If time permits: high dimensional function estimation, functional data estimation, classification, or nonparametric asymptotic equivalence. Applications to real data. Some knowledge of statistical theory at the level of STAT 610a is assumed.
[back to top]

Internship in Statistical Research (STAT 695a)
Instructor: John Hartigan
Time: -
The Internship is designed to give students an opportunity to gain practical exposure to problems in the analysis of statistical data, as part of a research group within industries such as: medical and pharmaceutical research, financial, information technologies, telecommunications, public policy, and others. The Internship experience often serves as a basis for the Ph.D. dissertation. Students will work with the Director of Graduate Studies and other faculty advisors to select suitable placements. Students will submit a one-page description of their Internship plans to the DGS by May 1st, which will be evaluated by the DGS and other faculty advisors by May 15th. Upon completion of the Internship, students shall submit a written report of their work to the DGS, no later than October 1st. The Internship will be graded on a Satisfactory/ Unsatisfactory basis, and will be based on the student's written report and an oral presentation. This course is a one-credit elective requirement for the Ph.D. degree.
[back to top]

Research Seminar in Statistics (STAT 699ab)
Instructor: Sekhar Tatikonda and David Pollard
Time: Thursday 10:30 - 12:30
Place: 24 Hillhouse
Continuation of the dissertation seminar: message passing algorithms, random graphs, the objective method. Not for credit.
[back to top]

Departmental Seminar (STAT 700ab)
Instructor: -
Time: Monday 4:15 - 5:30
Place: 24 Hillhouse Avenue, room 107
Webpage: http://www.stat.yale.edu/seminars.html
Important activity for all members of the department. See webpage for weekly seminar announcements.
[back to top]


Revised: January 6, 2006