Course description


STAT 439 will provide an introduction to the principles and general methods for the analysis of categorical data. Categorical data occur extensively in both observational and experimental studies, as well as in industrial and data science applications. Topics include, but are not limited to, contingency table analysis, Poisson regression, logistic regression, log-linear models, and multicategory logit models. While some theoretical statistical detail is given, the primary focus will be on methods of data analysis. Problems will be motivated from a scientific perspective.

Learning outcomes

At the completion of this course, students will:

  1. Formulate scientific questions involving categorical response data.
  2. Identify appropriate statistical methods for categorical data and describe how they differ from methods for continuous data.
  3. Perform and interpret statistical inference for proportions, including the use of likelihood, Wald procedures, score procedures and likelihood-ratio procedures.
  4. Carry out and interpret contingency table analysis, including 2x2 and higher-order tables, estimation and testing for odds ratios, chi-squared tests of independence, and Fisher’s Exact Test.
  5. Understand the basic theory of generalized linear models—including logistic regression, multinomial logistic regression, and Poisson regression—and use these models to answer a research question.
  6. Perform and interpret a statistical analysis of matched pairs categorical data.
  7. Understand the basics of the analysis of correlated and clustered categorical response data including generalized estimating equations and generalized linear mixed models.

Time and location


TR 9:25-10:40am
Wilson Hall 1-143


Instructor


Dr. Stacey Hancock
email:
Office: Wilson 2-195
Phone: (406) 994-5350

Office hours:

  • Virtual (see D2L for Zoom link): Mondays 3:10-4:00pm
  • In person (Wilson 2-195): Tuesdays and Thursdays 3:10-4:00pm
  • By appointment — Please feel free to email me to schedule an appointment outside of office hours.

Prerequisites


Entrance to STAT 439 requires completion of STAT 412 or STAT 512 (Methods for Data Analysis II). Students may take STAT 412/512 concurrently with permission of the instructor.


Course materials


Textbook

The required textbook for the course is Alan Agresti’s An Introduction to Categorical Data Analysis, 3rd edition, Wiley, 2018.

Important: Agresti has another textbook called Categorical Data Analysis. Make sure you are using his book that starts with “An Introduction…”.

Computing

We will use the statistical programming language R through the RStudio interface. See my statistical computing resources page for download instructions and extra resources.

For announcements, grades, discussion forums, and turning in assignments, see our D2L page.

  • Make sure you are receiving email notifications for any D2L activity. In D2L, click on your name, then Notifications. Check that D2L is using an email address that you regularly check; you have the option of registering a mobile number. Check the boxes to get notifications for announcements, content, discussions, and grades.
  • If you have a question about the course materials, computing, or logistics, please post your question to your D2L discussion board instead of emailing your instructors. This ensures all students can benefit from the responses. Other students are encouraged to respond.

Course assessment


Your grade in STAT 408 will be comprised of the following components.

  1. Homework (25%): Homework and data analysis problems will be assigned approximately bi-weekly.

  2. Quizzes (40%): In lieu of midterm exams, we will have bi-weekly 30-minute in-class quizzes over material from the past two weeks. The lowest quiz grade will be dropped.

    • You may use one sheet (8.5” x 11”, front and back) of notes on each quiz.
    • You will not be asked to write R code on the quizzes, but you may be asked to interpret R code and output.
  3. Project (15%): A group data analysis project will be completed throughout the course. Due dates and details are posted in D2L. We will use our final exam time (Thursday, May 12, 8:00–9:50am in our usual classroom) for project presentations.

  4. Final Exam (20%):

    • A take-home final exam will be distributed on the last day of classes, due by 11:00pm on Wednesday, May 11.

Letter grades generally follow the typical scale:

93-100 = A
90-92 = A-
88-89 = B+
83-87 = B
80-82 = B-
etc.

These cutoffs may be adjusted down (never up!) at the end of the semester, depending on the grade distribution in the course. Thus, a 93% will guarantee an A, a 90% will guarantee an A-, etc.


Course policies


Classroom community

All members of the classroom community (instructor, students, visitors) are expected to treat each other with courtesy and respect. Our comments to others should be factual, constructive, and free from harassing statements. You are encouraged to disagree with others, but such disagreements need to be based upon facts and documentation (rather than prejudices and personalities). It is the instructor’s goal to promote an atmosphere of mutual respect in the classroom.

The success of all students in this course depends on all members of the classroom community agreeing to:

  • Show up and be present
  • Listen
  • Contribute
  • Build on other’s ideas
  • Speak clearly and loudly enough for all to hear

Please contact the instructor if you have suggestions for improving the classroom environment.

Absences, illnesses, and late work policy

You will note that attendance is not a required component of your grade. However, there is no better predictor of college grades than class attendance (Crede, Roch, & Kieszczynka, 2010). In order to participate in class discussions and build a complete classroom community, please plan to attend every class. If you are ill or have a conflict, an email letting me know you won’t be in class is appreciated.

Homework

It is expected that students turn in homework by the posted deadline, but understood that life sometimes prevents us from meeting a deadline. Thus, homework will be accepted after the deadline for no penalty up until the point that homework solutions are posted. After homework solutions are posted, you may still earn half credit on your homework by turning in your completed assignment with extended explanations added to each problem. Extended explanations are required to demonstrate that you fully understand each problem and are not just mimicking the posted solutions. Late assignments which directly copy the posted solutions will be treated as plagiarism and will receive no credit.

Quizzes

The last quiz day of the semester will be a designated make-up quiz. Only students who had to miss a quiz earlier in the semester will be allowed to take this quiz, and the grade from this quiz will serve as the grade for your missed quiz. As an added tool to address missed quizzes, the lowest quiz grade will be dropped.

Policy on collaboration and academic misconduct

At a minimum, any act of academic dishonesty, which includes but is not limited to plagiarism, cheating, multiple submissions, or facilitating others’ misconduct, will result in a score of zero on the assignment/quiz/exam in question and notification of department and university officials. Further action may be taken as warranted. If you have any questions about the limits of collaboration or about using and citing sources, you are expected to ask for clarification.

Collaboration on individual assignments

After attempting to complete homework problems on your own, you are permitted to collaborate on homework in a constructive manner for all involved—each individual in the collaboration needs to ensure they understand and could explain the process of solving each problem. While I encourage you to talk through problems with fellow students, the work you turn in must be your own and must be written in your own words (unless the assignment specifically states otherwise).

Each homework will require a “citations” page where you cite all sources (including web forums such as Stack Overflow) and individuals used to complete that homework assignment. Paraphrasing or quoting another’s work without citing the source is a form of academic dishonesty. Even inadvertent or unintentional misuse or appropriation of another’s work (such as relying heavily on source material that is not expressly acknowledged) is considered plagiarism. Homework assignments that do not cite sources or individuals, or assignments where answers are copied directly from another student, will be considered and treated as plagiarism, and will receive a zero grade.

Guidance on Citing Sources:

  • You do not need to include in-text citations—only a reference list at the end of your assignment.
  • The citation style you choose is up to you, but each entry should include enough information for someone to find the source on their own. This includes the author’s name (could be an organization), date of publication (if not known, then state “n.d.”), title of source, location and name of publisher (if applicable), and the URL where the source was retrieved.
  • I typically use APA style. Mendeley has a nice reference webpage for how to cite different sources in APA style.
  • If one of your sources on your homework assignment is another student in the class or a tutor, your citation needs only to state their name, e.g., “Direct correspondence with Jane Doe.”
  • You do not need to cite any assigned readings, notes, or material made available on our course website, nor do you need to cite asking questions of your instructor or teaching assistant.
  • If you did not use any external sources on your assignment, please state, “No external sources used.”

If you have any questions about the limits of collaboration or about using and citing sources, you are expected to ask for clarification.

MSU policy

Students in an academic setting are responsible for approaching all assignments with rigor, integrity, and in compliance with the University Code of Student Conduct. This responsibility includes:

  1. consulting and analyzing sources that are relevant to the topic of inquiry;
  2. clearly acknowledging when they draw from the ideas or the phrasing of those sources in their own writing;
  3. learning and using appropriate citation conventions within the field in which they are studying; and
  4. asking their instructor for guidance when they are uncertain of how to acknowledge the contributions of others in their thinking and writing.

More information about Academic Misconduct from the Dean of Students

Diversity and inclusivity statements

Respect for Diversity: It is our intent that students from all diverse backgrounds and perspectives be well-served by this course, that students’ learning needs be addressed both in and out of class, and that the diversity that students bring to this class be viewed as a resource, strength and benefit. It is our intent to present materials and activities that are respectful of diversity: gender identity, sexual orientation, disability, age, socioeconomic status, ethnicity, race, religion, culture, perspective, and other background characteristics. Your suggestions about how to improve the value of diversity in this course are encouraged and appreciated. Please let us know ways to improve the effectiveness of the course for you personally or for other students or student groups.

In addition, in scheduling exams, we have attempted to avoid conflicts with major religious holidays. If, however, we have inadvertently scheduled an exam or major deadline that creates a conflict with your religious observances, please let us know as soon as possible so that we can make other arrangements.

Support for Inclusivity: We support an inclusive learning environment where diversity and individual differences are understood, respected, appreciated, and recognized as a source of strength. We expect that students, faculty, administrators and staff at MSU will respect differences and demonstrate diligence in understanding how other peoples’ perspectives, behaviors, and worldviews may be different from their own.