The learning goals of this first mini project are
- to solidify your understanding of what a true experiment is, specifically an experiment with crossed factors,
- to learn how to use the Qualtrics software to deliver experiments, and
- to gain experience with analyzing the data generated from an experiment with crossed factors.
You will conduct an online experimental study using survey methodology. Your experiments will be set up using the Qualtrics survey platform. Your experiments will utilize vingettes or very simple visual stimuli presentation.
Your study is required to be experimental, meaning you will manipulate factors and randomly assign levels of those factors to units (people). For this first mini project you must employ a basic factorial design with two factors (called a BF[2] in our textbook). Your factors will each have at least two levels and your factors will be crossed, thus you will be creating four carefully controlled conditions (i.e., four vignettes). These four conditions will be randomly assigned to people (fellow classmates). They are your “experimental material”.
This mini project will require you to design a BF[2] study, implement that study in Qualtrics, collect and analyze relevant data, and hand in a short written report describing your study and its findings. Your project will involve fitting an analysis of variance (ANOVA) model. ANOVA is the primary statistical technique we are learning in this course. The project is an opportunity to apply and synthesize the material we are learning in class to a specific context. It will also give you the opportunity to show off what you have learned about data analysis, visualization, and variance decomposition in this course and from other SDS courses you may have taken. These projects are a major component of the class, and successful completion is required to pass.
Research Questions
After settling on a research question, you will learn how to use Qualtrics and begin creating your experimental stimuli. Reading chapter four of your textbook could be helpful as you decide on the content of your study.
Picking a good research question might be tricky. But start by brain storming things about people that you are interested in. Perhaps you are interested in how stressful people perceive different situations to be, or their helping behavior given different characteristics of a person in need. You might also assess health behaviors — for example, making judgement about which exercises to do, or foods to eat, or which situations might lead people to be more or less likely to want to get vaccinated. Starting with something your are curious about with regards to your own behavior is often and interesting place to start. When choosing your research question the most important feature is that you would be able to write short (two or three sentences) vignettes, varying the scenarios, characteristics, features, etc., to answer your question. Secondary in importance is how interested you are in the question, and then as a distant third would be how important your research question is for society. This is not the time for earth shattering research discoveries. I will not be grading on creativity for this assignment. Just make it something very doable that you capture your interest for a few weeks.
Hypotheses
The topics I list above are meant to get you started, you should pose the question your want to answer as precisely as possible at the outset. Next, identify the factor(s) of interest, the levels of these factors (i.e., treatment conditions), and think about how you will measure your response variable. You should also make three hypotheses, a priori (before you collect and analyze the data), about the results you expect to see. Remember that each design comes with a set of questions that it can answer. For example, think about the the three important questions from Kelly’s hamster study. Your three hypotheses will be about
- The main effect of factor 1,
- The main effect of factor 2, and
- The interaction of factor 1 and factor 2.
General Rules
You may discuss your project with other students, but each of you will have a different topic, so there is a limit to how much you can help each other. You may consult other sources for information about the non-statistical, substantive issues in your problem, but you should credit these sources in your report. Feel free to consult Randi or the Stats TAs about statistical questions.
Submission
All deliverables must be delivered electronically via Moodle by the due date. Please submit
- your exported Qualtrics survey experiment,
- the cleaned data generated by your experiment,
- the
.Rmd
file you use to create you report, and
- your project technical report as an
.html
file.
Technical Report
Content
You should not need to present all of the R code that you wrote throughout the process of working on this project. Rather, the technical report should contain the minimal set of R code that is necessary to understand your results and findings in full. If you make a claim, it must be justified by explicit calculation. A knowledgeable reviewer should be able to compile your .Rmd
file without modification, and verify every statement that you have made. All of the R code necessary to produce your figures and tables must appear in the technical report. In short, the technical report should enable a reviewer to reproduce your work in full.
Tone
This document should be written for peer reviewers and/or your collaborating partner. You should aim for a level of complexity that is more statistically sophisticated than an article in the Science section of The New York Times, but less sophisticated than an academic journal. For example, you may use terms that that you will likely never see in the Times (e.g. bootstrap), but should not dwell on technical points with no obvious ramifications for the reader (e.g. reporting why the F-distribution is used for ratios of chi-square distributed random variables). Your goal for this paper is to convince a statistically-minded reader (e.g. a student in this class, a student from another school who has taken an introductory statistics class that you have addressed your research question in a meaningful way. Even a reader with no background in statistics should be able to read your paper and get the gist of it.
Additional Thoughts
The technical report is not simply a dump of all the R code you wrote during this project. Rather, it is a narrative, with technical details, that describes how you addressed your research question. You should not present tables or figures without a written explanation of the information that is supposed to be conveyed by that table or figure. Keep in mind the distinction between data and information. Data is just numbers, whereas information is the result of analyzing that data and digesting it into meaningful ideas that human beings can understand. Your technical report should allow a reviewer to follow your steps from converting data into information. There is no limit to the length of the technical report, but it should not be longer than it needs to be. You will not receive extra credit for simply describing your data ad infinitum. For example, simply displaying a table with the means and standard deviations of your variables is not meaningful. Writing a sentence that reiterates the content of the table (e.g. “the mean of variable \(x\) was 34.5 and the standard deviation was 2.8…”) is equally meaningless. What you should strive to do is interpret these values in context (e.g. “although variables \(x_1\) and \(x_2\) have similar means, the spread of \(x_1\) is much larger, suggesting…”).
No figure or table is required for mini project 1. If you do choose to include them, you should present figures and tables in your technical report in context. These items should be understandable on their own, in the sense that they have understandable titles, axis labels, legends, and captions. Someone glancing through your technical report should be able to make sense of your figures and tables without having to read the entire report. That said, you should also include a discussion of what you want the reader to learn from your figures and tables.
Your report should be submitted via Moodle as an R Markdown (.Rmd
) file and the corresponding rendered output (.html
) file.