The amount of data in the world is vast and is increasing exponentially. It is easy to become overwhelmed and lose sight of the goal of data: to answer questions we have about the world in a specific, concise manner. The goal of this course is to help craft answerable questions—and then answer them. In order to do this, we will be using a programming language (“R”) to help us organize data, make clean, clear graphs, and help with appropriate analysis of the data.
This course will serve two main goals. The first is an introductory statistics course: gain knowledge of the basic statistical tests, how to interpret their results in a reasonable manner, and understand what those tests are doing at a conceptual level. The second is to learn the computational language of R: how to sort, shape, and handle data, create simulations and interpret the results, and build clean, clear graphical representations of the data presented.
This course is taught at the introductory level and has no prerequisites, but does require a significant amount of time and energy outside of the classroom as we are working towards the two aforementioned goals at once. This course is appropriate for students who plan to seriously create and analyze their own statistics for their work. It may be taken alone, or as a sequel to Presentation of Statistics. There is some overlap between the two courses, but their focus and goals are different. Students who take Presentation of Statistics first will get a broader skill set and a more gentle introduction.