Data Analysis Practice I-E2
Numbering Code | U-LAS11 20005 SE55 | Year/Term | 2022 ・ First semester | |
---|---|---|---|---|
Number of Credits | 2 | Course Type | Seminar | |
Target Year | All students | Target Student | For all majors | |
Language | English | Day/Period | Wed.4 | |
Instructor name | Martin Robert (Graduate School of Pharmaceutical Sciences Program-Specific Associate Professor) | |||
Outline and Purpose of the Course |
The world around us, is filled with numbers (data) that range over many scales of space and time and that describe its organization. In biology, traditionally, data feature parts lists and partial views of the connections between those parts. However, there is also a vast amount of quantitative (numerical data) that is accumulating, whether from sequences of DNA, concentrations of various biomolecules, or other types of data. The ability to handle, process, explore, and visualize data are important skills for all students. While in this course many examples will be derived from biology, the mindset and basic analysis workflows are widely applicable in any domain of science, engineering and beyond. In this course you will learn how to use R, RStudio, and the Tidyverse packages to clean, process, manipulate, explore, and visualize data. This course is for beginners and there is no specific requirement. |
|||
Course Goals |
By the end to this course participants should be able to: - Perform basic data processing and analysis using R - Find and describe different forms of (biological) data - Elaborate specific questions about the data - Clean and process raw data - Transform data - Draw various types of plots to interpret from its results - Gain insight into data - Develop analysis workflows - Effectively communicate the results of data analysis |
|||
Schedule and Contents |
Week 1 Guidance and introduction Week 2 What is data? Getting started with R Week 3 Workflow demonstration Week 4-5 Importing and cleaning up data Week 6-7 Data transformation Week 8 Data visualization Week 9 Digging deeper into R using dplyr Week 10 Dealing with specific data (strings, dates, etc.) Week 11 Getting to grips with ggplot - producing publication-quality figures Week 12 Working with single variables Week 13 Exploring relationships among variables Week 14 Looking and back and looking forward Week 16 Feedback |
|||
Evaluation Methods and Policy |
20 % Class attendance/ participation 60 % In-class exercises and homework assignments 20 % Project and presentation |
|||
Course Requirements | Students should bring a computer to class to complete in-class exercises and tutorials as well as homework assignments. | |||
Study outside of Class (preparation and review) | Mainly in the form of assigned reading and homework assignments. Students should expect to spend about 1-2 hours per week preparing for the class and completing assignments. | |||
Textbooks | Textbooks/References | Insights from Data with R : An Introduction for the Life and Environmental Sciences, Owen L. Petchey, Andrew P. Beckerman, Natalie Cooper, and Dylan Z. Childs, (Oxford University Press USA, 2021) | ||
References, etc. | R for data science, Wickham and Grolemund, (O'Reilly Media, 2017) |