
Description A typical data analysis project may involve several parts, each including several data files and different scripts with code. Keeping all this organized can be challenging. Part of our Professional Certificate Program in Data Science, this course explains how to use Unix/Linux as a tool for managing files and directories on your computer and…
Description In this course, part of our Professional Certificate Program in Data Science,we cover several standard steps of the data wrangling process like importing data into R, tidying data, string processing, HTML parsing, working with dates and times, and text mining. Rarely are all these wrangling steps necessary in a single analysis, but a data…
Description Linear regression is commonly used to quantify the relationship between two or more variables. It is also used to adjust for confounding. This course, part ofourProfessional Certificate Program in Data Science, covers how to implement linear regression and adjust for confounding in practice using R. In data science applications, it is very common to…
Description Perhaps the most popular data science methodologies come from machine learning. What distinguishes machine learning from other computer guided decision processes is that it builds prediction algorithms using data. Some of the most popular products that use machine learning include the handwriting readers implemented by the postal service, speech recognition, movie recommendation systems, and…
Description To become an expert data scientist you need practice and experience. By completing this capstone project you will get an opportunity to apply the knowledge and skills in R data analysis that you have gained throughout the series. This final project will test your skills in data visualization, probability, inference and modeling, data wrangling,…
Description As part of our Professional Certificate Program in Data Science, this course covers the basics of data visualization and exploratory data analysis. We will use three motivating examples and ggplot2, a data visualization package for the statistical programming language R. We will start with simple datasets and then graduate to case studies about world…
Description While the United States is one of the world’s wealthiest nations, it is far from the healthiest. Our nation’s burden of disease affects businesses every day, from sick employees and families reducing productivity and increasing costs, to product recalls and failures, to environmental scandals such as toxic chemical emissions harming communities and reputations.Named Runner…
Description Our world’s climate is changing. Of the top twenty hottest years ever recorded, sixteen have occurred in the last two decades. This warming has already had a profound effect. Many feel powerless in the face of this challenge, but you can make a difference. By looking at air quality, nutrition, infectious diseases, and human…
Description How can we eradicate malaria? Explore the scientific and technological underpinnings of malaria, as well as the historical, political, social, and economic contexts in which control, elimination, and eradication efforts unfold. Through foundational lectures and supplemental interviews, this course provides participants with a toolbox of knowledge and skills. Learners will be guided through the…
Description This short literature course, based on the first half of the Masterpieces of World Literature edX MOOC, examines how civilizations and cultures of the ancient world defined themselves through literature and how that literature has continued to contribute to our understanding of those civilizations and cultures today.Cities, nations, and empires from antiquity through the…