HarvardX: Data Science: Wrangling

Learn to process and convert raw data into formats needed for analysis.

4.3|Reviews (18)
₹12367
✓ Compare courses before making a decision
Check Latest Price →
Price may vary. Check latest price on provider site.

Course Insight

Suitable for beginner learners. This course serves as an entry point into Computer Science, building foundational knowledge before moving on to advanced frameworks or specialized paths.

Beginner FriendlyCertification IncludedSelf-Paced LearningProject-Based

SKILLS TO
MASTER

Computer Science Basics
Fundamental principles and concepts
Practical ApplicationTrending
Real-world project implementation
Best Practices
Industry standard workflows and guidelines
Problem Solving
Core Concepts
Implementation
Workflow Integration
Optimization
Careers:Data Scientist, Data Analyst, Machine Learning Engineer.

Quick Facts

8 weeks
Beginner
Online Course
Below sections are verified from last major sync. For real-time updates and today's latest lectures, Check official page here.

What You’ll Learn

In this course, part of our Professional Certificate Program in Data Science,we cover several standard steps of the data wrangling process like importing data into R, tidying data, string processing, HTML parsing, working with dates and times, and text mining. Rarely are all these wrangling steps necessary in a single analysis, but a data scientist will likely face them all at some point.

Very rarely is data easily accessible in a data science project. It's more likely for the data to be in a file, a database, or extracted from documents such as web pages, tweets, or PDFs. In these cases, the first step is to import the data into R and tidy the data, using the tidyverse package. The steps that convert data from its raw form to the tidy form is called data wrangling.

This process is a critical step for any data scientist. Knowing how to wrangle and clean data will enable you to make critical insights that would otherwise be hidden.

See how this course curriculum compares with alternatives

Outcomes

  • Importing data into R fromdifferent file formats.
  • Web scraping.
  • How to tidy data using the tidyverse tobetter facilitateanalysis.
  • String processing with regular expressions (regex).
  • Wrangling data using dplyr.
  • How to workwith dates and times as file formats.
  • Text mining.
See side-by-side differences in learning outcomes

FAQs

Top Alternatives

Highly-rated courses worth your attention

HarvardX: Data Science: R Basics
4.5· 8 weeks
Beginner
₹18,177
HarvardX: Data Science: Inference and Modeling
4.3· 8 weeks
Beginner
₹12,367
HarvardX: Data Science: Productivity Tools
4.3· 8 weeks
Beginner
₹12,367
HarvardX: Introduction to Data Science with Python
4.5· 8 weeks
Intermediate
₹24,817
HarvardX: Data Science: Linear Regression
4.3· 8 weeks
Beginner
₹12,367
IBM: Python Basics for Data Science
4.5· 3 weeks
Beginner
₹8,217
HarvardX: Data Science: Wrangling
4.3(18+ learners)
✓ Compare side-by-side before spending money
Check Latest Price →
Price may vary. Check latest price on provider site.