4.64 out of 5
4.64
57 reviews on Udemy

Cleaning Data In R with Tidyverse and Data.table

Get your data ready for analysis with R packages tidyverse, dplyr, data.table, tidyr and more
Instructor:
R-Tutorials Training
428 students enrolled
English [Auto-generated]
Convert raw and dirty data into clean data
Understand how clean data looks and how to achieve it
Use the R Tidyverse packages to clean data
Handle missing values in R
Detect outliers
Filter and query tables
Select a proper class for your data
Clean various classes of data (numeric, string, categorical, integer, ...)

Welcome to this course on Data Cleaning in R with Tidyverse, Dplyr, Data.table, Tidyr and many more packages!

You may already know this problem: Your data is not properly cleaned before the analysis so the results are corrupted or you can not even perform the analysis.

To be brief: you can not escape the initial cleaning part of data science. No matter which data you use or which analysis you want to perform, data cleaning will be a part of the process. Therefore it is a wise decision to invest your time to properly learn how to do this.

Now as you can imagine, there are many things that can go wrong in raw data. Therefore a wide array of tools and functions is required to tackle all these issues. As always in data science, R has a solution ready for any scenario that might arise. Outlier detection, missing data imputation, column splits and unions, character manipulations, class conversions and much more – all of this is available in R.

And on top of that there are several ways in how you can do all of these things. That means you always have an alternative if you prefer that one. No matter if you like simple tools or complex machine learning algorithms to clean your data, R has it.

Now we do understand that it is overwhelming to identify the right R tools and to use them effectively when you just start out. But that is where we will help you. In this course you will see which R tools are the most efficient ones and how you can use them.

You will learn about the tidyverse package system – a collection of packages which works together as a team to produce clean data. This system helps you in the whole data cleaning process starting from data import right until the data query process. It is a very popular toolbox which is absolutely worth it.

To filter and query datasets you will use tools like data.table, tibble and dplyr.

You will learn how to identify outliers and how to replace missing data. We even use machine learning algorithms to do these things.

And to make sure that you can use and implement these tools in your daily work there is a data cleaning project at the end of the course. In this project you get an assignment which you can solve on your own, based on the material you learned in the course. So you have plenty of opportunity to test, train and refine your data cleaning skills.

As always you get the R scripts as text to copy into your RStudio instance. And on course completion you will get a course certificate from Udemy.

R-Tutorials Team

 

Introduction

1
Intro
2
Why Clean and Tidy Data Is Necessary for a Successful Analysis
3
Why to Choose R for Data Cleaning
4
How to Easily Importing Data into R
5
Best Table Types in R
6
Script Course Intro

Handling Missing Values And Detecting Outliers

1
Welcome to: Missing Data Handling and Outlier Detection
2
Introduction to Missing Data Handling
3
Script Missing Values and Outliers
4
Simple Methods for Missing Data Handling
5
Machine Learning Learning for Missing Data Imputation
6
Statistical Outliers
7
Detecting Outliers in Univariate Datasets
8
Detecting Outliers in Multivariate Datasets

The Tidyverse Toolbox For Efficient Data Cleaning

1
Welcome to: The Tidyverse
2
What Is the Tidyverse?
3
Script Tidyverse
4
Using the Pipe Operator
5
Exploring the Tibble
6
Tidy Data as the Underlying Principle of the Tidyverse
7
Changing Table Formats
8
How to split Columns
9
Converting from Long to Wide Format
10
String Manipulations with Stringr

Subsetting, Filtering And Queries With data.frame, data,table And tibble

1
Welcome to: Query Systems in R
2
Script Queries
3
Filtering and Querying - General Background
4
Using dplyr for Queries
5
Queries with 'data.table'

Course Project - Apply What You Learned On Real World Data

1
Welcome to: The Great Course Project
2
Project Data - Get the Data here!
3
Project Assignment
4
Script Course Project
5
Solution: Data Import
6
Solution: Data Cleaning
7
Solution: Querying
8
Course Summary
You can view and review the lecture materials indefinitely, like an on-demand channel.
Definitely! If you have an internet connection, courses on Udemy are available on any device at any time. If you don't have an internet connection, some instructors also let their students download course lectures. That's up to the instructor though, so make sure you get on their good side!
4.6
4.6 out of 5
57 Ratings

Detailed Rating

Stars 5
24
Stars 4
24
Stars 3
7
Stars 2
1
Stars 1
0
6db713390cb2e8a668093f8beeb68042
30-Day Money-Back Guarantee

Includes

4 hours on-demand video
5 articles
Full lifetime access
Access on mobile and TV
Certificate of Completion