dplyr is a very powerful tool in R that is used widely for data manipulation and analysis. It is fairly simple to learn and apply to your work. Oftentimes, with just a few elegant lines of code, your data becomes that much easier to dissect and analyze. For these reasons, it is an essential and foundational skill to master for any aspiring data scientist.
What You'll Learn
- Setting up dplyr and using this incredibly useful package in data processing
- Learning the basic functions that dplyr can perform, including “Arrange,” “Group_By,” “Summarize” “Select,” “Filter,” “Intersect,” and “Setdiff”
- Using the functions to perform basic data manipulation tasks
- Using dplyr on a real-world dataset on wine ratings from kaggle