Duplicates - Cghlewis/data-wrangling-functions GitHub Wiki
This section is about working with duplicate cases in your data. In education research duplicate data may occur for reasons such as a participant completed a survey twice. In the cases of duplicates, there are often two things we wan to do:
- Find and remove duplicates.
- Count duplicates in your data.
Review and remove duplicates
- Find and remove duplicates
- [Count duplicates](See Count)
Main functions used in examples
| Package | Functions |
|---|---|
| janitor | get_dupes() |
| dplyr | distinct() |
Other functions used in examples
| Package | Functions |
|---|---|
| dplyr | arrange(); pick() |
Resources