dplyr in R


The dplyr package is part of R’s tidyverse, a collection of packages designed to work together. dplyr is a grammar of data manipulation, providing a consistent set of verbs that help you solve the most common data manipulation challenges. Five main dplyr functions cover most of these tasks: filter(), arrange(), select(), mutate() and summarise() (also spelled summarize()).

  • mutate() adds new variables that are functions of existing variables.
  • select() picks variables based on their names.
  • filter() picks cases based on their values.
  • summarise() reduces multiple values down to a single summary.
  • arrange() changes the ordering of the rows.
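
The five verbs listed above can be sketched on a small data frame. This is only a minimal illustration: the scores data and its column names (student, math, art) are made up for the example, and it assumes the dplyr package is installed.

```r
library(dplyr)

# Hypothetical toy data, invented for illustration only
scores <- data.frame(
  student = c("Ana", "Ben", "Cai", "Dee"),
  math    = c(90, 55, 70, 80),
  art     = c(60, 75, 85, 95),
  stringsAsFactors = FALSE
)

result <- scores %>%
  mutate(total = math + art) %>%  # add a column computed from existing columns
  select(student, total) %>%      # keep only the named columns
  filter(total > 140) %>%         # keep rows whose total exceeds 140
  arrange(desc(total))            # sort rows from highest to lowest total

# Collapse all rows into a single summary value
summary_tbl <- scores %>%
  summarise(mean_math = mean(math))

print(result)
print(summary_tbl)
```

Each verb takes a data frame as its first argument and returns a new data frame, which is why the steps chain together with the %>% pipe. The original scores data frame is left unchanged.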
