Module 1: Introduction to R and Data Import/Manipulation




Data Manipulation with dplyr: A Grammar for Data


Data manipulation often entails tasks like filtering, summarizing, grouping, and joining datasets. The dplyr package, created by Hadley Wickham and collaborators, simplifies these operations with a consistent, intuitive grammar for data manipulation. It is built around five core verbs:

  • filter(): Use this verb to extract specific rows from your dataset based on certain conditions.
  • arrange(): Sort the rows of your dataset by one or more variables, in ascending order by default or in descending order with desc().
  • select(): Choose a subset of columns from your dataset, making it easier to focus on the relevant data.
  • mutate(): Create new variables or modify existing ones by applying functions or operations to your data.
  • summarize(): Collapse many rows into summary statistics such as counts, means, or totals, usually one row per group when combined with group_by().
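
The five verbs can be chained into a single pipeline. The sketch below is only an illustration: the `sales` data frame, its column names, and its values are hypothetical and invented for this example, and it assumes the dplyr package is installed.

```r
library(dplyr)

# Hypothetical toy data, invented purely for illustration
sales <- data.frame(
  region = c("North", "South", "North", "East", "South"),
  units  = c(10, 4, 7, 12, 9),
  price  = c(2.5, 3.0, 2.5, 4.0, 3.0)
)

top_sales <- sales %>%
  filter(units > 5) %>%               # rows: keep sales of more than 5 units
  mutate(revenue = units * price) %>% # columns: add a derived variable
  select(region, units, revenue) %>%  # columns: keep only the ones we need
  arrange(desc(revenue))              # rows: sort from highest to lowest revenue

# Collapse the filtered rows into a one-row summary
overview <- top_sales %>%
  summarize(n_sales = n(), total_revenue = sum(revenue))

print(top_sales)
print(overview)
```

Each verb takes a data frame as its first argument and returns a new data frame, which is what lets the `%>%` pipe pass the result of one step straight into the next.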