How to Prepare Datasets for Analysis

6 Lessons Learned Tidy Data

In Tidy Data Hadley Wickham outlines a series of principles those working with data can employ to do so more effectively.

Original notes these lessons are from.

1. Tidy Data is a Framework

Lesson Learned

2. Formatting of Tidy Data

  1. Each variable forms a column.
  2. Each observation forms a row.
  3. Each type of observational unit forms a table

Lesson Learned

3. Get the Most out of R Using Tidy Data

Lesson Learned

4. Four Fundamental Verbs of Data Manipulation

Transform: adding or modifying variables. These modifications can involve either a single variable (e.g., log-transformation), or multiple variables (e.g., computing density from weight and volume).

Aggregate: collapsing multiple values into a single value (e.g., by summing or taking means).

Sort: changing the order of observations.

Lesson Learned

5. Manipulating Data in R

Lesson Learned

6. Tidy visualisation tools only need to be input-tidy as their output is visual.

Thank you for Reading

You’ll learn how to understand, prepare, and protect yourself from the risks of Covid-19. To create a plan for the precautions needed to safeguard you and your loved ones

Connect: Email | Twitter | LinkedIn

Originally published at https://stevenlmiller.me on December 23, 2020.

--

--

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Steven L. Miller

Steven L. Miller

48 Followers

Helping investors hack through the weeds to find the crypto gems at cryptojungle.io. Musings at stevenlmiller.me