Working with multiple data frames

Lecture 6

Dr. Elijah Meyer + Konnie Huang

Duke University
STA 199 - Fall 2022

September 14, 2022

Checklist

– Clone ae-05

– Watch ae-04 video + turn in ae-04 by Thursday 11:59pm

– Turn in hw1 by Thursday 11:59pm on Gradescope

R4DS: Chp 20 - Joins - Sections 20.1 - 20.4

– Any questions from prepare materials?

  • Clone your ae-04 repo.

Announcements

Videos

– Requesting videos for missed classes

AEs

– Ae: 80% rule + keys posted after deadline

Homework + Labs

– Late work policy

– Drop 1

Goals

– Understand join functions

– Join multiple data frames

Joining datasets

Data merging is the process of combining two or more data sets into a single data set. Most often, this process is necessary when you have raw data stored in multiple files, worksheets, or data tables, that you want to analyze together.

AE-05

Clone ae-05

Recap of AE

– This is important! Data are messy!

– Think carfully about the join you use