background-color: black background-position: center background-size: contain background-image: url(../images/kelp_minecraft.jpg) class: middle, inverse # From kelp foRests to coRonavirus ## an inteRactive jouRney in doing science<br/>with R ### Jarrett E.K. Byrnes ### UMass Boston --- class: middle ## Thanks to Karthik Ram, Kyle Edwards, Jay Stachowicz, and others who facilitated or tolerated grad school R chicanery. ## Thanks to NCEAS fellows and working group collaborators. ## Thanks to students who have put up with learning R and data science from me. ##Thanks to all who have ever contributed to R in any way. --- class: inverse .center[data:image/s3,"s3://crabby-images/6b0ca/6b0ca8bb68064d3ee2fc72c4bb337f6b9f6a8e56" alt=""] # Hello! I will be your tour guide through my mind today! --- class: center, middle # Your methods guide your science --- class: center, middle # Do not stop learning. Ever. --- class: middle # Or really.... this is to your benefit! .center[ data:image/s3,"s3://crabby-images/6e866/6e866834d38ef359daa0b8b1ba6a79342d3ea814" alt=""<!-- --> ] .left[Credit @ashten28] --- # How I started R .center[ data:image/s3,"s3://crabby-images/007d4/007d4d2bf1175ecb92c9fcc85f059d6def929bcf" alt="" ] --- .pull-left[data:image/s3,"s3://crabby-images/ad3d2/ad3d202aa3f1dac65ef5d5804797d07d6c4005b8" alt=""] -- .pull-right[data:image/s3,"s3://crabby-images/9df54/9df5479d5252022f78c17e233164dba9022c453d" alt=""] ??? The mixed model course with a book that integrated R not only opened up my quantative skills, but the idea that R could be used as a teaching tool blew my mind. I also started to learn how you could use R to understand complex concepts. That for many students - and many on this call - all of the equations in the world won't make a fig. But seeing something in action on a computer screen, or in a boiled down notation that made a computer do something - that was a real way to learning. --- # More On the Job Learning... -- .image-400[.center[data:image/s3,"s3://crabby-images/a44b5/a44b5739562829aa46689a2d2686ac5b947d7f1c" alt=""]] --- # More On the Job Learning... .image-80[.center[data:image/s3,"s3://crabby-images/f0387/f0387ca9b48cf882c3a3c37f63701a04e680bcd5" alt=""]] --- class: center, middle, inverse data:image/s3,"s3://crabby-images/32134/32134ad96c04d619fbfcf65ed821ee40b04474e5" alt="" --- background-color: black background-position: center background-size: contain background-image: url(../images/darkside.jpg) ??? https://www.entertainart.com/star-wars/the-power-of-the-dark-side/ --- background-color: black background-position: center background-size: contain background-image: url(../images/clabuesch_Nereo_forest-1.JPG) class: bottom, inverse, left .small[Photo by Steve Clabuesch] --- background-color: black background-position: center background-size: contain background-image: url(../images/nceas_kelp_working_group.jpg) class: bottom, inverse # NCEAS Kelp-Climate Working Group, 2012 --- background-color: black class: inverse # Our Goal .center[data:image/s3,"s3://crabby-images/c6c22/c6c22421b4b90c5c3c99836686d8553aec2138d4" alt=""] --- class: middle # Reality Check - Other People's Data is Messy .center[data:image/s3,"s3://crabby-images/5972f/5972f80e316fd95abd06d5987ae3f741c3a2604c" alt=""] --- class: middle # Reality Check - Other People's Data is Messy .center[data:image/s3,"s3://crabby-images/13559/13559cbc54857f2eaf819a6147913b16c87cf29b" alt=""] --- class: middle # Reality Check - Other People's Data is Messy .center[data:image/s3,"s3://crabby-images/8de1b/8de1b33215dbbe922fb209e8325313c9469114a3" alt=""] --- class: middle # Reality Check - Other People's Data is Messy .center[data:image/s3,"s3://crabby-images/7ce33/7ce3362a2b8b7e2fe8e4624034de259cb47db18c" alt=""] --- # Our Saviour .center[.image-80[data:image/s3,"s3://crabby-images/9bb51/9bb51be110b93bacfe7eedc3965340b349eb8218" alt=""]] --- # In Which We Are Baptized as Data Scientists .pull-left[data:image/s3,"s3://crabby-images/2e2bb/2e2bbe667ccd5aa66be3c173da537623ddd64892" alt="" data:image/s3,"s3://crabby-images/cb977/cb977953749d74a449775563823cadd5f4ed01d2" alt="" ] -- .pull-right[ data:image/s3,"s3://crabby-images/a8a2c/a8a2ce1b8de271f9c566234c36ad3b558995cedd" alt="" data:image/s3,"s3://crabby-images/052be/052be73aa63521d2a92e3d3bfc56ea908c44b2bf" alt="" ] --- class: middle # Data manipulation with plyr ```r ######### 3) Create Derived Data Set at lowest sample unit #remove the date column, as it screws up ddply kelpcover$Date <- as.character(kelpcover$Date) temporal_data <- ddply(kelpcover, c("Sample.Year", "Sample.Month", "Quad"), summarise, Stipe.Density.num.per.sq.m = length(Quad), Individual.Density.num.per.sq.m = length(Quad), Biomass.kg.wet.per.sq.m = sum(St.wght)+sum(Fr.wght), Depth.m = max(Depth, na.rm=T)) #correct for rows that have 0 plants zeroPlants <- which(temporal_data$Biomass.kg.wet.per.sq.m==0) temporal_data$Individual.Density.num.per.sq.m[zeroPlants] <- 0 temporal_data$Stipe.Density.num.per.sq.m[zeroPlants] <- 0 names(temporal_data)[names(temporal_data)=="Quad"] <- "Sample.ID" ``` --- class: middle # Data manipulation with plyr ```r ######### 3) Create Derived Data Set at lowest sample unit *#remove the date column, as it screws up ddply kelpcover$Date <- as.character(kelpcover$Date) temporal_data <- ddply(kelpcover, c("Sample.Year", "Sample.Month", "Quad"), summarise, Stipe.Density.num.per.sq.m = length(Quad), Individual.Density.num.per.sq.m = length(Quad), Biomass.kg.wet.per.sq.m = sum(St.wght)+sum(Fr.wght), Depth.m = max(Depth, na.rm=T)) #correct for rows that have 0 plants zeroPlants <- which(temporal_data$Biomass.kg.wet.per.sq.m==0) temporal_data$Individual.Density.num.per.sq.m[zeroPlants] <- 0 temporal_data$Stipe.Density.num.per.sq.m[zeroPlants] <- 0 names(temporal_data)[names(temporal_data)=="Quad"] <- "Sample.ID" ``` --- class: middle # Data grouping with plyr ```r ######### 3) Create Derived Data Set at lowest sample unit #remove the date column, as it screws up ddply kelpcover$Date <- as.character(kelpcover$Date) *temporal_data <- ddply(kelpcover, c("Sample.Year", * "Sample.Month", * "Quad"), summarise, Stipe.Density.num.per.sq.m = length(Quad), Individual.Density.num.per.sq.m = length(Quad), Biomass.kg.wet.per.sq.m = sum(St.wght)+sum(Fr.wght), Depth.m = max(Depth, na.rm=T)) #correct for rows that have 0 plants zeroPlants <- which(temporal_data$Biomass.kg.wet.per.sq.m==0) temporal_data$Individual.Density.num.per.sq.m[zeroPlants] <- 0 temporal_data$Stipe.Density.num.per.sq.m[zeroPlants] <- 0 names(temporal_data)[names(temporal_data)=="Quad"] <- "Sample.ID" ``` --- class: middle # Data summarization with plyr ```r ######### 3) Create Derived Data Set at lowest sample unit #remove the date column, as it screws up ddply ] kelpcover$Date <- as.character(kelpcover$Date) temporal_data <- ddply(kelpcover, c("Sample.Year", "Sample.Month", "Quad"), * summarise, Stipe.Density.num.per.sq.m = length(Quad), Individual.Density.num.per.sq.m = length(Quad), Biomass.kg.wet.per.sq.m = sum(St.wght)+sum(Fr.wght), Depth.m = max(Depth, na.rm=T)) #correct for rows that have 0 plants zeroPlants <- which(temporal_data$Biomass.kg.wet.per.sq.m==0) temporal_data$Individual.Density.num.per.sq.m[zeroPlants] <- 0 temporal_data$Stipe.Density.num.per.sq.m[zeroPlants] <- 0 names(temporal_data)[names(temporal_data)=="Quad"] <- "Sample.ID" ``` --- class: middle # Still no easy rename... ```r ######### 3) Create Derived Data Set at lowest sample unit #remove the date column, as it screws up ddply kelpcover$Date <- as.character(kelpcover$Date) temporal_data <- ddply(kelpcover, c("Sample.Year", "Sample.Month", "Quad"), summarise, Stipe.Density.num.per.sq.m = length(Quad), Individual.Density.num.per.sq.m = length(Quad), Biomass.kg.wet.per.sq.m = sum(St.wght)+sum(Fr.wght), Depth.m = max(Depth, na.rm=T)) #correct for rows that have 0 plants zeroPlants <- which(temporal_data$Biomass.kg.wet.per.sq.m==0) temporal_data$Individual.Density.num.per.sq.m[zeroPlants] <- 0 temporal_data$Stipe.Density.num.per.sq.m[zeroPlants] <- 0 *names(temporal_data)[names(temporal_data)=="Quad"] <- "Sample.ID" ``` --- # reshape2 as a "clever" way of adding 0s ```r # Ensure there are zero entries for all species # Melt to long first dat2<-melt(dat, measure.vars='Count') # Cast to wide # absences are treated as zero counts, not missing surveys # and will fill in 0 dat3<-dcast(dat2, ... ~ Taxon, fill=0) # re-melt to long for data standard dat4<-melt(dat3, id.vars = names(dat3)[1:6], measure.var = names(dat3)[8:9], variable.name = 'Taxon', value.name = 'Count') ``` --- background-color: black background-position: center background-size: contain background-image: url(../images/hailey_born.jpeg) --- background-color: black background-position: center background-size: contain background-image: url(../images/hailey_sleeping.jpeg) --- # New Saviours! .pull-left[data:image/s3,"s3://crabby-images/fa5db/fa5db1e19f64ac3a9899e2063f5bb98a775f5b9e" alt=""] .pull-right[data:image/s3,"s3://crabby-images/10aac/10aac95aa07be5469227cf59923785b0a4328377" alt=""] --- # I am now a full convert .center[data:image/s3,"s3://crabby-images/cdb84/cdb848e68568f7c9f7d04a8f7c5d329e22c2893c" alt=""] --- # Floating Forests data:image/s3,"s3://crabby-images/e3f89/e3f89fa0f8d67e47cb7c060adc84ee074f7552aa" alt="" http://floatingforests.org --- # Circle the Forests! The Data will be Easy, Right? data:image/s3,"s3://crabby-images/74462/74462e1b3dc631de5a6abc36433fcb45ec25904b" alt="" http://floatingforests.org --- # Floating Forets - the data - Gigabytes of data -- - JSON shoved into a CSV -- - The individual classifications: .font-courier[ {""path"":""M 408.296875,142 L 406.296875,141 L 400.296875,137 L 390.296875,130 L 365.296875,120 L 335.296875,106 L 308.296875,96 L 294.296875,92 L 282.296875,91 L 270.296875,90 L 260.296875,90 L 255.296875,93 L 245.296875,105 L 237.296875,122 L 231.296875,139 L 222.296875,160 L 217.296875,179 L 215.296875,195 L 214.296875,206 L 214.296875,216 L 222.296875,227 L 234.296875,235 L 248.296875,244 L 254.296875,247 L 261.296875,250 L 264.296875,251 L 265.296875,252 Z"",""tool"":0,""frame"":0,""details"":[],""tool_label"":""Marking tool""} ] --- class:middle, center .image-50[data:image/s3,"s3://crabby-images/6785a/6785acc163bc5f8ba1402614edfe9589e97371b5" alt=""] --- # Complex Geospatial Visulization and Time-Series Analysis now Possible .center[data:image/s3,"s3://crabby-images/23d17/23d176bffe6c778642a865de3409cafb628de37f" alt=""] --- class: center, middle # Your methods guide your science --- class: center, middle # Do not stop learning. Ever. --- # Teaching Keeps you Nimble! .center[data:image/s3,"s3://crabby-images/db01e/db01e9cd62e2a319cce04563ed17a0c7f7bb8d6a" alt=""] --- # Teaching Keeps you Fresh! .center[data:image/s3,"s3://crabby-images/9e2ae/9e2ae3dffe3d9e4c70e6a193db263490084a6414" alt=""] --- # New R Packages Find You Even When You Don't Expect It .center[data:image/s3,"s3://crabby-images/eb7a9/eb7a96311d8f07cab707b65d6a9b469fb40d5dd8" alt=""] .bottom[.left[https://jebyrnes.shinyapps.io/logistic_growth/]] --- # Teaching Makes You Current! .center[data:image/s3,"s3://crabby-images/28699/28699cd57f1dad9b41ec9437d355b1cdf9f2d98a" alt=""] ??? twitter as helper as well --- class: middle, center, inverse .image-50[data:image/s3,"s3://crabby-images/cb4b6/cb4b608ec6681e886ea57473659197c97c8bfcbf" alt=""] --- class: middle, center, inverse data:image/s3,"s3://crabby-images/2ea16/2ea16d0b460d0b35759e102649bf2b3ec94e7916" alt="" --- class: middle, inverse # Making R Relevant to Students .center[data:image/s3,"s3://crabby-images/95ceb/95cebac3d5f6d154e1d1b3b18841430e0de9686b" alt=""] --- class: middle, inverse # Let Students use R to Explore Relevant Data .center[data:image/s3,"s3://crabby-images/a655c/a655c5b85308b372bdfc4666c1bcd3b2709b25a4" alt=""] --- class: middle, inverse # Relevant Data Speeds Learning New Data Science Tools .center[data:image/s3,"s3://crabby-images/4ee74/4ee74405d5e27717804bf930a06173765e29a0f0" alt=""] --- class: middle, inverse # Massive Returns on Learning R in the Long-Term .center[data:image/s3,"s3://crabby-images/dd78e/dd78e6efe76b905d87f7a8ca0fa3d81401ec65b2" alt=""] --- # Massive Returns on Learning R in the Long-Term .center[data:image/s3,"s3://crabby-images/f1fa3/f1fa3a5b1a09ba697d893c9ae8a5ae871a872ca1" alt=""] --- class: inverse, middle # sometimes...seRendipity .pull-left[data:image/s3,"s3://crabby-images/3227e/3227e1b48bc829361a71f1ec25ed73a0432b32dc" alt=""] -- .pull-right[data:image/s3,"s3://crabby-images/f5910/f5910878b4f53a7ec2027674fb1952f26bc3d84a" alt=""] --- class: inverse, middle # The Covid19R Project Team .center[data:image/s3,"s3://crabby-images/45672/45672878f3c7cd1e96c4b11a0a1bc6262ff997e1" alt=""] --- class: inverse, middle # Tidy Covid-19 Data .center[data:image/s3,"s3://crabby-images/f116f/f116f0dd3b02d09b4f3c7b591644120a1ec34aa5" alt=""] --- class: inverse, middle # Making a Community Project #### https://covid19r.github.io/documentation/ .center[data:image/s3,"s3://crabby-images/cc53c/cc53cda18aac924ff9d97ac9d5bcdcdc1c2b6762" alt=""] --- class: inverse, middle # Building Resources for Novices to Learn Package Building #### https://github.com/Covid19R/covid19_package_template .center[data:image/s3,"s3://crabby-images/d6cd1/d6cd1c31bcbc92cdaec6a6115e6047f9374f26b7" alt=""] --- class: inverse, middle # Enabling a Community #### https://github.com/Covid19R .center[data:image/s3,"s3://crabby-images/14564/1456481281d99856a469fdfd8c6e71a7db79f08d" alt=""] ??? This has been transformative for me, and I don't know where it is going to go from here. --- class: center, middle # Your methods guide your science --- class: center, middle # Do not stop learning. Ever. --- class: middle #As I started writing this talk... ```r > install.packages("xaringan") also installing the dependency ‘servr’ trying URL 'https://cran.rstudio.com/bin/macosx/el-capitan/contrib/3.6/servr_0.16.tgz' Content type 'application/x-gzip' length 85778 bytes (83 KB) ================================================== downloaded 83 KB ``` --- class: middle # Questions? .center[ data:image/s3,"s3://crabby-images/747a5/747a5dfc0e184b8da4648b440a07c76c3c1bd19f" alt=""<!-- --> ] .left[Credit @ashten28] ---