rOpenSci | Community Call - Reproducible Workflows at Scale with drake

Community Call - Reproducible Workflows at Scale with drake

Ambitious workflows in R, such as machine learning analyses, can be difficult to manage. A single round of computation can take several hours to complete, and routine updates to the code and data tend to invalidate hard-earned results. You can enhance the maintainability, hygiene, speed, scale, and reproducibility of such projects with the drake R package. drake resolves the dependency structure of your analysis pipeline, skips tasks that are already up to date, executes the rest with optional distributed computing, and organizes the output so you rarely have to think about data files. This talk demonstrates how to create and maintain a realistic machine learning project using drake-powered automation.

This 1-hour Community Call will include a presentation by drake developer, Will Landau, and at least 20 minutes for Q & A.

🎤 See speaker bio below.

🔗 Join the Call

🕘 Tuesday, September 24, 2019, 9-10 AM PDT (☎️ Everyone is welcome. No RSVP needed. (This event has passed)

🎥 After the Call, we’ll post the video and collaborative notes on the archive page.

🔗 Resources

🔗 Speaker

headshot of Will Landau

Will Landau is the creator and maintainer of the drake R package. Will is a Research Scientist at Eli Lilly and Company, where he develops methods and software for statisticians. Prior to joining Lilly, he earned his PhD in Statistics from Iowa State University.

Will on GitHub, Twitter, Website, rOpenSci