Scalable data science and deep learning with R

We provide an overview of the tools available to data scientists who use R with Spark and TensorFlow, then discuss the latest developments at the intersection of these ecosystems. We organize the discussion around a diverse selection of use cases, such as ad hoc analysis on distributed datasets, building machine learning models for low-latency scoring, and developing deep learning models for research, and demonstrate sample workflows. Several open-source R packages will be featured, including sparklyr, keras, and tensorflow.


Kevin is a software engineer at RStudio, where he develops open-source packages for big data analytics and machine learning. He has held data science positions across several industries and has experience with the end-to-end analytics process, from data engineering to model deployment and change management. Before joining RStudio, he was a principal data scientist at Honeywell and held roles at KPMG and Citi.
