Abstract: Data analysts and data scientists often struggle with getting data into a usable form. Indeed research shows that it can consume up to 90% of a data scientist's time preparing data. In this interactive workshop, you will learn how to use Apache Drill to rapidly explore a wide variety of data, from a variety of sources without having to write code.
Lesson One: Querying Basic Data
In this module, you will become familiar with Drill and learn how to query simple tabular data sources such as Excel, CSV and other similar files.
Lesson Two: Querying Complex Data Sources
You will learn how to query complex data sources such as JSON, Parquet, and others directly from Drill. We will also cover advanced query techniques such as windowing functions, dealing with nested data, and joining data sets.
Lesson Three: Connecting other Data Sources
You will learn how to configure Drill to directly query other source systems such as relational databases, APIs, Splunk, and others and merge this data with data from your local file system.
Users should be comfortable with SQL
Bio: Charles Givre recently joined JP Morgan Chase works as a data scientist and technical product manager in the cybersecurity and technology controls group. Prior to joining JP Morgan, Mr. Givre worked as a lead data scientist for Deutsche Bank. Mr. Givre worked as a Senior Lead Data Scientist for Booz Allen Hamilton for seven years where he worked in the intersection of cyber security and data science. At Booz Allen, Mr. Givre worked on one of Booz Allen's largest analytic programs where he led data science efforts and worked to expand the role of data science in the program. Mr. Givre is passionate about teaching others data science and analytic skills and has taught data science classes all over the world at conferences, universities and for clients. Mr. Givre taught data science classes at BlackHat, the O'Reilly Security Conference, the Center for Research in Applied Cryptography and Cyber Security at Bar Ilan University. He is a sought-after speaker and has delivered presentations at major industry conferences such as Strata-Hadoop World, Open Data Science Conference and others. One of Mr. Givre's research interests is increasing the productivity of data science and analytic teams, and towards that end, he has been working extensively to promote the use of Apache Drill in security applications and is a committer and PMC Member for the Drill project. Mr. Givre teaches online classes for O'Reilly about Drill and Security Data Science and is a coauthor for the O'Reilly book Learning Apache Drill. Prior to joining Booz Allen, Mr. Givre, worked as a counterterrorism analyst at the Central Intelligence Agency for five years. Mr. Givre holds a Masters Degree in Middle Eastern Studies from Brandeis University, as well as a Bachelors of Science in Computer Science and a Bachelor's of Music both from the University of Arizona. Mr. Givre blogs at thedataist.com and tweets @cgivre.