DATA SCIENCE LEARNATHON. FROM RAW DATA TO DEPLOYMENT: THE DATA SCIENCE CYCLE WITH KNIME

Abstract: This will be a Learnathon-style workshop. A Learnathon is a workshop where we allow ourselves the luxury of learning new tools and new techniques.

For this particular event, we will cover the whole data science cycle, from the raw data to the final application on a production machine. That is: data access, data blending, data preparation, model training, optimization, testing, and finally deployment. The tool of choice for this Learnathon will be KNIME Analytics Platform.

KNIME Analytics Platform is an open, open-source, GUI-driven data analytics platform that covers all your data needs, from data import to final deployment. Being open, KNIME Analytics Platform offers extensive integrations with, and IDE environments for, R, Python, SQL, and Spark.

After an initial introduction to the tool and to the data science cycle, we will split into groups. Each group will focus on one of three aspects of the data science cycle:
- Just pure raw data. Data Access and Data Preparation
- Machine Learning. Which model shall I use? Which parameters?
- I have a great model. Now what? The deployment phase

Bio: Kathrin Melcher is a Data Scientist at KNIME. She holds a Master's degree in Mathematics from the University of Konstanz, Germany. She joined the Evangelism team at KNIME in 2017. She has a strong interest in data science, machine learning, and algorithms, and enjoys teaching and sharing her knowledge of these topics.

Open Data Science Conference