GitOps and Multi-Tenancy Combined for an Enterprise Data Science Experience on Kubeflow
GitOps and Multi-Tenancy Combined for an Enterprise Data Science Experience on Kubeflow

Abstract: 

Kubeflow is a machine learning (ML) platform built on top of Kubernetes. The Kubeflow project is dedicated to making deployments of ML workflows on Kubernetes simple, portable and scalable.

GitOps is the methodology of defining all infrastructure as declarative code and tracking it using git. In Kubeflow and Kubernetes, GitOps is a best practice to achieve immutable, reproducible infrastructure that can scale according to an organization’s needs.

In this session, you will: 1) learn how to apply GitOps in order to deploy and manage a Kubeflow cluster; 2) learn how to enable multiple users to work together on the same cluster in a secure and isolated way, with authentication and authorization best practices; 3) follow a data scientist’s journey to running a hyperparameter tuning optimization workflow; 4) scale up your workloads in a UI driven environment.

Session Outline
* Lesson 1: GitOps and Declarative Infrastructure

Revisit the declarative nature of Kubernetes and apply GitOps best practices to get immutable, trackable and reproducible infrastructure. Deploy and manage Kubeflow using the GitOps methodology.

* Lesson 2: Multi-User Kubeflow

Learn how Kubeflow and Kubernetes enforce authentication and authorization. Then see this knowledge applied in practice in order to enable multiple users to share the same Kubeflow cluster in a secure and isolated manner.

* Lesson 3: Secure and Isolated User Workflows

Follow the steps of a data scientist deploying their pipelines in a secure and isolated manner. Learn how secrets are securely distributed and injected into the user’s environment. Try out an end-to-end user workflow right out of your Jupyter Notebook, by leveraging Kale, the easiest way to go from Notebook to Pipeline.

Background Knowledge
Attendees should be familiar with Kubernetes.

Bio: 

Yannis is a software engineer at Arrikto, working with Kubeflow, the cloud-native machine learning platform built on Kubernetes. He loves contributing to open source projects and has authored the Cassandra Operator in Rook and the official Scylla Operator, which he is currently maintaining.

Open Data Science

 

 

 

Open Data Science
One Broadway
Cambridge, MA 02142
info@odsc.com

Privacy Settings
We use cookies to enhance your experience while using our website. If you are using our Services via a browser you can restrict, block or remove cookies through your web browser settings. We also use content and scripts from third parties that may use tracking technologies. You can selectively provide your consent below to allow such third party embeds. For complete information about the cookies we use, data we collect and how we process them, please check our Privacy Policy
Youtube
Consent to display content from Youtube
Vimeo
Consent to display content from Vimeo
Google Maps
Consent to display content from Google