ODSC Europe Speakers
For 2022 we will have some of the best and brightest minds speaking at ODSC Europe. ODSC will host more than 150 presenters. Speaker profiles are added weekly. Check back for updates.
Full list will be coming soon
For 2022 we will have some of the best and brightest minds speaking at ODSC Europe. ODSC will host more than 150 presenters. Speaker profiles are added weekly. Check back for updates.
Full list will be coming soon
Gaël Varoquaux is a research director working on data science and health at Inria (French Computer Science National research). His research focuses on using data and machine learning for scientific inference, with applications to health and social science, as well as developing tools that make it easier for non-specialists to use machine learning. He has long applied it to brain-imaging data to understand cognition. Years before the NSA, he was hoping to make bleeding-edge data processing available across new fields, and he has been working on a mastermind plan building easy-to-use open-source software in Python. He is a core developer of scikit-learn, joblib, Mayavi and nilearn, a nominated member of the PSF, and often teaches scientific computing with Python using the scipy lecture notes.
Prediction with Missing Values(Tutorial)
Yonina C. Eldar is a Professor in the Department of Math and Computer Science at the Weizmann Institute of Science, Rehovot, Israel, where she heads the center for Biomedical Engineering and Signal Processing. She is also a Visiting Professor at MIT and at the Broad Institute and an Adjunct Professor at Duke University, and was a Visiting Professor at Stanford University. She is a member of the Israel Academy of Sciences and Humanities, an IEEE Fellow and a EURASIP Fellow. She has received many awards for excellence in research and teaching, including the IEEE Signal Processing Society Technical Achievement Award, the IEEE/AESS Fred Nathanson Memorial Radar Award, the IEEE Kiyo Tomiyasu Award, the Michael Bruno Memorial Award from the Rothschild Foundation, the Weizmann Prize for Exact Sciences, and the Wolf Foundation Krill Prize for Excellence in Scientific Research. She is the Editor in Chief of Foundations and Trends in Signal Processing, and serves the IEEE on several technical and award committees. She heads the Committee for Promoting Gender Fairness in Higher Education Institutions in Israel.
John Shawe-Taylor is professor of Computational Statistics and Machine Learning at University College London. He has helped to drive a fundamental rebirth in the field of machine learning, with applications in novel domains including computer vision, document classification, and applications in biology and medicine focussed on brain scan, immunity and proteome analysis. He has published over 250 papers and two books that have attracted over 70000 citations.
He has also been instrumental in assembling a series of influential European Networks of Excellence. The scientific coordination of these projects has influenced a generation of researchers and promoted the widespread uptake of machine learning in both science and industry that we are currently witnessing. He was appointed UNESCO Chair of Artificial Intelligence in November 2018 and is the leading trustee of the UK Charity, Knowledge 4 All Foundation, promoting open education and helping to establish a network of AI researchers and practitioners in sub-Saharan Africa.
Thomas Wiecki is the Chief Executive Officer at PyMC Labs (www.pymc-labs.io), the world’s first Bayesian consultancy. Prior to that Thomas was the VP of Data Science at Quantopian, where he used probabilistic programming and machine learning to help build the world’s first crowdsourced hedge fund. He is an author of the popular PyMC3 package — a probabilistic programming framework written in Python. He holds a PhD from Brown University.
Daria Stepanova is a research scientist at Bosch Center for Artificial Intelligence. Her research interests include Knowledge Representation and Reasoning with a special focus on the automatic acquisition of rules from structured knowledge. Previously Daria was a senior researcher at Max Plank Institute for Informatics (Germany), where she was heading a group on Semantic Data. Daria got her diploma degree in Applied Computer Science from the Department of Mathematics and Mechanics of St. Petersburg State University (Russia) in 2010 and a PhD in Computational Logic from Vienna University of Technology (Austria) in 2015. Before starting her PhD she worked as a visiting researcher at the School of Computing Science at Newcastle University (UK) in an industrially-oriented project.
Rule Induction and Reasoning in Knowledge Graphs(Tutorial)
Alex Peattie is the co-founder and CTO of Peg, a technology platform helping multinational brands and agencies to find and work with top YouTubers. Peg is used by over 1500 organisations worldwide including Coca-Cola, L’Oreal and Google.
An experienced digital entrepreneur, Alex spent six years as a developer and consultant for the likes of Grubwithus, Huckberry, UNICEF and Nike, before joining coding bootcamp Makers Academy as senior coach, where he trained hundreds of junior developers. Alex was also a technical judge at this year’s TechCrunch Disrupt conference.
Hearing is Believing: Generating Realistic Speech with Deep Learning(Workshop)
Alan Rutter is the founder of consultancy Fire Plus Algebra and is a specialist in communicating complex subjects through data visualization, writing, and design. He has worked as a journalist, product owner, and trainer for brands and organizations including Guardian Masterclasses, WIRED, Time Out, the Home Office, the Biotechnology and Biological Sciences Research Council, and Liverpool School of Tropical Medicine.
Laurence Moroney leads AI Advocacy at Google, working with the Google AI Research and product development teams. He’s the best-selling author of ‘AI and Machine Learning for Coders,’ as well as the instructor on the Fundamentals of TinyML course at HarvardX, and the popular TensorFlow specializations with deeplearning.ai and Coursera. He’s passionate about empowering software developers to succeed in Machine Learning, democratizing AI as a result. Laurence is based on Washington State in the USA.
A Hands-on Guide to Machine Learning with TensorFlow(Tutorial)
Dr. Anand S. Rao is the Global Artificial Intelligence Leader for PwC. He is also the leader of PwC’s AI and Emerging Technology practice. With over 35 years of industry and consulting experience, Anand leads a team of practitioners who advise C-level executives and implement advanced analytics and AI-based solutions on a variety of strategic, operational, and ethical use cases. With his PhD and research career in Artificial Intelligence and his subsequent experience in management consulting he brings business domain knowledge, software engineer expertise, and statistical expertise to generate unique insights into the practice of ‘data science’.
Prior to joining management consulting, Anand was the Chief Research Scientist at the Australian Artificial Intelligence Institute. He received his PhD from University of Sydney (with a University Postgraduate Research Award-UPRA) in 1988 and an MBA (with Award of Distinction) from Melbourne Business School in 1997. Anand has also co-edited four books on Intelligent Agents and has published over fifty papers in Computer Science and Artificial Intelligence in major journals, conferences, and workshops.
He has received widespread recognition for his extraordinary contributions in the field of consulting and Artificial Intelligence Research. He has received the Most Influential Paper Award for the Decade in 2007 from the Autonomous Agents & Multi-Agent Systems organization for his contribution on the Belief-Desire-Intention Architecture; MBA Award of Distinction from Melbourne Business School, 1997 and University Postgraduate Research Award (UPRA) from University of Sydney, 1985; Distinguished Alumnus Award from Birla Institute of Technology and Science, Pilani, India; He was recognized as one of Top 50 Data & Analytics professionals in USA and Canada by Corinium; one of Top 50 professionals in InsureTech; one of Top 25 Technology Leaders in Consulting; and has won a number of awards for his academic and business papers. Anand is an Adjunct Professor in BITS Pilani’s APPCAIR AI Center. He also serves on the Advisory Board of Oxford University’s Institute for Ethics in AI, World Economic Forum’s Global AI Council, OECD’s Network of Experts on AI (ONE), OECD’s AI Compute initiative, Advisory Board of Northwestern’s MBAi program, Responsible AI Institute, Nordic AI Institute, and International Congress for the Governance of AI. Anand Rao can be contacted on any of the following channels: Linkedin: https://www.linkedin.com/in/anandsrao/ Twitter:@AnandSRao Medium: https://anandsrao.medium.com/ Semantic Scholar: https://www.semanticscholar.org/author/Anand-Srinivasa-Rao/145946928
Hadrien Jean is a machine learning scientist working at My Medical Assistent where he is developing deep learning models in the medical domain. He wrote the book Essential Math for Data Science (https://www.essentialmathfordatascience.com/) aimed at helping people to get the math needed in data science from a coding perspective. He previously worked at Ava on speech diarization. He also worked on a bird detection project using deep learning. He completed his Ph.D. in cognitive science at the École Normale Supérieure (Paris, France) on the topic of auditory perceptual learning with a behavioral and electrophysiological approach. He has published a series of blog articles aiming at building intuition on mathematics through code and visualization (https://hadrienj.github.io/posts/).
Introduction to Linear Algebra for Data Science and Machine Learning With Python(Bootcamp)
Guglielmo is a Biomedical Engineer with an extensive background in Software Engineering and Data Science applied to different contexts, such as Biotech Manufacturing, Healthcare and DevOps, just to mention the latest, and a lifelong learner. Currently busy unlocking business value through Deep Learning projects, mostly in Computer Vision (not restricted to this field by the way).
He has been recognized as DataOps Champion at the Streamsets DataOps Summit 2019 and awarded as one of the Top 50 Tech Visionaries at the 2019 Dubai Intercon Conference.
He is also an international speaker and author of the following book: Hands-on Deep Learning with Apache Spark @Packt https://www.packtpub.com/big-data-and-business-intelligence/hands-deep-learning-apache-spark
Not Just Deep Fakes: Applications of Visual Generative Models in Pharma Manufacturing(Tutorial)
Leonardo De Marchi holds a Master in Artificial intelligence and has worked as a Data Scientist in the sports world, with clients such as the New York Knicks and Manchester United, and with large social networks, like Justgiving. His previous experience includes Head of Data Science and Analytics in Bumble, the largest dating site with over 500 million users, heading the team through an acquisition and an IPO. He is also the lead instructor at ideai.io, a company specialized in Reinforcement Learning, Deep Learning and Machine Learning training. He is also a contractor for several companies and for the European Commission, as an expert in AI and Machine Learning. As an author he wrote “Hands On Deep Learning” and he authored an online training course for O’Reilly, Introduction to Reinforcement Learning. In the academic world, he also helped set up the PhD center on Interactive Artificial Intelligence and will take part in the Inner Assessment Board to assign funding to Irish research in AI.
NLP Fundamentals(Training)
Sara is a Senior Research Associate in Biomedical Data Science and University Research Lecturer at the University of Oxford, where she is the Machine Learning Lead in the Centre for Statistics in Medicine. She has 12 years of experience in machine learning, signal processing, and intelligent remote monitoring research, with applications in biomedical and planetary health informatics. Sara has served on the NASA Frontier Development Lab Artificial Intelligence Panel and the NASA Climate Challenge Big Think. She is a National Geographic Society Explorer in Tracking Plastic Pollution with Remote Monitoring and Machine Learning. Sara is also a University of Oxford Ambassador for Women in Data Science.
Dipanjan (DJ) Sarkar is a data science consultant and published author, and was recognized as a Google Developer Expert in Machine Learning by Google in 2019. He currently works as a lead data science consultant at Schaffhausen Institute of Technology Academy, Zurich. Dipanjan has led advanced analytics initiatives working with Fortune 500 companies like Intel, Applied Materials, Red Hat / IBM. He works on leveraging data science, machine learning and deep learning to build large- scale intelligent systems. Dipanjan also works as an independent consultant, mentor and AI advisor in his spare time collaborating with multiple universities, organizations and startups across the globe. His passion includes solving challenging data problems as well as educating and helping people upskill in all things data. Dipanjan has also been recognized as one of the top ten Data Scientists in India in 2020, 40 under 40 Data Scientists, 2021 and Top 50 AI Thought Leaders by Global AI Hub, Switzerland. In his spare time he loves reading, gaming, watching interesting documentaries, football. He is also a strong supporter of open-source and publishes his code and analyses from his books, articles and experience on GitHub at https://github.com/dipanjanS and LinkedIn at https://www.linkedin.com/in/dipanzan
Advanced NLP: Deep Learning and Transfer Learning for Natural Language Processing(Training)
Nicole is a Data Scientist & Quant and Data Engineer currently working at impactvise as Data Science and Technology Lead and at quantmate as Quant. She has over 8 years of experience leading technology projects. She additionally reviews machine learning books and online courses for Manning Publications. Her research interests include time series prediction and natural language processing. She is dedicated to showing others how to succeed in machine learning and is committed to making STEM more attractive to women.
Dynamic and Context-Dependent Stock Price Prediction Using Attention Modules and News Sentiment(Talk)
Stefanie Molin is a data scientist and software engineer at Bloomberg in New York City, where she tackles tough problems in information security, particularly those revolving around anomaly detection, building tools for gathering data, and knowledge sharing. She is also the author of “Hands-On Data Analysis with Pandas,” which is currently in its second edition. She holds a bachelor’s of science degree in operations research from Columbia University’s Fu Foundation School of Engineering and Applied Science. She is currently pursuing a master’s degree in computer science, with a specialization in machine learning, from Georgia Tech. In her free time, she enjoys traveling the world, inventing new recipes, and learning new languages spoken among both people and computers.
Ville has been developing infrastructure for machine learning for over two decades. He has worked as an ML researcher in academia and as a leader at a number of companies, including Netflix where he led the ML infrastructure team that created Metaflow, a popular open-source framework for data science infrastructure. He is a co-founder and CEO of Outerbounds, a company developing modern human-centric ML. He is also the author of an upcoming book, Effective Data Science Infrastructure, published by Manning.
Human-Friendly, Production-Ready Data Science with Metaflow(Talk)
Daniel has been teaching machine learning and distributed computing technologies at Data Science Retreat, the longest-running Berlin-based bootcamp, for more than three years, helping more than 150 students advance their careers. He writes regularly for Towards Data Science. His blog post “Understanding PyTorch with an example: a step-by-step tutorial” reached more than 220,000 views since it was published. The positive feedback from the readers motivated him to write the book Deep Learning with PyTorch Step-by-Step, which covers a broader range of topics.
Daniel is also the main contributor of two python packages: HandySpark and DeepReplay. His professional background includes 20 years of experience working for companies in several industries: banking, government, fintech, retail and mobility.
Hugo Bowne-Anderson is a data scientist, writer, educator & podcaster. His interests include promoting data & AI literacy/fluency, helping to spread data skills through organizations and society and doing amateur stand up comedy in NYC. He does many of these at DataCamp, a data science training company educating over 3 million learners worldwide through interactive courses on the use of Python, R, SQL, Git, Bash and Spreadsheets in a data science context. He has spearheaded the development of over 25 courses in DataCamp’s Python curriculum, impacting over 170,000 learners worldwide through my own courses. He hosts and produce the data science podcast DataFramed, in which he uses long-format interviews with working data scientists to delve into what actually happens in the space and what impact it can and does have. He earned PhD in Mathematics from the University of New South Wales, Australia and has conducted biomedical research at the Max Planck Institute in Germany and Yale University, New Haven.
Full-stack Machine Learning for Data Scientists(Tutorial)
Ben Auffarth is the head of data science at loveholidays. With a background and Ph.D. in computational and cognitive neuroscience, he has designed and conducted wet lab experiments on cell cultures, analysed experiments with terabytes of data, run brain models on IBM supercomputers with up to 64k cores. More recently, he’s built production systems processing hundreds of thousands of transactions per day, and trained neural networks on millions of text documents. He’s authored two books on machine learning. When he’s not at work, you might find him on a playground with his young son in West London. He co-founded and is the former president of Data Science Speakers, London.
Time-Series in Python – Preprocessing and Machine Learning(Workshop)
Anuj is a seasoned Machine Learning leader, having incubated and led multiple ML teams. During his career, he has worked in academia, early stage startups as well as Fortune 100. He has successfully led efforts for building commercially viable products in a wide spectrum of verticals and functions. He has authored over a dozen research papers and patents.
Anuj is a major speaker at various international forums. He has delivered technical talks, bootcamps and participated in panel discussions at prestigious forums like ODSC, Anthill inside, PyData DC, Fifth Elephant, ICDCN, PODC, to name a few. He was Co-editor of “Anthill inside 2018”. He is a well-known name in the Machine Learning ecosystem.
Julia Lintern currently works as an instructor for the Metis Data Science Flex Program. Previously, she worked as a Data Scientist for the New York Times. Julia began her career as a structures engineer designing repairs for damaged aircraft. Julia holds an MA in applied math from Hunter College, where she focused on visualizations of various numerical methods and discovered a deep appreciation for the combination of mathematics and visualizations. During certain seasons of her career, she has also worked on creative side projects such as Lia Lintern, her own fashion label.
Introduction to Machine Learning (Bootcamp)
Leonidas (Leo) is a Senior Data Scientist at Astrazeneca. His work is focused around machine learning in oncology, including clinical and non clinical applications. He is also enthusiastic about NLP applications in oncology and how this can be used to leverage patient treatment. He is also a workshop facilitator in the European Leadership University (ELU), NL and has also been a data science educator at DataCamp. He holds a PhD from the University of Warwick, UK. in bioinformatics and ML, an MSc in statistics from Imperial College London, UK and a BSc in Statistics and Insurance Science from the University of Piraeus, GR.
Introduction to Python for Data Analysis(Bootcamp)
Jacob Tomlinson is a senior Python software engineer at NVIDIA with a focus on deployment tooling for distributed systems. His work involves maintaining open source projects including RAPIDS and Dask. RAPIDS is a suite of GPU accelerated open source Python tools which mimic APIs from the PyData stack including those of Numpy, Pandas and SciKit-Learn. Dask provides advanced parallelism for analytics with out-of-core computation, lazy evaluation and distributed execution of the PyData stack. He also tinkers with the open source chatbot automation framework Opsdroid in his spare time. Jacob volunteers with the local tech community group Tech Exeter and lives in Exeter, UK.
GPU Development with Python 101(Workshop)
Christian Leibig is Director of Machine Learning at Vara, leading the development of methods from research to production. He obtained a Ph.D. in Neural Information Processing from the International Max Planck Research School in Tübingen and a diploma in physics from the University of Konstanz. Before joining Vara, he worked as a Postdoctoral Researcher at the University Clinics in Tübingen on the applicability of Bayesian Deep Learning and machine learning applications for the healthcare space for ZEISS and held research and internship positions with Max Planck, LMU Munich and the Natural and Medical Sciences Institute in Reutlingen. The method and software of his PhD work, an unsupervised solution for neural spike sorting from HDCMOS-MEA data is distributed by Multichannel Systems (Harvard Bioscience). His work on applying and assessing uncertainty methods to large scale medical imaging was among the first in the field and awarded with key note speaker invitations. He enjoys all of theory, software engineering, and people management, in particular for applications that have a meaningful impact, such as diagnosing cancer early.
Towards a Scalable Deployment of AI Models via Uncertainty Quantification(Workshop)
Bio Coming Soon!
Bio Coming Soon!
Sentiment Analysis Tricks with Keras, spaCy and Transformers(Tutorial)
Jo-fai (or Joe) has multiple roles (data scientist / evangelist / community manager) at H2O.ai. Since joining the company in 2016, Joe has delivered H2O talks/workshops in 40+ cities around Europe, US, and Asia. Nowadays, he is best known as the H2O #360Selfie guy. He is also the co-organiser of H2O’s EMEA meetup groups including London Artificial Intelligence & Deep Learning – one of the biggest data science communities in the world with more than 11,000 members (https://www.meetup.com/London-Artificial-Intelligence-Deep-Learning/).
Admissible AutoML – Efficient, Fair, and Interpretable Machine Learning with H2O(Workshop)
Thomas Galtier, PhD, is the Data Science and Analytics Director at CybelAngel with a PhD in Applied Mathematics. He is passionate about data, especially “telling stories with the right data”. As a former researcher he believes that reproducibility is a key concept in Machine Learning workflow and that sharing will allow us to tackle tomorrow’s challenges in AI field.
Reproducible and Shareable Notebooks Across a Data Science Team(Talk)
Dan Sullivan is a Principal Data Architect at 4 Mile Analytics. He has a PhD in Genetics, Bioinformatics and Computational Biology and is a former research scientist in infectious disease genomics. He is the author of Google Cloud Certification Study Guides for the Professional Data Engineer, Professional Architect, and Associate Cloud Engineer certifications and an instructor on LinkedIn Learning and Udemy where he provides courses on data science, data modeling, and cloud computing.
Mike Tapi Nzali PhD, is a machine learning engineer at CybelAngel with a PhD in Computer Science. He likes to work in a startup environment, also leading the development of machine learning products from idea to production. He is interested in cutting-edge technology, sharing knowledge and industrialization of Machine Learning.
Reproducible and Shareable Notebooks Across a Data Science Team(Talk)
Alex Athorne is a Research Engineer at Seldon, where he works on open-source libraries for explainability and drift detection. He studied mathematics at Warwick and went on to do a PhD at Imperial College London in dynamical systems. He’s passionate about open-source development and writing about his experiences in ML.
Open Source Explainability – Understanding Model Decisions Using Alibi(Talk)
Tuhin Sharma is Senior Principal Data Scientist at Redhat in the Corporate Development and Strategy group. Prior that he worked at Hpersonix as AI Architect. He also co-founded and has been CEO of Binaize, a website conversion intelligence product for e-commerce SMBs. He received master’s degree from Indian Institute of Technology Roorkee in Computer Science with specialization in Data Mining. He received bachelor’s degree from Indian Institute of Engineering Science and Technology Shibpur in Computer Science. He loves to code and collaborate on open source and research projects. He has 4 research papers and 5 patents in the field of AI and NLP. He is reviewer of IEEE MASS conference in the AI track. He writes deep learning articles for O’reilly with the collaboration with AWS MXNET team. He loves to play TT and Guitar in his leisure time. His favorite quote is “Life is Beautiful”.
Eagleeye: Data Pipeline for Anomaly Detection in Cyber Security(Talk)
Ryan Dawson is a technologist passionate about data. Ryan works with clients on large-scale data and AI initiatives, helping organizations get more value from data. His work includes strategies to productionize machine learning, organizing the way data is captured and shared, selecting the right data technologies and optimal team structures, as well as writing the code to make it happen. He has over 15 years of experience and, as well as many widely read articles about MLOps, software design, and delivery. is author of the Thoughtworks Guide to Evaluating MLOps Platforms.
Dieuwke Hupkes is a research scientist at Facebook AI Research in FAIR. Her work centers around the evaluation of models of natural language processing (NLP), with a specific focus on how such models can show more human-like behaviour, where they fail and what are areas where they should still improved. In the recent past, she has focussed specifically on large language models (LLMs) and neural machine translation (NMT) models.
Evaluating Generalisation in Natural Language Processing Models(Talk)
Shawn is passionate about harnessing the power of data strategy, engineering and analytics in order to help businesses uncover new opportunities. As an innovative technologist with over 13 years experience, Shawn removes technology as a barrier, and broadens the art of the possible for business and product leaders. His holistic view of technology and emphasis on developing and motivating strong engineering talent, with a focus on delivering outcomes whilst minimising outputs, is one of the characteristics which sets him apart from the crowd.
Shawn’s deep technical knowledge includes distributed computing, cloud architecture, data science, machine learning and engineering analytics platforms. He has years of experience working as a consultant practitioner for a variety of prestigious clients ranging from secret clearance level government organizations to Fortune 500 companies.
Danushka Bollegala is a Professor in the Department of Computer Science, University of Liverpool, UK. He obtained his PhD from the University of Tokyo in 2009 and worked as an Assistant Professor before moving to the UK. He has worked on various problems related to Natural Language Processing and Machine Learning. He has received numerous awards for his research excellence such as the IEEE Young Author Award, best paper awards at GECCO and PRICAI. His research has been supported by various research council and industrial grants such as EU, DSTL, Innovate UK, JSPS, Google and MSRA. He is an Amazon Scholar.
Social Biases in Text Representations and their Mitigation(Talk)
With over 10 years of experience (Google, AI-focused startups) with big data and as the CPO and head of customer success at Mona, the leading AI monitoring intelligence company, Itai has a unique view of the AI industry. Working closely with data science and ML teams applying dozens of solutions in over 10 industries, Itai encounters a wide variety of business use-cases, organizational structures and cultures, and technologies used in today’s AI world.
Utilizing Advanced Monitoring Capabilities to Promote Product-oriented Data Science(Tutorial)
Gilad has over 15 years of experience in product management and a solid R&D background. He combines analytical skills and technical innovation with Data Science market experience. Gilad’s passion is to define a product vision and turn it into reality. As Director of Product Management at Iguazio, Gilad manages both the Enterprise MLOps Platform product as well as MLRun, Iguazio’s open source MLOps orchestration framework. Prior to joining Iguazio, Gilad managed several different products at NICE-Actimize, a leading vendor of financial crime prevention solutions, including coverage of machine-learning based solutions, formation of a marketplace and addressing customer needs across different domains. Gilad holds a B.A in Computer Science, M.Sc. in Biomedical Engineering and MBA from Tel-Aviv University.
It worked on my Laptop, now what? Using OS tool MLRun to Automate the Path to Production(Demo Talk)
Karin is currently the leading developer community programming in the Developer Relations team at StarTree. Karin initially began her career in entertainment marketing working with the likes of names like Eminem and Live Nation. She also launched a successful professional women’s network in two major cities in the U.S., organized events for her local Data Science meetup, and helped lead a on-going hackathon to put machine learning in the hands of cancer biologists. Her journey working in data eventually let her to a position as Program Manager for Community Development for the leading graph database in the world, Neo4j. Most recently, she was brought on to StarTree to improve the adoption and success of the overall developer community.
Real-Time Analytics: Going Beyond Stream Processing with Apache Pinot(Workshop)
Jake is currently working as a Senior Product Marketing Manager over ML Lifecycle products at Cloudera. Before joining Cloudera, Jake worked as a Data Scientist and then as a Data Science and Analytics Solution Architect at ExxonMobil. Additionally, he worked as a Senior Data Scientist at FarmersEdge. Before starting his professional career, Jake obtained his bachelor’s and master’s degree from Brigham Young University. When he isn’t working, Jake enjoys skiing, golfing, and spending time with his family in the mountains.
Predicting the Next NBA Champion with Cloudera’s Applied ML Prototypes (AMPs)(Demo Talk)
Soon-Yau Cheong is the founder of Sooner.ai, an AI consulting and training company specialises in image/video generation and manipulation. Past projects include face swapping, portrait cartoonisation, shoes virtual try-on etc. He is well-versed in generative AI techniques which include GANs, autoregressive transformer and diffusion models. He authored the book “Hands-on Image Generation with TensorFlow” which is well-received for its hands-on approach in making difficult mathematical theories easy to understand. Soon-Yau is also currently doing PhD in AI digital media creation at University of Surrey.
Chandini Jain is the CEO/founder of Auquan – a london based fintech using NLP and AI to distill relevant and impactful information from unstructured text. Prior to Auquan, she worked as a derivatives trader at Optiver in Chicago/Amsterdam and Deutsche Bank. At Auquan, she oversee the development of our machine learning strategies.
Rob Magno is a Sales Engineer/Solution Architect at Run:AI based in New Jersey. He has been working in the Docker and Kubernetes space for the past five years. He enjoys tackling the diverse customer challenges that come with orchestrating AI/ML workloads through Kubernetes.
Building the Best AI Infrastructure Stack to Accelerate Your Data Science(Demo Talk)
Felipe is a Data Scientist in WhyLabs. He is a core contributor to whylogs, an open-source data logging library, and focuses on writing technical content and expanding the whylogs library in order to make AI more accessible, robust, and responsible. Previously, Felipe was an AI Researcher at WEG, where he researched and deployed Natural Language Processing approaches to extract knowledge from textual information about electric machinery. He is also a Master in Electronic Systems Engineering from UFSC (Universidade Federal de Santa Catarina), with research focused on developing and deploying fault detection strategies based on machine learning for unmanned underwater vehicles. Felipe has published a series of blog articles about MLOps, Monitoring, and Natural Language Processing in publications such as Towards Data Science, Analytics Vidhya, and Google Cloud Community.
Visually Inspecting Data Profiles for Data Distribution Shifts(Workshop)
Oliver is a software developer and architect from Hamburg, Germany. He has been developing software with different approaches and programming languages for more than 3 decades. Lately, he has been focusing on Machine Learning and its interactions with humans.
How to Teach Our World Knowledge to a Neural Network?(Workshop)
Aoife Cahill is a Natural Language Processing (NLP) expert and a director of AI research at Dataminr, the leading real-time information discovery platform. Since joining in 2021, Aoife has led a team of data scientists focused on the efficient iterative process of developing and evaluating AI technology that supports the expansion of Dataminr’s internal and external products.
Prior to Dataminr, Aoife led a team of research scientists and engineers working on high-stakes NLP applications in the educational domain at the Educational Testing Service (ETS). The NLP teams at ETS are known leaders in the field of developing and deploying robust, well-documented, scalable NLP prototypes that maintain fairness across user groups.
Aoife holds a PhD in Computational Linguistics from Dublin City University, Ireland, and has also spent time conducting NLP research in Germany, Norway and in the U.S. As an active member of the computational linguistics research community, her research has been published in top-tier journals including Computational Linguistics and the Journal of Research on Language and Computation, as well as conference proceedings at the annual conference for the Association for Computational Linguistics (ACL), the International Conference on Computational Linguistics (COLING) and the Conference on Empirical Methods in Natural Language Processing (EMNLP).
AI for Emergency Response(Demo Talk)
Isaiah Hull is a senior economist in the research division of Sweden’s Central Bank (Sveriges Riksbank). He holds a PhD in economics from Boston College and conducts research on computational economics, machine learning, and quantum computing. He is also the instructor for DataCamp’s “Introduction to TensorFlow in Python” course and the author of “Machine Learning for Economics in Finance in TensorFlow 2.”
Machine Learning for Economics and Finance in TensorFlow 2(Tutorial)
Kyriakos is a Data Scientist and works across a broad set of industries contributing to large scale digital transformation projects for the world’s largest organizations. He specializes in data visualization, predictive modelling and model management.
With his academic background in Mathematics and Operational Research, Applied Statistics and Financial Risk, his day-to-day includes helping clients make better business decisions through Data Management, Business Intelligence, Machine Learning and Artificial Intelligence solutions.
Optimizing Your Analytics Life Cycle with Machine Learning and Open Source(Demo Talk)
Terry is the Director of AI for Advancing Analytics and Microsoft Artificial Intelligence MVP with a focus on all things AI and Data Science. Terry has a passion for applying traditional Software Engineering techniques to Data, to improve the way teams deliver Machine Learning projects. Terry is the host of the popular podcasts Data Science in Production and Totally Skewed, and organises the Global AI Bootcamp London event.
Simplifying Model Production with MLFlow Pipelines and Delta(Demo Talk)
Lee is the General Secretary for the AI Infrastructure Alliance. Based out of the UK, he is responsible for crafting and nurturing relationships with companies to build a canonical stack for AI and ML. When not shuttling his 3 children around, he can most often be found cycling, running and swimming around England’s South Coast.
The Rapid Evolution of the Canonical Stack for Machine Learning(Demo Talk)
Christos has a PhD in Computing and has worked for many years as an ML consultant for many companies covering different domains (telcom, finance, gaming). For the last 3 years, he has been focussing on ML-Ops, defining and curating the ML-Development Lifecycle for the companies that hire him. He has recently embarked on a new adventure with Vortexa Ltd, working as a Lead ML Engineer and helping the company scale technically as it grows.
Dynamicio (a pandas I/O wrapper); Why you Should Start your ML-Ops Journey with Wrapping your I/O(Talk)
He is a computer scientist / bioinformatician who has turned to be a core developer of `scikit-learn` and `fairlearn`, and work as a Machine Learning Engineer at Hugging Face. He is also an organizer of PyData Berlin.
These days he mostly focus on aspects of machine learning and tools which help with creating more ethical and fair decision making systems. This trend has influenced him to work on `fairlearn`, and to work on aspects of `scikit-learn` which would help tools such as `fairlearn` to work more fluently with the package; and at Hugging Face, his focus is to enable the community of these libraries to be able to share their models more easily and be more open about their work.
Oryan is a ֿLead Software Engineer with a passion for Machine Learning and DevOps, with 7 years of experience developing services for production and development environments and leading teams.
Data-driven ML Retraining with Production Insights(Demo Talk)
Bio Coming Soon!
A Systematic Approach for Building Full-Spectrum Model Monitoring(Talk)
Keith McCormick serves as CloudFactory’s Chief Data Science Advisor. He’s also an author, LinkedIn Learning contributor, university instructor, and conference speaker. Keith has been building predictive analytics models since the late 90s. More recently his focus has shifted to helping organizations build and manage their data science teams.
Computer Perception Challenges in Drone Applications Using Quality Data Annotation(Talk)
Carl implemented his first neural net in 2000. He is a senior director of the AI / ML practice at Cognizant, focusing on communications, technology, and media customers. Previously he worked on deep learning and machine learning at Google and IBM. Carl is an author of over 20 articles in professional, trade, and academic journals, an inventor with 6 patents at USPTO, and holds 3 corporate awards from IBM for his innovative work. His machine learning book, “MLOps Engineering at Scale” continues to receive reader acclaim. You can find out more about Carl from his blog www.cloudswithcarl.com
Revealing the Inner Self: Automatic Differentiation (Autodiff) Clearly Explained(Workshop)
Doris Zhong is a Product Manager in Azure AI Platform organization at Microsoft, and she is focusing on the area of machine learning in hybrid cloud. She loves to communicate with customer to get deep insights, and help solve the real problem. In her early career, she worked on building Microsoft internal GPU training platform, that managed tens of thousands of GPUs, and served thousands of users.
Run Azure Machine Learning Anywhere in Multi-cloud or on Premises(Demo Talk)
Jason is the public face of TensorFlow.js, helping web engineers globally take their first steps with machine learning in JavaScript. He also combines his knowledge of the technical and creative worlds to develop innovative prototypes for Google’s largest customers and internal teams with over 15 years experience working within web engineering and investigating emerging technologies.
He holds an MEng in Computer Science, is a member of the British Computing Society, and is a certified information privacy technologist. Jason loves sharing knowledge online which has attracted a global following. In his spare time he can be found walking the wings of flying aircraft being one of the few people in the world who has been trained in the art of wing walking.
Suraj is an ML engineer and developer advocate at Meta AI. In a previous life, he was a data scientist in personal finance. After being bitten by the deep learning bug, he worked in healthcare research (predicting patient risk factors) and behavioral finance (preventing overly-risky trading). Outside of work, you can find him hiking barefoot in the Catskills or being tossed on the Aikido mat.
Simon is the Director of Engineering for Advancing Analytics, a Microsoft Data Platform MVP and one of the few Databricks Beacons Globally. Simon has pioneered Lakehouse Architectures for a some of the world’s largest companies, challenging traditional analytical solutions and pushing for the very best for the data industry. Simon runs the Advancing Spark YouTube channel, where he can often be found digging into Spark features, investigating new Microsoft technologies and cheering on the Delta Lake project.
A Dive into Delta Lake: A Modern File Format for the Next-Generation Lake(Workshop)
Corey studied Mathematics at Michigan State University and works as a Data Scientist with KNIME where he focuses on Time Series Analysis, Forecasting, and Signal Analytics. He is the creator and instructor of the KNIME Time Series Analysis course, author of the e-book: Alteryx to KNIME, creator of the KNIME Time Series Analysis components, and Co-Author of the upcoming Codeless Time Series Analysis Book with Packt.
Moving into the Frequency Domain with the Fourier Transform(Tutorial)
Prathiba is an experienced Data Scientist with a rich background in the Insurance industry. With a Master’s degree in Operational Research with Applied Statistics and Risk, her passion takes form through seeing the varying applications of Machine Learning and AI techniques, and how they propel data scientists to build better models and solutions. Skilled in data analysis and modelling, she utilizes SAS software and Open Source to assess and address problems within enterprise organizations.
Interpretability vs Explainability: Unpacking the Role of Human Morality in AI Models(Talk)
Adi Hirschtein brings 20 years of experience as an executive, product manager and entrepreneur building and driving innovation in technology companies. As the VP of Product at Iguazio, the MLOps platform built for production and real-time use cases, he leads the product roadmap and strategy. His previous roles spanned technology companies such as Dell EMC, Zettapoint and InfraGate, in diverse positions including product management, business development, marketing, sales and execution, with a strong focus on machine learning, database and storage technology. When working with startups and corporates, Adi’s passion lies in taking a team’s ideas from their very first day, through a successful market penetration, all the way to an established business. Adi holds a B.A. in Business Administration and Information Technology from the College of Management Academic Studies.
Yaron Haviv is a serial entrepreneur who has been applying his deep technological experience in data, cloud, AI and networking to leading startups and enterprise companies since the late 1990s. As the co-founder and CTO of Iguazio, Yaron drives the strategy for the company’s MLOps platform and led the shift towards the production-first approach to data science and catering to real-time AI use cases. He also initiated and built Nuclio, a leading open source serverless platform with over 4,000 Github stars and MLRun, Iguazio’s open source MLOps orchestration framework. Prior to co-founding Iguazio in 2014, Yaron was the Vice President of Datacenter Solutions at Mellanox (now NVIDIA), where he led technology innovation, software development and solution integrations. He was also the CTO and Vice President of R&D at Voltaire, a high-performance computing, IO and networking company which floated on the NYSE in 2007. Yaron is an active contributor to the CNCF Working Group and was one of the foundation’s first members. He presents at major industry events and writes tech content for leading publications including TheNewStack, Hackernoon, DZone, Towards Data Science and more.
MLOps Beyond Training: The Production-First Approach to AI(Track Keynote)
Gijsbert Janssen van Doorn is Director of Technical Product Marketing at Run:AI. He is a passionate advocate for technology that will shape the future of how organizations run AI. Gijsbert comes from a technical engineering background, with six years in multiple roles at Zerto, a Cloud Data Management and Protection vendor.
Dr. Yves J. Hilpisch is founder and CEO of The Python Quants (http://tpq.io), a group focusing on the use of open source technologies for financial data science, artificial intelligence, algorithmic trading, and computational finance. He is also founder and CEO of The AI Machine (http://aimachine.io), a company focused on AI-powered algorithmic trading based on a proprietary strategy execution platform.
Yves has a Diploma in Business Administration, a Ph.D. in Mathematical Finance and is Adjunct Professor for Computational Finance at Miami Herbert Business School.
Gal Naamani has been working as a data scientist for 4 years, with the past 3 years being at Fiverr. As the Senior Data Scientist, Gal works closely with developers, analysts, product managers, and business owners on growth opportunities and new ideas, from research to production. Gal currently has leading roles in projects that are focused around search engine ranking, promoted ads, online bidding optimization, exploration-exploitation problems, monitoring, and more.
Utilizing Advanced Monitoring Capabilities to Promote Product-oriented Data Science(Tutorial)
Sonam Srivastava is the founder of Wright Research, an India-based Robo-advisor, where she creates data-driven portfolios out of her deep passion for quant finance. Wright Research is a wealth creator in the digital space that uses scientific data-driven methods to tactically extract opportunities across assets in the public markets to grow clients’ wealth. Wright functions as SEBI registered Robo advisor and is among the most popular advisors among millennial investors with more than 30000 clients and 125 crore+ in assets. Wright Research has delivered a 90% + outperformance over the index in the last 2.5 years. She has 10+ years of experience in investment research and portfolio management, working on systematic strategies, long-short strategies, and algorithmic trading. She started her career in the field with Mumbai-based Forefront Capital, which got acquired by Edelweiss. At Edelweiss, she worked as an algorithm designer at Edelweiss’s institutional equity broking desk. After that, she worked at HSBC Europe as a quant building factor-driven portfolio solutions. Before starting Wright Research, she also worked at Qplum, doing portfolio management at the artificial intelligence-driven Robo-advisor. She graduated from IIT Kanpur and has a master’s in financial engineering from Worldquant University. She is a globally recognized researcher and works as a visiting faculty as AI in Finance Institute New York and BSE Institute Limited.
Deep Reinforcement Learning for Asset Allocation in US Equities (Tutorial)
Kaushik Bokka is a Senior Research Engineer at Grid.ai and one of the core maintainers of the PyTorch Lightning library. He has prior experience in building production scale Machine Learning and Computer Vision systems for several products ranging from Video Analytics to Fashion AI workflows. He has also been a contributor for few other open source projects and aims to empower the way people and organizations build AI applications.
Kaxil is currently working as the Director of Airflow Engineering Team @ Astronomer. Currently, he is one of the top three committers of the Airflow Project based on the number of commits. He is one of the release managers of Airflow. Most prominent works include co-authoring DAG Serialization, Scheduler HA, Secrets Backend.
He did his Masters in Data Science & Analytics from Royal Holloway, University of London. Started as a Data Scientist and then gained experience in Data Engineering, BigData and DevOps space. He began working on Airflow in 2017 while working at Data Reply as a BigData consultant and became a PMC member in 2018 and now works full-time at astronomer.io making Airflow better for everyone. He is a huge cricket fan and his favourite cricketers are Rahul Dravid and Virat Kohli.
Andre joined causaLens from Goldman Sachs, where he was an executive director in the Model Risk Management group in Hong Kong and Frankfurt. Today he is working with industry leading, global organisations to apply cutting edge Causal AI research in production level solutions that empower individuals and teams to make better decisions. Andre received his PhD in theoretical physics from the University of Munich, where he studied the interplay between quantum mechanics and general relativity in black-holes.
Laura is a ML Product Researcher at SeMI Technologies, the company behind the open-source vector search engine Weaviate. She researches new machine learning features for Weaviate and works on everything UX/DX related to Weaviate. For example, she is responsible for the GraphQL API design. She is in close contact with our open source community. Additionally, she likes to solve custom use cases with Weaviate, and introduces Weaviate to other people by means of Meetups, talks and presentations.
Bio Coming Soon!
A Novel Neural Embedding Approach for Fashion Recommender Systems(Workshop)
Alejandro is the Chief Scientist at the Institute for Ethical AI & Machine Learning, where he contributes to policy and industry standards on the responsible design, development and operation of AI, including the fields of explainability, GPU acceleration, privacy preserving ML and other key machine learning research areas. Alejandro Saucedo is also the Director of Engineering at Seldon Technologies, where he leads teams of machine learning engineers focused on the scalability and extensibility of machine learning deployment and monitoring products. With over 10 years of software development experience, Alejandro has held technical leadership positions across hyper-growth scale-ups and has a strong track record building cross-functional teams of software engineers. He is currently appointed as governing council Member-at-Large at the Association for Computing Machinery, and is currently the Chairperson of the GPU Acceleration Kompute Committee at the Linux Foundation.
Security Operations for Machine Learning at Scale with MLSecOps(Talk)
Glen Ford is VP of Product at iMerit — a leading AI data solutions company — where he leads the product management and design teams. Glen holds more than two decades of product development experience across the technology sector. A Graduate of Texas A&M University—Commerce, Glen began his career as a consultant where he handled full-stack web programming and architecture for clients including Time Warner and AIM Funds. Over the years, he has held senior and director-level product management roles at several companies including Demand Media, WP Engine and Humanify. Most recently, Glen spent four years at Alegion — an ML-powered data annotation platform — where he helped the company grow from eight full-time employees to more than 100 in a challenging, emerging market.
The Hidden Layers of Tech Behind Successful Data Labeling(Demo Talk)
Rachel is a Product Manager in Appen’s Autonomous Vehicles working group. In that role, she is working to provide high quality data on all levels of autonomy for motor vehicle clients. Prior to joining Appen, Rachel worked on data science tools to enable model interpretability, fairness testing and automated machine learning. Other passions of hers include using AI and technology to act as a catalyst towards solving humanitarian-centered problems for non-profits around the world.
Data for the AI Lifecycle in Computer Vision and Autonomous Vehicles(Talk)
Manu Kanwarpal is a Senior Specialist in the EMEA AI Global Black Belt team at Microsoft. He specialises in Azure Machine Learning and works with some of Microsoft’s largest customers on establishing their end to end processes for Data Science & MLOps. He is one of the authors of Microsoft’s unified MLOps accelerator called “MLOps v2”.
Microsoft’s Accelerator for MLOps(Workshop)
Originally from the USA but now living in the UK, Joe Depeau has over 25 years of varied experience in the IT industry across a number of domains and specialties. For the last 10 years Joe has focused on technical pre-sales and solution architecture in the data and analytics space, and he is a passionate evangelist for the use of graph databases and especially graph data science. When not geeking out over data and technology he enjoys camping, hiking with his dog, reading, tending to his Animal Crossing island, and playing boardgames and RPGs. He also bakes a mean cheesecake.
A Graph Data Science Framework for the Enterprise (Demo Talk)
Matt Beale is one of CloudFactorys Senior Solutions Consultants. He helps clients around the world overcome their data challenges with AI and ML projects across Autonomous Vehicles, Green Energy and FinTech. Matt joined CloudFactory due his interest in the ethical issue that impact AI and CloudFactorys mission to create meaningful work in the developing world. Away from work Matt has a passion for photography, travelling and unusual cars. In fact his passion for unusual cars bought him to import a Nissan Stagea from Japan to the UK.
Shayan Mohanty is the CEO and Co-Founder of Watchful, a company that largely automates the process of creating labeled training data. He’s spent over a decade of leading data engineering teams at various companies including Facebook, where he served as lead for the stream processing team responsible for processing 100% of the ads metrics data for all FB products. He is also a Guest Scientist at Los Alamos National Laboratory and has given talks on topics ranging from Automata Theory to Machine Teaching.
Applying Interactive Weak Supervision to NLP Tasks(Workshop)
Cindy Weng is a Senior Cloud Solution Architect at Microsoft in Data & AI. She specializes in architecting MLOps solutions for customers across a variety of industries including retail, financial services, consumer goods, and tech. She is one of the authors of the MLOps V2 unified accelerator by Microsoft.
Microsoft’s Accelerator for MLOps(Workshop)
Allan’s background covers a broad technology stack in infrastructure and cloud, working across a variety of roles in large enterprises before moving into Data Science and ML in recent years. His last role was working on time series forecasting at a fintech scale-up before joining Weights and Biases as the first member of the Customer Success team in EMEA.
Best Practices of Effective ML Teams(Demo Talk)
Bio Coming Soon!
Real-Time Analytics: Going Beyond Stream Processing with Apache Pinot(Workshop)
Tyler works as a Data Scientist at Vortexa where he focuses on building machine learning models that capture the dynamics of the energy markets. Prior to Vortexa, Tyler was doing research in clinical machine learning and published work in sports injury analytics and mathematical optimisation. He previously worked as a software engineer working for startups and clients in finance and uses this experience to contribute to the full lifecycle of building machine learning pipelines.
Dynamicio (a pandas I/O wrapper); Why you Should Start your ML-Ops Journey with Wrapping your I/O(Talk)
Adam is an experienced Data Scientist at Imperva’s threat research group where he works on creating machine learning algorithms to help protect Imperva’s customers against database attacks. Before joining Imperva, he obtained a PHD in Neuroscience from Ben-Gurion University of the Negev.
ML with Humans: Integrating Experts into the Learning Process(Workshop)
Luis Vargas is a Partner Technical Advisor to the CTO of Microsoft. Responsible for Microsoft’s AI at Scale initiative coordinating efforts across infrastructure, systems software, models, and products. He bootstrapped the productization of Automated ML and Reinforcement Learning in the Azure AI Platform, worked on the launch of Azure Database Services, and lead the high-availability area for SQL Server. Luis has a PhD in Computer Science from Cambridge University.
The Big Wave of AI at Scale(Keynote)
Brian Lucena is Principal at Numeristical, where he advises companies of all sizes on how to apply modern machine learning techniques to solve real-world problems with data. He is the creator of three Python packages: StructureBoost, ML-Insights, and SplineCalib. In previous roles he has served as Principal Data Scientist at Clover Health, Senior VP of Analytics at PCCI, and Chief Mathematician at Guardian Analytics. He has taught at numerous institutions including UC-Berkeley, Brown, USF, and the Metis Data Science Bootcamp.
StructureBoost: Gradient Boosting with Categorical Structure(Workshop)
Nuria Oliver is the Commissioner to the President of the Valencian Government on AI Strategy and Data Science against COVID-19; Cofounder and Vicepresident of ELLIS; Cofounder of the ELLIS Alicante Unit Foundation; Chief Data Scientist at Data-Pop Alliance.
Nuria earned her PhD from MIT; is a Fellow of the IEEE, an ACM Fellow and Fellow; Member of the Spanish Royal Academy of Engineering, SIGCHI Academy and Academia Europaea. She has 25+ years of research experience in human-centric AI and is the author of 160+ widely cited scientific articles as well as an inventor of 40+ patents and a public speaker. Her work is regularly featured in the media and has received numerous recognitions, including the Spanish National Computer Science Award (Angela Ruiz Robles category), the MIT TR100 (today TR35) Young Innovator Award (first Spanish scientist to receive this award); 2019 Data Scientist of the Year in Europe; 2020 Data Scientist of the Year by ESRI. She has recently co-led ValenciaIA4COVID, the winning team of the 500k XPRIZE Pandemic Response Challenge.
Bio Coming Soon!
How to Tune a Model using Feature Contribution and Simple Analytics(Workshop)