Lak is the Director for Data Analytics and AI Solutions on Google Cloud. His team builds software solutions for business problems using Google Cloud’s data analytics and machine learning products. He founded Google’s Advanced Solutions Lab ML Immersion program and is the author of three O’Reilly books and several Coursera courses. Before Google, Lak was a Director of Data Science at Climate Corporation and a Research Scientist at NOAA. Follow him on Twitter at @lak_gcp, read articles by him on Medium, and see more details at www.vlakshman.com
Practical Machine Learning on Images(Half-Day Training)
Chip Huyen is an engineer and founder working to develop tools that leverage real-time machine learning. Through her work with Snorkel AI, NVIDIA, and Netflix, she has helped some of the world’s largest organizations deploy machine learning systems. She teaches Machine Learning Systems Design at Stanford. She’s also published four bestselling Vietnamese books.
Kai-Wei Chang is an assistant professor in the Department of Computer Science at the University of California Los Angeles (UCLA). His research interests include designing robust machine learning methods for large and complex data and building fair, reliable, and accountable language processing technologies for social good applications. Dr. Chang has published broadly in natural language processing, machine learning, and artificial intelligence. His research has been covered by news media such as Wires, NPR, and MIT Tech Review. His awards include the Sloan Research Fellowship (2021), the EMNLP Best Long Paper Award (2017), the KDD Best Paper Award (2010), and the Okawa Research Grant Award (2018). Dr. Chang obtained his Ph.D. from the University of Illinois at Urbana-Champaign in 2015 and was a post-doctoral researcher at Microsoft Research in 2016. Additional information is available at http://kwchang.net
Noemi Derzsy is a Senior Inventive Scientist at AT&T Chief Data Office within the Data Science and AI Research organization. Her research is centered on understanding and modeling customer behavior and experience through large-scale consumer and network data, using machine learning, network analysis/modeling, Spatio-temporal mining, text mining, and natural language processing techniques.
Prior to joining AT&T, Noemi was a Data Science Fellow at Insight Data Science NYC and a postdoctoral research associate at Social Cognitive Networks Academic Research Center at Rensselaer Polytechnic Institute. She holds a Ph.D. in Physics, MS in Computational Physics, and has a research background in Network Science and Computer Science.
Noemi is also involved in volunteering in the data science community. She is a NASA Datanaut and former organizer of the Data Umbrella meetup group and NYC Women in Machine Learning and Data Science meetup group.
Ketan Umare is the TSC Chair for Flyte (incubating under LF AI & Data). He is also currently the Chief Software Architect at Union.ai. Previously he had multiple Senior Lead roles at Lyft, Oracle and Amazon ranging from Cloud, Distributed storage, Mapping (map making), and machine learning systems. He is passionate about building software that makes developers and other engineers’ lives easier and provides simplified access to large scale systems. With Flyte, he is trying to bridge gap from ideation to productionization for data and ML pipelines and bring a battle tested approach and structure to the data and ML world.
Deep Dive into Flyte(Half-Day Training)
Neil Sahota is an IBM Master Inventor, United Nations (UN) AI Advisor, author of the book Own the A.I. Revolution., and Chief Innovation Officer at UC Irvine. He is a business solution advisor to several large companies and sought-after keynote speaker. Over his 20+ year career, Neil has worked with enterprises on the business strategy to create next generation products/solutions powered by emerging technology as well as helping organizations create the culture, community, and ecosystem needed to achieve success such as the U.N.’s AI for Good initiative. Neil also actively pursues social good and volunteers with nonprofits. He is currently helping the Zero Abuse Project prevent child sexual abuse as well as Planet Home to engage youth culture in sustainability initiatives.
Jimmy Whitaker is the Data Science Evangelist at Pachyderm. He focuses on creating a great data science experience and sharing best practices for how to use Pachyderm. When he isn’t at work, he’s either playing music or trying to learn something new, because “You suddenly understand something you’ve understood all your life, but in a new way.
MLOps: From 0-60 with Pachyderm(Demo Talk)
Clinton Brownley, Ph.D., is a data scientist at Meta (formerly Facebook), where he’s responsible for a variety of analytics projects designed to empower employees to do their best work. Prior to this role, he was a data scientist at WhatsApp, working to improve messaging and VoIP calling performance and reliability. Before WhatsApp, he worked on large-scale infrastructure analytics projects to inform hardware acquisition, maintenance, and data center operations decisions at Facebook.
As an avid student and teacher of modern data analysis and visualization techniques, Clinton teaches a graduate course in interactive data visualization for UC Berkeley’s MIDS program, taught a short-term graduate course in regression analysis and machine learning workshop for NYU’s A3SR program, leads an annual machine learning in Python workshop, and is the author of two books, “Foundations for Analytics with Python” and “Multi-objective Decision Analysis”.
Clinton is a past-president of the San Francisco Bay Area Chapter of the American Statistical Association and is a council member for the Section on Practice of the Institute for Operations Research and the Management Sciences. Clinton received degrees from Carnegie Mellon University and American University.
Andrea Lowe, PhD is the Training and Enablement Engineer at Domino Data Labs where she develops training on topics including overviews of coding in Python, machine learning, Kubernetes, and AWS. She trained over 1000 data scientists and analysts in the last year. She has previously taught courses including Numerical Methods and Data Analytics & Visualization at the University of South Florida and UC Berkeley Extension. Her conference experience includes a deep learning tutorial at PyCon, 2 invited talks, 21 poster presentations, and 4 chair positions.
Charles Givre recently joined JP Morgan Chase works as a data scientist and technical product manager in the cybersecurity and technology controls group. Prior to joining JP Morgan, Mr. Givre worked as a lead data scientist for Deutsche Bank. Mr. Givre worked as a Senior Lead Data Scientist for Booz Allen Hamilton for seven years where he worked in the intersection of cyber security and data science. At Booz Allen, Mr. Givre worked on one of Booz Allen’s largest analytic programs where he led data science efforts and worked to expand the role of data science in the program. Mr. Givre is passionate about teaching others data science and analytic skills and has taught data science classes all over the world at conferences, universities and for clients. Mr. Givre taught data science classes at BlackHat, the O’Reilly Security Conference, the Center for Research in Applied Cryptography and Cyber Security at Bar Ilan University. He is a sought-after speaker and has delivered presentations at major industry conferences such as Strata-Hadoop World, Open Data Science Conference and others. One of Mr. Givre’s research interests is increasing the productivity of data science and analytic teams, and towards that end, he has been working extensively to promote the use of Apache Drill in security applications and is a committer and PMC Member for the Drill project. Mr. Givre teaches online classes for O’Reilly about Drill and Security Data Science and is a coauthor for the O’Reilly book Learning Apache Drill. Prior to joining Booz Allen, Mr. Givre, worked as a counterterrorism analyst at the Central Intelligence Agency for five years. Mr. Givre holds a Masters Degree in Middle Eastern Studies from Brandeis University, as well as a Bachelors of Science in Computer Science and a Bachelor’s of Music both from the University of Arizona. Mr. Givre blogs at thedataist.com and tweets @cgivre.
Rapid Data Exploration and Analysis with Apache Drill(Half-Day Training)
Jennifer Davis, Ph.D. is a Staff Field Data Scientist at Domino Data Labs, where she empowers clients on complex data science projects. She has completed two postdocs in computational and systems biology, trained at a supercomputing center at the University of Texas, Austin, and worked on hundreds of consulting projects with companies ranging from start-ups to the Fortune 100. Jennifer has previously presented topics at conferences for Association for Computing Machinery on LSTMs and Natural Language Generation and at conferences across the US and in Italy. Jennifer was part of a panel discussion for an IEEE conference on artificial intelligence in biology and medicine. She has practical experience teaching both corporate classes and at the college level. Jennifer enjoys working with clients and helping them achieve their goals.
Lara is a Risk Management Specialist at Federal Reserve Bank of Chicago and occasional adjunct at the University of Chicago’s Booth School of Business, teaching Python and R. Previously she’s taught a data science Bootcamp and built risk models for large financial institutions at McKinsey & Co.
Probabilistic Programming and Bayesian Inference with Python (Half-Day Training)
Victor has managed teams of quantitative analysts in multiple organizations. He is currently Senior Vice President, Data Science and Artificial Intelligence in Workplace Investing at Fidelity Investments. Previously he managed advanced analytics / data science teams in Personal Investing, Corporate Treasury, Managerial Finance, and Healthcare and Total Well-being at Fidelity Investments. Prior to Fidelity, he was VP and Manager of Modeling and Analysis at FleetBoston Financial (now Bank of America), and Senior Associate at Mercer Management Consulting (now Oliver Wyman).
For academic services, Victor is an elected board member of the National Institute of Statistical Sciences (NISS), where he provides guidance to the board and general education to the statistics community. He has also been a visiting research fellow and corporate executive-in-residence at Bentley University, as well as serving on the steering committee of the Boston Chapter of the Institute for Operations Research and the Management Sciences (INFORMS). Victor earned a master’s degree in Operational Research at Lancaster University, UK, and a PhD in Statistics at the University of Hong Kong, and was a Postdoctoral Fellow in Management Science at University of British Columbia. He has co-authored a graduate level econometrics book and published numerous articles in Data Science, Marketing, Statistics, and Management Science literature. and is co-authoring a graduate-level data science textbook titled “Cause-and-Effect Business Analytics.
Kumaran Ponnambalam is an AI and Big Data leader with 15+ years of experience. He is currently the Director of AI for Webex Contact Center at Cisco. He focuses on creating robust, scalable AI platforms and models to drive effective customer engagements. In his current and previous roles, he has built data pipelines, ML models, analytics, and integrations around customer engagement. He has also authored several courses on the LinkedIn Learning Platform in Machine Learning and Big Data areas. He holds an MS in Information Technology and advanced certificates in Deep Learning and Data Science.
Jess Garcia is the Founder of the global Cybersecurity/DFIR firm One eSecurity and a Senior Instructor with the SANS Institute.
During his 25 years in the field, Jess has led a myriad of complex multinational investigations for Fortune 500 companies and global organizations. As a SANS Instructor, Jess stands as one of the most prolific and veteran ones, having taught 10+ different highly technical Cybersecurity/DFIR courses in hundreds of conferences world-wide over the last 19 years.
Jess is also an active Cybersecurity/DFIR Researcher. With the mission of bringing Data Science/AI to the DFIR field, Jess launched in 2020 the DS4N6 initiative (www.ds4n6.io), under which he is leading the development of multiple open source tools, standards and analysis platforms for DS/AI+DFIR interoperability.
Matt currently leads instruction for GA’s Data Science Immersive in Washington, D.C. and most enjoys bridging the gap between theoretical statistics and real-world insights. Matt is a recovering politico, having worked as a data scientist for a political consulting firm through the 2016 election. Prior to his work in politics, he earned his Master’s degree in statistics from The Ohio State University. Matt is passionate about making data science more accessible and putting the revolutionary power of machine learning into the hands of as many people as possible. When he isn’t teaching, he’s thinking about how to be a better teacher, falling asleep to Netflix, and/or cuddling with his pug.
Good, Fast, Cheap: How to do Data Science with Missing Data(Half-Day Training)
Ron Li is a data science instructor and senior data scientist at Galvanize, Inc. Before that, He worked on machine learning and knowledge graphs at the Information Sciences Institute. Ron has published a 4.5-star rating book Essential Statistics for Non-STEM Data Analysts. He has also authored/co-authored several academic papers, taught data science to non-STEM professionals as pro bono service, and gave talks at conferences like PyData.
Jayeeta is a Senior Data Scientist with 6+ years of industry experience. She received her MS in Quantitative Methods and Modeling from NY, and a BS in Economics and Statistics. Currently, Jayeeta works at Fitch Ratings, a global leader in financial information services. Jayeeta is an avid NLP researcher and gets to explore a lot of state-of-the-art models to build cool products and firmly believes that data, of all forms, is the best storyteller. She also led multiple NLP workshops in association with Women Who Code, GitNation among others. Jayeeta has also been invited to speak at International Conference on Machine Learning (ICML 2020), ODSC East, MLConf EU, WomenTech Global Conference, and Data Summit Connect. Jayeeta is passionate about promoting initiatives to inspire more women to take up STEM. Jayeeta lives in New York, she loves to cook, and spends her summers hiking and traveling with her husband. Connect here – https://linktr.ee/JayeetaP
Transfer Learning in NLP with Transformers (Workshop)
Sujit Pal is an applied data scientist at Elsevier Labs, an advanced technology group within the Reed-Elsevier Group of companies. His areas of interests include Semantic Search, Natural Language Processing, Machine Learning and Deep Learning. At Elsevier, he has worked on several machine learning initiatives involving large image and text corpora, and other initiatives around recommendation systems and knowledge graph development. He has co-authored Deep Learning with Keras (https://www.packtpub.com/big-data-and-business-intelligence/deep-learning-keras) and Deep Learning with Tensorflow 2.x and Keras (https://www.packtpub.com/data/deep-learning-with-tensorflow-2-0-and-keras-second-edition), and writes about technology on his blog Salmon Run (https://sujitpal.blogspot.com/).
Known as a “player/coach”, with core expertise in data science, natural language, machine learning, cloud computing; 38+ years tech industry experience, ranging from Bell Labs to early-stage start-ups. Advisor for Amplify Partners, IBM Data Science Community, Recognai, KUNGFU.AI, Primer. Lead committer PyTextRank. Formerly: Director, Community Evangelism @ Databricks, and Apache Spark. Cited in 2015 as one of the Top 30 People in Big Data and Analytics by Innovation Enterprise.
Graph Data Science(Full-Day Training)
Stefanie Molin is a data scientist and software engineer at Bloomberg in New York City, where she tackles tough problems in information security, particularly those revolving around anomaly detection, building tools for gathering data, and knowledge sharing. She is also the author of “Hands-On Data Analysis with Pandas,” which is currently in its second edition. She holds a bachelor’s of science degree in operations research from Columbia University’s Fu Foundation School of Engineering and Applied Science. She is currently pursuing a master’s degree in computer science, with a specialization in machine learning, from Georgia Tech. In her free time, she enjoys traveling the world, inventing new recipes, and learning new languages spoken among both people and computers.
Yashesh Shroff is a Lead Strategy Planner at Intel where he focuses on enabling the AI ecosystem on heterogeneous compute. Recently, as a product manager, he was responsible for the AI and media/game graphics software ecosystem showcasing Intel’s latest-gen graphics architecture (10nm). He has over 15 years of technical and enabling experience, spanning optical modeling, statistical analysis, and capital equipment supply chain at Intel. He has over 20 published papers and 4 patents. He has a Ph.D. in EECS from UC Berkeley and a joint MBA from UC Berkeley Haas & Columbia Graduate School of Business.
Ravi Ilango is a Lead Data Scientist at a silicon valley startup in stealth mode. He is passionate in developing deployable deep learning solutions. Previously he was at StatesTitle and at Foghorn Systems as a Sr. Data Scientist and has over 10 years of experience at Apple as a data Scientist & at Applied Materials in Supply Chain Program Management. Ravi has a Graduate Certificate in Data Mining & Machine Learning from Stanford and completed a Masters Program in Aeronautics and Production Engineering from IIT Madras. He has a BS in Mechanical Engineering, Madras University.
Karl Weinmeister is a Developer Relations Engineering Manager at Google, based out of Austin, Texas. Karl leads a global team of data science and ML engineering experts in the Developer Advocacy organization, who build technical assets and consult with enterprise customers on Artificial Intelligence and Machine Learning. Karl was a contributor to Proverb, an AI-based crossword puzzle solver, which competed at the American Crossword Puzzle Tournament.
Ryan Kasichainula is a data science instructor at Galvanize, Inc, an industry leader in technology education, with data science and software engineering immersive bootcamps. They are also an independent data consultant with experience in the technology, agriculture, energy, and pharmaceutical industries. Ryan enjoys applying data science techniques to a wide variety of domains, and they always have at least one side project in the works, usually in the realm of natural language generation.
Byron has developed large scale data pipelines and processing systems across a variety of industries including Life Sciences, Advertising and Enterprise Software systems. In particular he focuses on distributed systems with low latency requirements for both read and write workloads. Trained as a Statistician with a focus on statistical computing he is also the author of Real-time Analytics published by John Wiley and Sons, which describes both the operational and computational aspects of delivering these systems at scale.
Filipa Peleja is the Levi Strauss & Co Europe Lead Data Scientist at the Data Analytics & AI team. She has always been enthusiastic about technology where she first stepped into the tech world as an undergrad in Computer Science and later Ph.D. in the Machine Learning domain. Her academic accomplishments were recognized with the 1st prize of an industry challenge from a telco and publications in international conferences among which, top tier conferences like SIGIR and ACL. Before joining Levi, Filipa interned at Yahoo! Research and, later, worked as a Sr Data Scientist at Vodafone. Filipa loves to work in an area that she feels very passionate about and also enjoys passing along knowledge, hence, she lectures, supervises projects/thesis for CodeOp, Neueda and Barcelona Technical School.
MLOps… From Model to Production(Workshop)
Ajay K Baranwal is the Center Director at CDLe (Center for Deep Learning in Electronics Manufacturing). He leads applied data science research and development efforts to solve electronics and semiconductor manufacturing problems. Many of his work at the Center relates to machine vision, learning from limited data, and building digital twins to synthesize new data. Before the Center, he has worked on several TensorFlow-based applications, including a Prediction and Diagnostic system, a Document retrieval, and an information extraction system. He holds multiple patents, is coauthor of industrial papers and has been a speaker at related conferences. He is also a co-author of a book named “What’s new in TensorFlow 2.0”.
Sourav Mazumder is an IBM Data Scientist Thought Leader and The Open Group Distinguished Data Scientist. Sourav has consistently driven business innovation and values through methodologies and Technologies related to Artificial Intelligence, Data Science and Big Data transpired through his knowledge, insights, experience and influencing skills across multiple industries including Manufacturing, Insurance, Telecom, Banking, Media, Health Care and Retail industries in USA, Europe, Australia, Japan and India. Over the last 10 years, he has influenced key decision makers of several fortune 500 companies to adopt Artificial Intelligence, Data Science, and Big Data related technologies to address complex business needs. Sourav has also consistently provided directions to and successfully led numerous challenging Artificial Intelligence, Data Science and Big Data projects, applying various related methodologies ranging from Descriptive statistics, Probabilistic Modelling, Algorithmic Modelling, Natural Language Processing, etc., to solve critical business problems. Sourav has also successfully partnered with academia within North America, India, South Africa to mentor students and enable them in this field. Sourav has experience and exposure in working with a variety of Artificial Intelligence, Data Science and Big Data related technologies such as Watson Open Scale, Watson Natural Language Processing, Watson Machine Learning, IBM Cloud Pak for Data, Spark, Hadoop, BigSQL, HBase, MongoDb, Solr, System ML, Cognos, R, Python, Scala/Java and using them in projects involving phases from creation of Minimum Viable Product to Productionization at an enterprise level. Sourav is an Open Source enthusiast and contributes to Open Source regularly. Sourav holds patents in the Data and AI space (patent profile https://patents.justia.com/search?q=Sourav+Mazumder). Sourav consistently publishes papers/blogs/articles in various industry forums. Sourav is co-author, guest editor and chief editor of multiple books in AI, Data Science and Big Data space (https://www.researchgate.net/profile/Sourav-Mazumder). Sourav is regularly invited to speak in various Industry conferences, like Open Data Science Conference, Spark Summit, IBM Think, Global AI Conference, etc in this subject area. He can be found on Linkedin (https://www.linkedin.com/in/souravmazumder/)
Christopher Crowley has 20 years of experience managing and securing networks, beginning with his first job as an Ultrix and VMS systems administrator at 15 years old. Today, Crowley is a Senior Instructor at the SANS Institute and the course author for SOC-Class.com: the culmination of his thoughts on effective cybersecurity operations.
He works with a variety of organizations across industries providing cybersecurity technical analysis, developing and publishing research, sharing expert security insights at conferences, and chairing security operations events. He has provided training to
thousands of students globally.
Crowley holds a multitude of cybersecurity industry certifications and provides independent consulting services specializing in effective computer network defense via Montance®, LLC, based in Washington, DC.
Data Analysis for SOC Survey(Workshop)
Zhenya Antić is an NLP consultant and founder of Practical Linguistics Inc. Her projects include document summarization, information extraction, topic modeling and sentiment analysis of consumer reviews, and document similarity. She is the author of the recently published Python Natural Language Processing Cookbook. Zhenya holds a PhD in Linguistics from the University of California Berkeley and a BS in Computer Science from the Massachusetts Institute of Technology.
Introduction to NLP and Topic Modeling(Workshop)
Leo Meyerovich co-founded Graphistry in early 2014. Previously, he researched programming language design at UC Berkeley and Brown University. His PhD introduced the first multicore web browser (3 PLDI SRC awards) and led to browser parallelization at Mozilla, Samsung, Google, Microsoft Research, and Qualcomm. Leo also performed the largest scale analysis of programming language adoption and social underpinnings (OOPSLA best paper) and, with security researchers at Google, Microsoft, and Brown University, designed several secure web scripting languages. Earlier, he designed Flapjax, the first functional reactive language for highly concurrent web software (OOPSLA best paper). His research was supported by the first Qualcomm Innovation Fellowship (winner among 50 Ph.D. teams at Berkeley and Stanford), the NSF GRFP, and grants from Samsung, Nokia, Microsoft, NVIDIA, Intel, and others.
Thomas J. Fan is a Senior Software Engineer at Quansight Labs, working to sustain and evolve the PyData open-source ecosystem. He is a maintainer for scikit-learn, an open-source machine learning library written for Python. Previously, he worked at Columbia University, improving the interoperability between scikit-learn and AutoML systems. Thomas holds a Masters in Physics from Stony Brook University and a Masters in Mathematics from New York University.
Magnus Ekman is a Director of Architecture at NVIDIA, where he leads an engineering team working on CPU performance and power efficiency. As the deep learning (DL) field exploded in the past few years, fueled by NVIDIA’s GPU technology and CUDA, he found himself in the midst of a company expanding beyond computer graphics and becoming a DL powerhouse. As a part of that journey, he challenged himself to stay up to date with the most recent developments in the field. In collaboration with NVIDIA Deep Learning Institute (DLI) he recently published the book “Learning Deep Learning: Theory and Practice of Neural Networks, Computer Vision, Natural Language Processing, and Transformers Using TensorFlow.”
Jacob Schreiber is a post-doctoral researcher at the Stanford School of Medicine. As a researcher, he has developed machine learning approaches to integrate thousands of genomics data sets, to design biological sequences with desired characteristics, and has described how statistical pitfalls can be encountered and accounted for in genomics data sets. As an engineer, he has contributed to the community as a core contributor to scikit-learn and as the developer of several machine learning toolkits, including pomegranate for probabilistic modeling and apricot for submodular optimization.
Jeffrey is a VP of Data Science, Data Engineering, and Platform Engineering at the Store Associate Technology of Walmart Global Technology. His prior roles include the Chief Data Scientist at AllianceBernstein, a global asset management firm that managed nearly $700 billion, Vice President and Head of Data Science at Silicon Valley Data Science, and senior leadership position at Charles Schwab Corporation and KPMG. He has also taught econometrics, statistics, and machine learning at UC Berkeley, Cornell, NYU, University of Pennsylvania, and Virginia Tech. Jeffrey is active in the data science community and often speaks at data science conferences and local events. He has many years of experience in applying a wide range of econometric and machine learning techniques to create analytic solutions for financial institutions, businesses, and policy institutions. Jeffrey holds a Ph.D. and an M.A. in Economics from the University of Pennsylvania and a B.S. in Mathematics and Economics from UCLA.
Brian Lucena is Principal at Numeristical, where he advises companies of all sizes on how to apply modern machine learning techniques to solve real-world problems with data. He is the creator of three Python packages: StructureBoost, ML-Insights, and SplineCalib. In previous roles he has served as Principal Data Scientist at Clover Health, Senior VP of Analytics at PCCI, and Chief Mathematician at Guardian Analytics. He has taught at numerous institutions including UC-Berkeley, Brown, USF, and the Metis Data Science Bootcamp.
Haytham is a co-founder and CTO of Union.AI. And a co-founder and a maintainer of the Flyte Open Source Project. Haytham has gained experience in building distributed systems and cloud native solutions through his tenure at Microsoft, Google and Lyft.
Opportunities to form working relationships with some of the world’s top data scientists for follow-up questions and advice.
Access to 40+ training sessions and 70+ workshops.
Hands-on experience with the latest frameworks and breakthroughs in data science.
Affordable training–equivalent training at other conferences costs much more.
Professionally prepared learning materials, custom- tailored to each course.
Opportunities to connect with other ambitious, like-minded data scientists.