Boston | April 14th – April 17th, 2020

Natural Language Processing Track

Learn the latest models, advancements, and trends from the top practitioners and researchers behind NLP

NLP has seen rapid advances in recent years. With some of the sharpest minds in data science presenting, get the latest insights, natural language processing training, trends, and discoveries in data science languages, tools, topics – and beyond.

Connect with some of the most innovative people and ideas in the world of data science, while learning first-hand from core practitioners and contributors. Learn about the latest advancements and trends in NLP, including pre-trained models, with use-cases focusing on deep learning, speech-to text, and semantic search.

Some of Our Current NLP Speakers


See our full speaker list
2020 Speakers

Sample Talk, Workshop, and Training Sessions

Natural Language Processing Sessions
Friday, April 17th
Thursday, April 16th
Wednesday, April 15th
Friday, April 17th
Thursday, April 16th
Wednesday, April 15th
10:40 - 12:10
An Introduction to Transfer Learning in NLP and HuggingFace Tools

Workshop | NLP | Beginner-Intermediate

 

In this session, I’ll start by introducing the recent breakthroughs in NLP that resulted from the combination of Transfer Learning and Transformer architectures. Then, we’ll learn to use the open-source tools released by HuggingFace like the Transformers and Tokenizers libraries and the distilled models.
Learning outcomes: understanding Transfer Learning in NLP, how the Transformers and Tokenizers libraries are organized and how to use them for downstream tasks like text classification, NER and text generation...more details

An Introduction to Transfer Learning in NLP and HuggingFace Tools image
Thomas Wolf, PhD
Chief Science Officer | Hugging Face 🤗
Session Title by Veysel Kocaman Coming Soon!

Workshop

Session Title by Veysel Kocaman Coming Soon! image
Veysel Kocaman, PhD
Senior Data Scientist | John Snow Labs
12:40 - 14:10
Transform your NLP Skills: Using BERT (and Transformers) in Real Life

Workshop | NLP | Machine Learning | Intermediate-Advanced

 

This workshop teaches you the use of transformer neural networks and their incarnations (BERT, RoBERTa, GPT-2) for solving real-world natural language use cases. NLP has advanced tremendously over the last few years and BERT is at the forefront of this success having achieved state-of-the-art results on 11 different NLP tasks. For businesses, BERT has unlocked new NLP use cases that have been previously unattainable.

This workshop will teach you what transformers and systems like BERT and GPT-2 are and how to use and modify them for your needs. Organizations have a wealth of unstructured text sources in every line of business, such as employee feedback in human resources, purchase orders and legal documents in contracting and procurement, communication records throughout the org, and many more. Making sense of this information and organizing it into knowledge and actionable insights to improve business outcomes is a key function every data scientist should be aware of…more details

Transform your NLP Skills: Using BERT (and Transformers) in Real Life image
Niels Kasch, PhD
Data Scientist and Founding Partner | Miner & Kasch
12:45 - 13:30
Alternatives to Reinforcement Learning for Real World Problems

Talk | Machine Learning | Beginner-Intermediate

 

So what can we use today? Two related approaches for agent-based learning that exist for real world use cases are Contextual Bandits and Imitation Learning. Both can be seen as simplifications to the full RL problem by relaxing certain assumptions, such as the number of environmental states or the need to balance online exploitation vs exploration, respectively. This talk will introduce both the formal Contextual Bandit and Imitation Learning problems, how they differ from full RL, what their limitations are, and where they can be used to solve real world problems...more details

Alternatives to Reinforcement Learning for Real World Problems image
Byron Galbraith, PhD
Chief Data Scientist | Talla
13:35 - 14:20
Level Up: Fancy NLP with Straightforward Tools

Talk | NLP | Open-source | Intermediate

 

Natural language processing has exploded in popularity during the last decade. No longer confined to academia, many companies now see NLP as a critical portion of their business intelligence, with the NLP market size expected to double again in the next two years. Traditional NLP approaches like sentiment analysis and topic modeling provide undeniably meaningful insights, but what other techniques can be leveraged to mine information from text?

This talk focuses on lesser known NLP methods that can help unearth novel observations and make analyses more memorable. After a brief introduction to the topic, attendees will learn about various open-source Python packages they can apply to enhance their NLP workflows. Example use cases will also be discussed to further solidify how each technique may be leveraged with existing data. Attendees of this talk will discover several unconventional NLP tools such as:
– Scattertext for comparing word usage between two populations
– spaCy’s linguistic features to parse sentences by syntax
– DeepMoji for assigning emoji labels to short textmore details

Level Up: Fancy NLP with Straightforward Tools image
Kimberly Fessel, PhD
Senior Data Scientist, Instructor | Metis
13:35 - 14:20
Developing Natural Language Processing Pipelines for Industry

Talk | NLP | ML for Programmers | Intermediate

 

Machine learning has become a core technology underlying many modern applications, especially utilizing natural language processing, where the techniques provide powerful methods for analyzing large data sets, such as contracts, electronic health records, social interactions, and other unstructured text data. With the ability for recent powerful techniques to retain meaning, search, and perform machine translation at high fidelity, alongside many open source traditional and hybrid methods, transforming unstructured content to structured insights, events, and relationships is at the fingertips. Organizations are looking to leverage these emerging technologies and close capability gaps to ingest, monitor, error-check, automate, or improve their capabilities in processing and understanding hundreds of millions of documents. While certain tasks are well addressed by existing systems, organizations often still struggle with implementation, identification of the correct methods & algorithms, as well as properly scale their models to solve open challenges within their terminology. In this session, we examine the data strategy and technical use cases involving natural language processing, the algorithms appropriate for certain project objectives, and discuss the development and deployment of these solutions…more details

Developing Natural Language Processing Pipelines for Industry image
Michael Luk, PhD
Chief Technology Officer | SFL Scientific
14:25 - 15:10
Applying State-of-the-art Natural Language Processing for Personalized Healthcare

Talk | NLP | Deep Learning | Intermediate

 

Accelerating progress in personalized healthcare requires learning the causal relationships between diseases, genes, treatments, medications, labs, and other clinical information – at scale over a large population and time range. More than half of the clinically relevant data in oncology is only found in free-text pathology reports, radiology reports, sequencing reports, and progress notes.

Extracting and normalizing these facts from these clinical documents requires training oncology-specific models that can accurately extract these specific facts from a variety of documents. This talk describes results and lessons learned, from a real-world project doing this at scale…more details

Applying State-of-the-art Natural Language Processing for Personalized Healthcare image
David Talby, PhD
CTO | Pacific AI
15:00 - 16:30
State of the Art Natural Language Processing at Scale

Tutorial | NLP | Machine Learning | Intermediate


David Talby presents the open-source Spark NLP package for training distributed custom natural language machine-learned pipelines on Apache Spark. The library natively extends Spark ML and includes state-of-the-art deep learning models, language models, and 30+ pre-trained NLP models. The talk walks through the library’s goals, design and API’s, using Jupyter notebooks that will be made publicly available after the talk. Best practices and industry use cases where the library has been applied will be discussed as well…more details

State of the Art Natural Language Processing at Scale image
David Talby, PhD
CTO | Pacific AI
Select date to see events.

See all our talks and hands-on workshop and training sessions
See all sessions

What You'll Learn

Talks & Workshops on these topics:

Topics

  • Natural Language Processing

  • NLP Transformers

  • Pre-trained Models

  • Text Analytics

  • Natural Language Understanding

  • Sentiment Analysis

  • Natural Language Generation

  • Speech Recognition

  • Named Entity Extraction

Models

  • BERT

  • XLNet

  • GPT-2

  • Transformers

  • Word2Vec

  • Deep Learning Models

  • RNN & LSTM

  • Machine Learning Models

  • ULMFiT

  • Transfer Learning

Tools 

  • Tensorflow 2.0

  • Hugging Face Transformers

  • PyTorch

  • Theano 

  • SpaCy

  • NLTK

  • AllenNLP

  • Stanford CoreNLP

  • Keras

  • FLAIR 

You Will Meet

  • Some of the world’s best data science speakers

  • The brains and authors behind today’s most popular open data science tools, topics, and languages

  • Hundreds of attendees focused on data science

  • Chief Data Scientists

  • Thought leaders working in data science

  • Data Scientists and Analysts

  • Software Developers

  • CEOs, CTOs, CIOs

  • Data Visualization professionals

  • Venture Capitalists and Investors

  • Startup Founders and Executives

  • Attendees from Healthcare, Finance, Education, Business, Intelligence, and other industries

  • Big data and data science innovators

Why Attend?

Several of the best minds and biggest names in data science will be presenting

Network with attendees from leading data science companies to learn how others are tackling similar problems

Gain quality training in the hottest data science topics, tools, and languages

Learn the latest in data science from industry leaders without having to make room in the budget — tickets are surprisingly inexpensive

Sign Up for ODSC EAST 2020 | April 14th – April 17th

Register Now