Deep Learning and Natural Language Processing for Product Title Summarization

Abstract: Online marketplaces often have millions of products, and the product titles are typically intentionally made quite long for the purpose of being found by search engines. With voice shopping on the verge of taking off (voice shopping is estimated to hit $40+ billion across U.S. and U.K. by 2022), short versions (summaries) of product titles are desired to improve user experience with voice shopping.

In this talk, we present a few different approaches to solve this problem using Natural Language Processing and Deep Learning. We give a historical overview of the technology advancement in these approaches, and compare the evaluation results on a real world dataset.

Bio: Joan Xiao is a Lead Machine Learning Scientist at Figure Eight, a human-in-the-loop machine learning and artificial intelligence company. In her role, she leads research innovation and applies novel technologies to a broad range of real word problems. Previously she led the data science team at H5, a leading data search and analytics service company in e-Discovery industry. Prior to that, she led a Big Data Analytics team at HP.
Joan received her Ph.D in Mathematics and MS in Computer Science from University of Pennsylvania.

Open Data Science Conference