Abstract: Beginning with the deep learning revolution in computer vision and natural language research, deep learning is now permeating at every corner of science and technology. An inherent limitation in adapting deep learning models into new domains is the necessity to train models from scratch, which is particularly time consuming, and further compounded if many different tasks are to be predicted. In this presentation, we will examine how historical research in feature-engineering by domain experts can be used to develop a deep learning “domain expert” model that can be quickly adapted to predict other tasks that it was not trained on originally. We demonstrate this example in the chemistry domain, by predicting a wide range of physical, biological, and materials properties while using only a small set of labeled data. Our results not only show better performance relative to state-of-the-art models developed using engineered features, but also improved performance relative to analogous deep learning models trained in the conventional supervised learning approach. Our findings demonstrate the potential of using “domain expert” deep neural network AI to accelerate research progress, where we envision applications not just in chemistry, but in affiliated fields, such as biotechnology, pharmaceuticals, consumer goods, materials, and potentially in other technical fields as well.
Bio: Garrett Goh is a Scientist at the Pacific Northwest National Lab (PNNL), in the Advanced Computing, Mathematics & Data Division. He was previously awarded the Howard Hughes Medical Institute fellowship which supported his PhD in Computational Chemistry at the University of Michigan. At PNNL, he was awarded the Pauling Fellowship that supports his research initiative of combining deep learning and artificial intelligence with traditional chemistry applications. His current interests is in AI-assisted computational chemistry, which is the application of deep learning to predict chemical properties and the discovery of new chemical insights, while using minimal expert knowledge.