Towards Data Scientist – Friendly Natural Language Processing


We are witnessing the rapid adoption of Pandas as a main library for representation and manipulation of structured data in python. In the contrary, when it comes to the Natural Language Processing (NLP) applications, we are usually using various NLP libraries with complex and more importantly incompatible output structures. That makes the integration of NLP features and solutions into the machine learning and data science pipeline difficult and time-consuming. To resolve this issue, the Center for Open Source Data and AI Technologies (CODAIT) has developed Text Extensions for Pandas, an open source library of the extensions that turns Pandas data frames into the universal data structure for NLP and hence offers transparency, simplicity and compatibility. In this talk, we will first talk about how we can solve the real-world text analytic problems with NLP. We then show how the Text Extensions for Pandas make these analyses easier.


Monireh Ebrahimi is a Senior Cognitive Software Developer at IBM’s Center for Open-Source Data and AI Technologies (CODAIT) in San Francisco where she works on Open Source, Data & AI Technologies and she has been awarded “Outstanding Technical Award” in 2021. She has obtained her Ph.D. from Data Semantics (DaSe) lab at Kansas State University with a major focus on Neuro-Symbolic Integration. Throughout her dissertation titled "Generalizable Neuro-Symbolic Reasoners", she covers her recent works to bridge the neural and symbolic divide in the context of deep deductive reasoners. Her primary research interests include Deep Learning, Knowledge Graphs, Reasoning, Semantic Web, and Natural Language Processing. She is also really interested in applying NLP and Data Science in real world applications and get the chance to work with customers and partners in various industries and help them in their Data Science journey. She has organized several tutorials on "Current and Future Trends of Neural Knowledge Graph Representation and Reasoning" at IJCAI 2020, US2TS 2019, and US2TS 2020 and served as a PC member or reviewer for Artificial Intelligence (NeurIPS, AAAI, IJCAI, ICML, ICLR, JAIR), and Semantic Web conferences (ISWC, ESWC, TheWebCon) and received Most Outstanding Reviewer Award from WWW 2017.

Open Data Science




Open Data Science
One Broadway
Cambridge, MA 02142

Privacy Settings
We use cookies to enhance your experience while using our website. If you are using our Services via a browser you can restrict, block or remove cookies through your web browser settings. We also use content and scripts from third parties that may use tracking technologies. You can selectively provide your consent below to allow such third party embeds. For complete information about the cookies we use, data we collect and how we process them, please check our Privacy Policy
Consent to display content from - Youtube
Consent to display content from - Vimeo
Google Maps
Consent to display content from - Google