Abstract: Your tools and workflow govern how quickly you can deliver results on new challenges. Often we're constrained by slow algorithms, inefficient data pipelines and suboptimal use of complex tools like Pandas.
We'll look at recent changes in the Python ecosystem enabling fast identification of slow code, simple compilation of CPU-bound numpy processing with Numba, efficient Pandas operations and parallelised medium-data operations with Dask. This talk will give you new tools and process to take back to the office.
This talk is based upon the forthcoming 2nd edition of High Performance Python by Ian Ozsvald & Micha Gorelick, due in 2020.
Bio: Ian is a Chief Data Scientist and Coach, he co-organises the annual PyDataLondon conference with 700+ attendees and the associated 9,000+ member monthly meetup. He runs the established Mor Consulting Data Science consultancy in London, gives conference talks internationally often as keynote speaker and is the author of the bestselling O'Reilly book High Performance Python. He has 16 years of experience as a senior data science leader, trainer and team coach. Past talks and articles can be found at: https://ianozsvald.com/