Drill to Detail Ep.44 'Pandas, Apache Arrow and In-Memory Analytics' With Special Guest Wes McKinney

Drill to Detail

English - December 08, 2017 00:25 - 46 minutes - 53.2 MB - ★★★★★ - 7 ratings

In this episode of Drill to Detail, Mark is joined by Wes McKinney to talk about the origins of pandas, the open-source Python package for data analysis, and his subsequent work within the Apache Software Foundation as a contributor to the Kudu (incubating) and Parquet projects and to Arrow, an in-memory data structure specification for engineers building data systems and the de facto standard for columnar in-memory processing and interchange.

Python Data Analysis Library
"Ibis on Impala: Python at Scale for Data Science"
Drill To Detail Ep.3 'Apache Kudu And Cloudera's Analytic Platform' With Special Guest Mike Percy
Apache Arrow homepage
"Apache Arrow and the '10 Things I Hate About pandas'"
"Apache Arrow vs. Parquet and ORC: Do we really need a third Apache project for columnar data representation?"
"Some comments to Daniel Abadi's blog about Apache Arrow"
Wes McKinney homepage