Organizing Google's Datasets
Linear Digressions
English - October 31, 2016 02:17 - 15 minutes - 20.6 MB - ★★★★★ - 350 ratingsTechnology data science machine learning linear digressions Homepage Download Apple Podcasts Google Podcasts Overcast Castro Pocket Casts RSS feed
Previous Episode: Fighting Cancer with Data Science: Followup
Next Episode: Deep Blue
If you're a data scientist, there's a good chance you're used to working with a lot of data. But there's a lot of data, and then there's Google-scale amounts of data. Keeping all that data organized is a Google-sized task, and as it happens, they've built a system for that organizational challenge. This episode is all about that system, called Goods, and in particular we'll dig into some of the details of what makes this so tough.
Relevant links: http://static.googleusercontent.com/media/research.google.com/en//pubs/archive/45390.pdf