
Distributed Data Show

154 episodes - English - Latest episode: over 3 years ago - ★★★★★ - 15 ratings

The Distributed Data Podcast is your weekly source for the latest news and technical expertise to help you succeed in building large-scale distributed systems. Brought to you by the Developer Advocate team, we go in-depth with DataStax engineers and special guests from the broader data community. New episodes each Tuesday.

Technology apachecassandra database hybridcloud multicloud nosql opensource

Episodes

Building CICD Pipelines in the Modern Age with Christopher Bradford | Ep. 106 Distributed Data Show

July 09, 2019 13:00 - 13 minutes - 12.1 MB

Many DSE users have very long upgrade cycles due to time and complexity concerns. Using the CI/CD methodology, Christopher Bradford has taken up the challenge of making the upgrade path both faster and lower risk. Today we dive in and take a look at what he has been up to. See omnystudio.com/listener for privacy information.

Growing Your Developer Skills with Valerie Parham-Thompson | Ep. 105 Distributed Data Show

July 02, 2019 12:59 - 10 minutes - 9.87 MB

In this industry, showing the drive to expand one's technological skills is crucial. Valerie personifies this drive, and then some. We met with Valerie to discuss all things Cassandra documentation, her love of open source contributions, and some interesting projects she's working on at Pythian.

Apache Cassandra's™ Newest Features with Jake Luciani | Ep. 104 Distributed Data Show

June 25, 2019 13:00 - 24 minutes - 22.6 MB

Few developers joined the Cassandra community at the very beginning and never left. Jake is one of them: he has worked with the Cassandra community as a PMC member and led a team at DataStax Enterprise for almost a dozen years. He attended the DataStax Accelerate conference, and we didn't miss the chance to ask him a few questions about the past and future of Cassandra and DSE!

What's new with Cassandra at Instagram | Ep. 103 Distributed Data Show

June 18, 2019 12:59 - 24 minutes - 22.8 MB

In this episode, Jeff Carpenter talks with Dikang Gu about the origins of Cassandra at Instagram, an update on how the adoption of RocksDB as a storage engine for Cassandra is progressing, geographic data partitioning, and how his team is providing Cassandra as a Service inside Instagram.

A Topical Journey Into Bulk Loading with Brian Hess | Ep. 102 Distributed Data Show

June 11, 2019 13:00 - 11 minutes - 10.9 MB


Cassandra Data Modeling Tools | Ep. 101 Distributed Data Show

June 04, 2019 13:00 - 12 minutes - 11.6 MB

In this episode, Jeff and Adron have a quick topical discussion of some tools they're using to get work done with CQL and databases in general. Adron discusses using JetBrains DataGrip and what it has enabled him to do. Jeff then adds some thoughts of his own and asks: what if Cassandra is not your only database? Adron elaborates on how DataGrip works with many other databases, so when one is approached with work across a wide spectrum of sources they can tackle that ...

Apache Cassandra™ Trends With Aaron Ploetz | Ep. 100 Distributed Data Show

May 30, 2019 18:54 - 12 minutes - 11.3 MB

This week, Aaron Ploetz joins Eric Zietlow to discuss trends within Apache Cassandra and gives us a few resources on how to learn C*.

Diving into Cloud Native Applications with Frank Moley | Ep. 99 Distributed Data Show

May 21, 2019 13:00 - 14 minutes - 13.2 MB

Developers need new development patterns to embrace the cloud. Because all resources are shared, resiliency and security are key.

Shifting Cloud Trends with Peter Bakas | Ep. 98 Distributed Data Show

May 14, 2019 13:00 - 7 minutes - 7.23 MB

How has the industry changed with the rise of cloud and software as a service? What does this mean for the future? All this and more will be covered in today's Distributed Data Show.

A Developer's Journey with Cristina Veale | Ep. 97 Distributed Data Show

May 07, 2019 13:00 - 12 minutes - 11.1 MB

David chats with Cristina about her non-traditional background breaking into the tech sector, and the journey that lies ahead as she takes on her new role of Developer Advocate at DataStax.

Spring Data Cassandra with Chris Splinter | Distributed Data Show Ep. 96

April 30, 2019 13:00 - 15 minutes - 14.1 MB

We talk with Chris Splinter about Spring Data Cassandra, a slight change in our approach to how we talk about it, and a glimpse into possible future improvements in the Java Driver to make it easier to use with the Spring Framework.

SpringBoot: From The Trenches with Frank Moley | Ep. 95 Distributed Data Show

April 23, 2019 13:00 - 9 minutes - 8.96 MB

Spring Boot is a powerful framework that helps developers build applications fast. In this episode, Frank explains how he came to Spring in the first place and what its good and bad sides are.

Apache Cassandra and Timeseries: BFF's? | Ep. 94 Distributed Data Show

April 16, 2019 13:00 - 9 minutes - 8.8 MB

One of the most common use cases for Apache Cassandra™ is time series. After briefly introducing the concept of time series, Alice and Cedrick analyze why Cassandra has gained so much traction and detail what we see at customers, what the pitfalls are, and what today’s challenges are.

Reactive Programming With Cassandra | Ep. 93 Distributed Data Show

April 09, 2019 13:00 - 6 minutes - 6.03 MB

The new 2.0 version of the DataStax Java Driver introduced a new pattern for accessing your data: reactive programming. In this episode, Alexandre explains the reactive APIs available in the new driver and the "sweet spots" for when to use them.

DataStax Java Driver 4.0 with Alexandre Dutra | Ep. 92 Distributed Data Show

April 02, 2019 13:00 - 11 minutes - 10.2 MB

DataStax Engineer Alexandre Dutra gives us a tour of the new DataStax Java Driver 4.0 and offers some tips for a smooth migration for your Java app.

The Power of Cassandra Lightweight Transactions | Ep. 91 Distributed Data Show

March 26, 2019 20:41 - 13 minutes - 12.2 MB

Cassandra is a highly available, partition-tolerant database. In the modern world, what happens when you need strong consistency but still require the partition tolerance that Cassandra provides? Enter the lightweight transaction, which provides ACID-like transactions in distributed Cassandra workloads.

Go and Apache Cassandra Joining Forces with Go-Cql | Ep. 90 Distributed Data Show

March 19, 2019 13:00 - 10 minutes - 10.1 MB


Distributed Data Show Episode 89: Developing Cassandra Apps in Python

March 12, 2019 13:00 - 16 minutes - 14.8 MB

Amanda and Jeff talk about their latest project: implementing the microservice layer of KillrVideo in Python. They talk about what was easy, what was not so easy, and what was just plain fun. They spend some time discussing recommendation engines and ask the audience: should they implement this in DSE Graph, with PySpark, or with another Python package? Comment below!

Distributed Data Show Ep 88: App Development with Graph Data with Dr. Gosnell and Dave Bechberger

March 05, 2019 14:00 - 11 minutes - 10.6 MB

Denise Gosnell interviews Dave Bechberger live at Data Day Texas about the challenges of developing graph-based applications, recommendations on approaches to take, and the resources available for developers new to graph.

Distributed Data Show Episode 87: Kafka and Cassandra with Tim Berglund

February 26, 2019 14:00 - 14 minutes - 13 MB

Jeff and Tim talk about the most common questions developers have about Kafka and three great ways to combine Kafka with Cassandra in your applications.

Distributed Data Show Episode 86: Awkward-Free Spark Development

February 19, 2019 17:33 - 12 minutes - 11.2 MB

Patrick talks with Holden Karau, Developer Advocate at Google, about ways to incorporate Spark in your application without impacting performance or having to learn Scala.

Distributed Data Show Episode 85: Application Development - You Asked, We Answer

February 12, 2019 14:00 - 14 minutes - 13.1 MB

Jeff Carpenter and Amanda Moran discuss the big questions the DataStax Developer Advocate team hears most frequently from our developer community, and introduce a series of episodes to answer questions about application development.

Distributed Data Show Episode 84: What's New with Spark?

February 05, 2019 14:00 - 19 minutes - 17.8 MB

Patrick and Holden talk about the highlights of Spark 2.4, what's coming in Spark 3, and why code reviewers are vital to open source projects.

Distributed Data Show Episode 83: Transactional ML? Yes you Can! with Carter Bradford

January 29, 2019 14:00 - 12 minutes - 11.3 MB


Distributed Data Show Episode 82: Is The Earth Shaking? With Frank Sepulveda

January 22, 2019 14:00 - 19 minutes - 18.1 MB


Distributed Data Show 81: Searching For Success with Alice Lottini and Riccardo Carrera

January 15, 2019 14:00 - 15 minutes - 14.5 MB


Distributed Data Show Episode 80: Finding Bad Actors with Max Melnick

January 08, 2019 14:00 - 14 minutes - 13.4 MB

In this episode Jeff talks with Max Melnick about how he got into analytics consulting with Deloitte (no, he's not an accountant), and how the Mission Graph capability Deloitte has built on top of DataStax Enterprise helps analysts leverage complex networks to detect financial fraud, terrorism, and even supply chain vulnerabilities.

Distributed Data Show Episode 79: Cassandra Use Cases for Banking

January 01, 2019 14:00 - 4 minutes - 4.03 MB

The Developer Days organized by DataStax draw attendees with a range of profiles, not only developers. Today we interview Mr. Grain, an architect, who explains the use cases he foresees for a bank.

Distributed Data Show Episode 78: Developer Days Captain's Log: Stardate 47634.44

December 18, 2018 06:56 - 16 minutes - 14.8 MB

After running six DataStax Developer Days events in cities around the world, Patrick McFadin and Cedrick Lunven take a few minutes to wrap up, share developer feedback and trending technical topics, and look ahead to next year. Highlights: 00:15 Introduction of Patrick and Cedrick; 00:30 Last Developer Day of six in Paris; 01:15 Cedrick gives an overview of the Developer Days; 02:00 Wanted to hear from the developers attending and their use cases; 02:45 What's on the min...

Distributed Data Show Episode 77: 6.7 Kafka and Cassandra with Chris Splinter

December 11, 2018 14:00 - 17 minutes - 16.2 MB

The Kafka Connector is a new and long-awaited feature in DataStax Enterprise 6.7. Chris Splinter, product manager, explains what the connector can do and gives some details on how it can help and empower developers.

Distributed Data Show Episode 76: What's new in DSE 6.7 with Jonathan Ellis

December 04, 2018 14:00 - 10 minutes - 9.22 MB

What’s new in DSE 6.7! Information and discussion about DSE Metric Collector, improvements to Search, improvements to backup and point-in-time restore, and finally the long-awaited DSE Kafka connector!

Distributed Data Show 75: CNCF, Apache Cassandra, Kubernetes, and Prometheus with Luc Perkins

November 27, 2018 14:00 - 21 minutes - 20.1 MB

In this episode Adron Hall speaks with Luc Perkins about his work at the CNCF, Kubernetes, where projects are heading, and what projects they're working on. Adron also speaks with Luc about docs, projects he's been seeing that are really interesting, skeleton code for projects, and lots of other topics.

Distributed Data Show Episode 74: Data Trends in Distributed Systems

November 20, 2018 14:00 - 12 minutes - 11.3 MB

Over the past 4 years, Caroline has worked as a solutions engineer to help many customers adopt Apache Cassandra and DataStax Enterprise. In this episode Caroline shares how her conversations with those customers have changed over time, from an initial focus on scaling out databases, to an emphasis on microservices architecture, to a high-level interest in using machine learning to get insights from data.

Distributed Data Show 73: Talking Retail Systems with Travis Mattera

November 13, 2018 14:00 - 13 minutes - 12.4 MB

In this episode Adron talks with Travis about starting out and working in site reliability at a large retail enterprise. They tackle topics including outages, database sizing, monitoring and observability, disparate workloads, and migrations between cloud providers. Then Travis and Adron move into a discussion of distributed caches and some of the questions we would need to ask to determine the functionality needed. The episode then wraps up with a few outtakes at the end.

Distributed Data Show 72: NoSQL and Cassandra Clusters with Shogo Hoshii

November 06, 2018 14:00 - 8 minutes - 7.57 MB

This week, we sit down with Shogo Hoshii and discuss use cases for Cassandra clusters with NoSQL.

Distributed Data Show Episode 71: Upgrading Cassandra Clusters with Carlos Rolo

October 30, 2018 13:00 - 11 minutes - 10.5 MB

This week, we talk to Carlos Rolo about upgrading Cassandra clusters. Keep watch, there might be a horror story or two!

Distributed Data Show Episode 70: Adding A Graph Database To Legacy Applications

October 23, 2018 15:28 - 8 minutes - 8.18 MB


Distributed Data Show Episode 69: Graph Frames & Fraud Detection with Jim Hatcher

October 16, 2018 16:12 - 10 minutes - 9.6 MB

In this episode Adron speaks with Jim Hatcher, and Jim tells us all about graph frames, fraud detection, and more. We also talk about some additional use cases and other interesting topics around where the industry is moving with regard to graph, analytics, and other business use cases that have really expanded the use of these multi-model databases.

Distributed Data Show Episode 68: Super Nodes & Adjacency Lists with Jonathan Lacefield

October 11, 2018 19:37 - 11 minutes - 10.2 MB

Jonathan Lacefield works on DataStax Enterprise Graph, and today Adron speaks with him about graph at San Francisco Graph Day 2018! We start by delving into super nodes and adjacency lists, then move from what super nodes and adjacency lists give us to what we are doing and what we’re looking for with graph data and graph database solutions.

Distributed Data Show Episode 67: The Developer Is Always Right with Jonathan Ellis

October 02, 2018 14:59 - 29 minutes - 26.8 MB

Jonathan Ellis recounts several attempts in the history of Apache Cassandra to coerce developers into specific behaviors around issues such as sequential scans, tombstones, and joins. We discuss why these attempts didn’t work, and how to respond to developer feedback more effectively.

Distributed Data Show Episode 66: Graph Day Interview With Dr. Gosnell

September 25, 2018 15:00 - 9 minutes - 9.16 MB

Amanda sits down with Dr. Denise Gosnell at Graph Day to discuss highlights from her keynote.

Distributed Data Show Episode 65: Fraud Use Cases for Graph with Jeremy Hanna and Jim Hatcher

September 18, 2018 15:00 - 29 minutes - 27.2 MB

David talks with Jim Hatcher and Jeremy Hanna about use cases for fraud detection, how the landscape has changed over time, what fraudsters are, and how graph databases are a good fit for modern requirements.

Distributed Data Show 64: Future Enterprise Data Architecture With Josh Perryman

September 11, 2018 15:00 - 18 minutes - 17.1 MB

We talk with Josh Perryman of Expero about the current state of the art in enterprise data architectures, how he sees that changing in the future to include a broader set of database and streaming technologies, and how to deal with the resulting complexity.

Distributed Data Show Episode 63: Building Applications On Graph Databases With Josh Perryman

September 04, 2018 15:00 - 23 minutes - 21.6 MB

We talk with Josh Perryman of Expero about his experiences building highly scalable and performant applications using relational databases, graph databases and sometimes even both at the same time.

Distributed Data Show Episode 62: Graph Day Preview With Denise Gosnell

August 28, 2018 15:00 - 17 minutes - 16.3 MB

This week we speak with Denise Gosnell, a graph expert at DataStax. We discuss the excitement around the Graph Day conference in San Francisco coming up on 09/15, how Denise got started with graph, her keynote speech for Graph Day, and her upcoming book!

Distributed Data Show Episode 61: Multicloud Security and Compliance

August 21, 2018 15:00 - 22 minutes - 21.1 MB

Amanda, David, and Adron talk about the challenges associated with moving their KillrVideo reference application to a multi-cloud cluster and the many considerations that come into play when going multi-cloud. Highlights 17:22 - “See where providers line up” what is the lowest common denominator 18:55 - Would latency be very different between providers as compared to within a single provider in any given region? 21:15 - “always know what everything is doing” 23:45 - OpsCenter allows homogeno...

Distributed Data Show Episode 60: Multicloud Security and Compliance

August 14, 2018 15:00 - 16 minutes - 15.3 MB

Amanda, David, and Adron talk about the challenges associated with moving their KillrVideo reference application to a multi-cloud cluster and the many considerations that come into play when going multi-cloud.

Distributed Data Show Episode 59: Multicloud State Management

August 07, 2018 15:00 - 13 minutes - 12.3 MB

In this episode Amanda and Adron tackle the complexities of state management across clouds in a multi-cloud environment. Adron also brings up three best practices that have come up time and time again to help ensure high integrity in state data. Join us for a dive into ways to get prepared for complex state management.

Distributed Data Show Episode 58: Multicloud Networking

July 31, 2018 15:00 - 12 minutes - 11.9 MB


Distributed Data Show Episode 57: Multi-cloud Challenges with Adron Hall and Patrick McFadin

July 24, 2018 15:00 - 13 minutes - 12.2 MB

Patrick introduces new DataStax evangelist Adron Hall and they discuss the challenges of running distributed data across multiple cloud providers.