![Slight Reliability artwork](https://is3-ssl.mzstatic.com/image/thumb/Podcasts114/v4/74/fa/c5/74fac5c8-a693-f74c-4093-0020252ed898/mza_15205879552970553543.jpg/100x100bb.jpg)
Slight Reliability
116 episodes - English - Latest episode: about 1 month ago -Learning SRE, one day at a time.
Homepage Apple Podcasts Google Podcasts Overcast Castro Pocket Casts RSS feed
Episodes
Slight Reliability Episode 38 - SRE Reading
January 09, 2023 19:00 - 10 minutes - 7.24 MBTo begin 2023 I share the books I read last year in my quest to be a better SRE. Here is a list of all the books mentioned during the episode: The Phoenix Project by Gene Kim, Kevin Behr, and George Spafford https://www.amazon.com/Phoenix-Project-DevOps-Helping-Business/dp/0988262592 Site Reliability Engineering (by Google) https://sre.google/sre-book/table-of-contents/ Sooner, Safer, Happier by Jonathon Smart https://soonersaferhappier.com/book/ The Toyota Way by Jeffrey Liker https://...
Slight Reliability Episode 37 - Observability New Year's Resolutions with Henrik Rexed
December 19, 2022 19:00 - 45 minutes - 31.5 MBThis week Henrik Rexed and Stephen Townshend discuss their New Year's resolutions for observability. They cover OpenTelemetry and a unified query language, continuous profiling, raw data analysis, instrumenting code, using distributed tracing as part of testing, and much more. Some of the tools or resources mentioned during the episode include: https://tracetest.io/ (distributed tracing for testing) https://github.com/open-telemetry/opamp-go (OTEL orchestration) https://ebpf.io/ (for contin...
Slight Reliability Episode 36 - Starting an SRE Team from Scratch with Gwen Berry and Steve Gill
December 12, 2022 19:00 - 28 minutes - 19.5 MBThis week we talk to Steve Gill and Gwen Berry from IAG to discuss their experiences forming an SRE incubator team (starting SRE from scratch in a large enterprise). We discuss on-call, SLOs, single pane of glass, pivoting, chaos engineering, and much more. You can find Steve on LinkedIn: https://www.linkedin.com/in/stevegill239/ You can find Gwen on LinkedIn: https://www.linkedin.com/in/gwen-berry-56324418b/ You can find me on: LinkedIn: https://www.linkedin.com/in/stephentownshend/ Twitt...
Slight Reliability Episode 35 - SRE Trends from re:Invent 2022
December 05, 2022 19:00 - 15 minutes - 10.8 MBThis week I share the observations I made at AWS re:Invent relating to SRE work including the lack of SREs at the event, data warehouses for observability data, the use of topologies to understand complexity, FinOps, serverless, making sense of enormous amounts of data... and more. You can find me on: LinkedIn: https://www.linkedin.com/in/stephentownshend/ Twitter: https://twitter.com/the_kiwi_sre
Slight Reliability Episode 34 - What is Observability? (Live at re:Invent)
November 30, 2022 17:00 - 8 minutes - 5.8 MBThis week I was at the AWS re:Invent conference in Las Vegas, so I took the opportunity to walk around the expo asking observability vendors what their perspective or definition of "observability" was (and reflected on that). You can find me on: LinkedIn: https://www.linkedin.com/in/stephentownshend/ Twitter: https://twitter.com/the_kiwi_sre
Slight Reliability Episode 33 - The Many Faces of SRE
November 21, 2022 19:00 - 13 minutes - 9.34 MBIn this episode I explore the different kinds of SRE out there and the different needs they fill in the industry, and discuss some ethically dubious practices around hiring SREs. You can find me on: LinkedIn: https://www.linkedin.com/in/stephentownshend/ Twitter: https://twitter.com/the_kiwi_sre Music from Uppbeat (free for Creators!). Intro: https://uppbeat.io/t/sensho/good-times License code: QBXDSEGNJZY9DDIC Outro: https://uppbeat.io/t/mountaineer/voyager License code: 5C0VMTUOULFSRSTM
Slight Reliability Episode 32 - Social Reliability Engineering with Kyle Forster and Shea Stewart
November 14, 2022 19:00 - 45 minutes - 31.3 MBIn this episode I chat to Kyle Forster and Shea Stewart from RunWhen about the concept of "social reliability engineering" and how it could help SREs from organisations all over the world create an ecosystem of sharing and collaboration. You can find Kyle on LinkedIn: https://www.linkedin.com/in/kyforster/ You can find Shea on LinkedIn: https://www.linkedin.com/in/sheastewart/ To find out more about RunWhen: https://www.runwhen.com/ And an example of the "street map view" of a tech stack: h...
Slight Reliability Episode 31 - I Still Wanna Know What SRE Is!
November 07, 2022 19:00 - 9 minutes - 6.74 MBIn this episode I reflect back on the very first episode of Slight Reliability "What the heck is SRE anyway?" and see if my perspective has changed since then. I also tackle the confusion about what SRE is and is not. Shout out to Sebastian Vietz (https://www.linkedin.com/in/sebastianvietz/) for his "Service Reliability Engineering" terminology and Richard Benwell (https://www.linkedin.com/in/richard-benwell-ab887b11/) for highlighting the way SRE offers a different value proposition depend...
Slight Reliability Episode 30 - A Change of Pace
October 31, 2022 19:00 - 7 minutes - 5.16 MBIn this episode I announce my new role as Developer Advocate (SRE) at SquaredUp, and what this means for the Slight Reliability podcast. You can find me on: LinkedIn: https://www.linkedin.com/in/stephentownshend/ Twitter: https://twitter.com/the_kiwi_sre Music from Uppbeat (free for Creators!). Intro: https://uppbeat.io/t/sensho/good-times License code: QBXDSEGNJZY9DDIC Outro: https://uppbeat.io/t/mountaineer/voyager License code: 5C0VMTUOULFSRSTM
Slight Reliability Episode 29 - Team Topologies
October 24, 2022 19:00 - 17 minutes - 12.3 MBIn this episode I give a summary of the book Team Topologies by Matthew Skelton and Manual Pais (https://teamtopologies.com/book) and how this relates to implementing SRE practices. (POINT OF CORRECTION: One of the authors is "Matthew" Skelton, not "Michael") You can find me on: LinkedIn: https://www.linkedin.com/in/stephentownshend/ Twitter: https://twitter.com/the_kiwi_sre Music from Uppbeat (free for Creators!). Intro: https://uppbeat.io/t/sensho/good-times License code: QBXDSEGNJZY9D...
Slight Reliability Episode 28 - State of DevOps 2022
October 17, 2022 19:00 - 12 minutes - 8.94 MBIn this episode I give my take on the Accelerate State of DevOps 2022 from the SRE perspective. You can find the Accelerate State of DevOps Report 2022 here: https://cloud.google.com/devops/state-of-devops/ You can find me on: LinkedIn: https://www.linkedin.com/in/stephentownshend/ Twitter: https://twitter.com/the_kiwi_sre Music from Uppbeat (free for Creators!). Intro: https://uppbeat.io/t/sensho/good-times License code: QBXDSEGNJZY9DDIC Outro: https://uppbeat.io/t/mountaineer/voyager ...
Slight Reliability Episode 27 - Anxiety Engineering
October 10, 2022 19:00 - 13 minutes - 9.58 MBIn this episode I share my experience relapsing into anxiety and insomnia, ruminate on an SRE's sphere of influence, and tease an upcoming change of role. You can find me on: LinkedIn: https://www.linkedin.com/in/stephentownshend/ Twitter: https://twitter.com/the_kiwi_sre Music from Uppbeat (free for Creators!). Intro: https://uppbeat.io/t/sensho/good-times License code: QBXDSEGNJZY9DDIC Outro: https://uppbeat.io/t/mountaineer/voyager License code: 5C0VMTUOULFSRSTM
Slight Reliability Episode 26 - The Toyota Way
September 26, 2022 19:00 - 19 minutes - 13.3 MBIn this episode I reflect on the book "The Toyota Way" by Jeffrey Liker, and explore four principles which resonate with my work. The book in question is The Toyota Way: https://www.amazon.com/Toyota-Way-Second-Management-Manufacturer/dp/1260468518 You can find me on: LinkedIn: https://www.linkedin.com/in/stephentownshend/ Twitter: https://twitter.com/the_kiwi_sre Music from Uppbeat (free for Creators!). Intro: https://uppbeat.io/t/sensho/good-times License code: QBXDSEGNJZY9DDIC Outr...
Slight Reliability Episode 25 - Continuous Delivery
September 19, 2022 20:00 - 9 minutes - 6.49 MBIn this episode I discuss the concept behind continuous delivery and share the ideas we've been exploring at IAG. The book I mentioned is The Toyota Way: https://www.amazon.com/Toyota-Way-Second-Management-Manufacturer/dp/1260468518 You can find me on: LinkedIn: https://www.linkedin.com/in/stephentownshend/ Twitter: https://twitter.com/the_kiwi_sre Music from Uppbeat (free for Creators!). Intro: https://uppbeat.io/t/sensho/good-times License code: QBXDSEGNJZY9DDIC Outro: https://uppbea...
Slight Reliability Episode 24 - Interview with Abby Bangser
September 12, 2022 20:00 - 28 minutes - 19.7 MBIn this episode I have a chat with Bangser about the transition from testing to SRE, the barriers thrown in front of testers (which SREs don't tend to face), being humble to be let in the door, and *much* more. You can find Abby on LinkedIn: https://www.linkedin.com/in/abbybangser/ The book she mentioned was Infrastructure as Code by Kief Morris https://www.thoughtworks.com/insights/books/infrastructure-as-code-2nd-edition You can find Chastity Majors (cofounder of Honeycomb) on Twitter:...
Slight Reliability Episode 23 - Grafana Central
September 05, 2022 20:00 - 18 minutes - 12.9 MBIn this episode I share the story of Grafana Central, an observability platform that we've been standing up at IAG. You can find me on: LinkedIn: https://www.linkedin.com/in/stephentownshend/ Twitter: https://twitter.com/the_kiwi_sre Music from Uppbeat (free for Creators!). Intro: https://uppbeat.io/t/sensho/good-times License code: QBXDSEGNJZY9DDIC Outro: https://uppbeat.io/t/mountaineer/voyager License code: 5C0VMTUOULFSRSTM
Slight Reliability Episode 22 - It's SLO Going
August 29, 2022 20:00 - 19 minutes - 13.3 MBIn this episode I share a talk I did earlier in the year as part of the Grafana User Group APAC. I share our experiences attempting to implement SLOs at IAG, and our reliability benchmarking work which is a great way to get started if SRE is brand new to your organisation. You can find me on: LinkedIn: https://www.linkedin.com/in/stephentownshend/ Twitter: https://twitter.com/the_kiwi_sre Music from Uppbeat (free for Creators!). Intro: https://uppbeat.io/t/sensho/good-times License code: ...
Slight Reliability Episode 21 - Rubik's Kube
August 22, 2022 20:00 - 12 minutes - 8.38 MBIn this episode I share experiences and ideas about Kubernetes, and what I learned from speaking to Ruben Hakopiean from Kubevious. I'd like to give a huge shout out to Ruben. Many of the topics and ideas discussed come straight from what was discussed in the interview we recorded (but were unable to publish due to audio issues). You can find Ruben on LinkedIn: https://www.linkedin.com/in/rubenhak/ And find out more about Kubevious here: https://kubevious.io/ You can find me on: LinkedIn...
Slight Reliability Episode 20 - Interview with Joey Hendricks
August 15, 2022 20:00 - 31 minutes - 22 MBIn this episode I have a chat with Joey Hendricks about running performance tests in production. You can find Joey on LinkedIn: https://www.linkedin.com/in/joey-hendricks/ And GitHub: https://github.com/JoeyHendricks You can find me on: LinkedIn: https://www.linkedin.com/in/stephentownshend/ Twitter: https://twitter.com/the_kiwi_sre Music from Uppbeat (free for Creators!). Intro: https://uppbeat.io/t/sensho/good-times License code: QBXDSEGNJZY9DDIC Outro: https://uppbeat.io/t/mountainee...
Slight Reliability Episode 19 - NZ DevOps Summit 2022
August 08, 2022 20:00 - 12 minutes - 8.31 MBIn this episode I share my takeaways from the NZ DevOps Summit held in Auckland. This was the first in-person event I had attended in three years. You can find me on: LinkedIn: https://www.linkedin.com/in/stephentownshend/ Twitter: https://twitter.com/the_kiwi_sre Music from Uppbeat (free for Creators!). Intro: https://uppbeat.io/t/sensho/good-times License code: QBXDSEGNJZY9DDIC Outro: https://uppbeat.io/t/mountaineer/voyager License code: 5C0VMTUOULFSRSTM
Slight Reliability Episode 18 - Interview with Chris Evans
August 01, 2022 20:00 - 31 minutes - 21.9 MBIn this episode I have a chat with Chris Evans from incident.io about using incidents to lift the lid on an organisation, how aiming for zero incidents can stall an organisation, how tracking MTTR is unhelpful, and much more. You can find Chris on LinkedIn: https://www.linkedin.com/in/evnsio/ Here are the resources Chris mentioned... The practical guide to incident management: http://incident.io/guide The Field Guide to Understanding Human Error (by Sidney Dekker) https://www.oreilly.com...
Slight Reliability Episode 17 - Interview with Ganesh Datta
July 18, 2022 20:00 - 27 minutes - 18.8 MBIn this episode I have a chat with Ganesh Datta, CTO and co-founder of Cortex.io. In this episode we discuss the human challenges of microservices, gamifying reliability, connecting business outcomes with SRE work, and much more. You can find Ganesh on LinkedIn: https://www.linkedin.com/in/gsdatta/ You can find me on: LinkedIn: https://www.linkedin.com/in/stephentownshend/ Twitter: https://twitter.com/the_kiwi_sre Music from Uppbeat (free for Creators!). Intro: https://uppbeat.io/t/sensh...
Slight Reliability Episode 16 - Interview with Sebastian Vietz
July 11, 2022 20:00 - 41 minutes - 28.3 MBIn this episode I have a chat with Sebastian Vietz, an SRE lead based in Canada who has been leading the implementation of SRE across different teams and organisations for eight years. In this episode we discuss SLO adoption, SRE going mainstream, virtual teams, and many other topics. You can find Sebastian on LinkedIn: https://www.linkedin.com/in/sebastianvietz/ You can find me on: LinkedIn: https://www.linkedin.com/in/stephentownshend/ Twitter: https://twitter.com/the_kiwi_sre Music fro...
Slight Reliability Episode 15 - SLObro
July 04, 2022 20:00 - 11 minutes - 8.06 MBIn this episode I discuss potential pre-requisites that are ideally in place before attempting to adopt SLOs. You can find me on: LinkedIn: https://www.linkedin.com/in/stephentownshend/ Twitter: https://twitter.com/the_kiwi_sre Music from Uppbeat (free for Creators!). Intro: https://uppbeat.io/t/sensho/good-times License code: QBXDSEGNJZY9DDIC Outro: https://uppbeat.io/t/mountaineer/voyager License code: 5C0VMTUOULFSRSTM
Slight Reliability Episode 14 - SLOpoke
June 27, 2022 20:00 - 11 minutes - 7.59 MBIn this episode I share my updated thinking on SLOs and an "ah-ha!" moment I had. You can find me on: LinkedIn: https://www.linkedin.com/in/stephentownshend/ Twitter: https://twitter.com/the_kiwi_sre Music from Uppbeat (free for Creators!). Intro: https://uppbeat.io/t/sensho/good-times License code: QBXDSEGNJZY9DDIC Outro: https://uppbeat.io/t/mountaineer/voyager License code: 5C0VMTUOULFSRSTM
Slight Reliability Episode 13 - I Guess You'll Have to Latency
June 13, 2022 20:00 - 16 minutes - 11.6 MBWhat is latency and how does it relate to customer experience? Where do you measure it? Why do the metrics we choose to capture matter? You can find me on: LinkedIn: https://www.linkedin.com/in/stephentownshend/ Twitter: https://twitter.com/the_kiwi_sre Music from Uppbeat (free for Creators!). Intro: https://uppbeat.io/t/sensho/good-times License code: QBXDSEGNJZY9DDIC Outro: https://uppbeat.io/t/mountaineer/voyager License code: 5C0VMTUOULFSRSTM
Slight Reliability Episode 12 - SLO vs NFR Grudge Match!
June 06, 2022 20:00 - 10 minutes - 7.11 MBWhen it comes to reliability SLO's and NFR's are both *somewhat* related in that they allow us to describe the level of service we want to provide our customers. So how do they match up head to head? You can find me on: LinkedIn: https://www.linkedin.com/in/stephento... Twitter: https://twitter.com/the_kiwi_sre Music from Uppbeat (free for Creators!). Intro: https://uppbeat.io/t/sensho/good-times License code: QBXDSEGNJZY9DDIC Outro: https://uppbeat.io/t/mountaineer/voyager License code:...
Slight Reliability Episode 11 - The Era of Errors
May 30, 2022 20:00 - 12 minutes - 8.65 MBWhat is an error? Where do you measure errors? How do they relate to SLO's and error budgets? You can find me on: LinkedIn: https://www.linkedin.com/in/stephento... Twitter: https://twitter.com/the_kiwi_sre Music from Uppbeat (free for Creators!). Intro: https://uppbeat.io/t/sensho/good-times License code: QBXDSEGNJZY9DDIC Outro: https://uppbeat.io/t/mountaineer/voyager License code: 5C0VMTUOULFSRSTM
Slight Reliability Episode 10 - Single Pain of Glass
May 23, 2022 20:00 - 16 minutes - 11.4 MBIn this episode we discuss the observability concept of a 'single pane of glass' view, and I share my experience implementing one. You can find me on: LinkedIn: https://www.linkedin.com/in/stephento... Twitter: https://twitter.com/the_kiwi_sre Music from Uppbeat (free for Creators!). Intro: https://uppbeat.io/t/sensho/good-times License code: QBXDSEGNJZY9DDIC Outro: https://uppbeat.io/t/mountaineer/voyager License code: 5C0VMTUOULFSRSTM
Slight Reliability Episode 9 - Thoughts from SLOconf 2022
May 16, 2022 20:00 - 10 minutes - 7.04 MBIn this episode I share my thoughts from SLOconf, a conference all about Service Level Objectives (SLO's). You can find all the talks (actually 60 of them!) from SLOconf here: https://www.youtube.com/watch?v=pgZm2Bp2-AQ&list=PLLNq9CBV7AFwkXvYmjPPIQlRDVwTmacEK You can find me on: LinkedIn: https://www.linkedin.com/in/stephento... Twitter: https://twitter.com/the_kiwi_sre Music from Uppbeat (free for Creators!). Intro: https://uppbeat.io/t/sensho/good-times License code: QBXDSEGNJZY9DDIC ...
Slight Reliability Episode 8 - o11yfest Pre-reactions Pt2
May 04, 2022 20:00 - 13 minutes - 9.12 MBIn this episode I provide three more (p)re-reactions to upcoming sessions at o11yfest 2022 (a conference all about observability). The talks I cover are: "Obserability driven development" by Jessica Kerr "Return on investment driven observability" by Michael Hausenblas "How the OpenTelemetry Collector puts you in the driver seat" by Alex Boten I am also speaking at o11yfest. You can watch my talk on Bad Observability at o11yfest from May 9th to the 12th: https://o11yfest.org/ You can find...
Slight Reliability Episode 7 - o11yfest Pre-reactions
May 02, 2022 20:00 - 17 minutes - 12 MBIn this episode I provide a (p)re-reaction to three of the talks that will be included in the upcoming 2022 o11yfest conference (a conference all about observability). The talks I cover are: "Where the heck are my spans?" by Reese Lee "Confidence in chaos" by Narmatha Bala "Is MTTR still relevant in a modern, cloud native world?" by Martin Mao I am also speaking at o11yfest. You can watch my talk on Bad Observability at o11yfest from May 9th to the 12th: https://o11yfest.org/ You can find...
Slight Reliability Episode 6 - Afailability
April 11, 2022 20:00 - 10 minutes - 7.52 MBHow do you measure the availability of your services? What metric do you pick? What layer of the solution do you track it from? My colleague Gwen and I are sharing our SLO definition workshop experience at SLOconf from May 9th to 12th: https://www.sloconf.com/ You can also watch my talk on Bad Observability at o11yfest from May 9th to the 12th: https://o11yfest.org/ You can find me on: LinkedIn: https://www.linkedin.com/in/stephento... Twitter: https://twitter.com/the_kiwi_sre Music from...
Slight Reliability Episode 5 - SLO Motion
April 04, 2022 20:00 - 13 minutes - 9.4 MBIn the episode I share our team's experience defining SLO's, and how we experimented and pivoted to achieve better outcomes. As discussed in the episode, if you would like to hear (and see) more about our SLO workshop, my colleague Gwen and I are speaking at SLOconf from May 9th to 12th: https://www.sloconf.com/ You can find me on: LinkedIn: https://www.linkedin.com/in/stephento... Twitter: https://twitter.com/the_kiwi_sre Music from Uppbeat (free for Creators!). Intro: https://uppbeat....
Slight Reliability Episode 4 - Bad Observability Part 3
March 28, 2022 19:00 - 10 minutes - 7.46 MBWhat are even more antipatterns to avoid in monitoring, alerting, tracing, and logging? Shout out to James Pulley for his contribution to this episode. James is one of the world's leading experts on performance engineering and can be found on LinkedIn here: https://www.linkedin.com/in/jameslpulley3/ I will be presenting about Bad Observability at o11yfest from May 9th to the 12th 2022: https://o11yfest.org/ You can find me on: LinkedIn: https://www.linkedin.com/in/stephento... Twitter: ht...
Slight Reliability Episode 3 - Bad Observability Part 2
March 21, 2022 19:00 - 16 minutes - 11.3 MBWhat are some more antipatterns to avoid in monitoring, alerting, tracing, and logging? Shout out to Raguraman Balasubramanian (https://www.linkedin.com/in/raguraman-balasubramanian-070150108/) for his contribution to this episode. You can find me on: LinkedIn: https://www.linkedin.com/in/stephento... Twitter: https://twitter.com/the_kiwi_sre Music from Uppbeat (free for Creators!). Intro: https://uppbeat.io/t/sensho/good-times License code: QBXDSEGNJZY9DDIC Outro: https://uppbeat.io/t...
Slight Reliability Episode 2 - Bad Observability Part 1
March 14, 2022 19:00 - 17 minutes - 11.8 MBWhat are some antipatterns to avoid in monitoring, alerting, tracing, and logging? Music from Uppbeat (free for Creators!). Intro: https://uppbeat.io/t/sensho/good-times License code: QBXDSEGNJZY9DDIC Outro: https://uppbeat.io/t/mountaineer/voyager License code: 5C0VMTUOULFSRSTM
Slight Reliability Episode 1 - What the heck is SRE anyway?
March 07, 2022 19:00 - 9 minutes - 6.61 MBWhat is SRE *really* about? How did it start? What do I *want* it to be? What is it being implemented as in the industry? Music from Uppbeat (free for Creators!). Intro: https://uppbeat.io/t/sensho/good-times License code: QBXDSEGNJZY9DDIC Outro: https://uppbeat.io/t/mountaineer/voyager License code: 5C0VMTUOULFSRSTM
Performance Time Episode 28: The Grand Finale!
February 17, 2022 19:00 - 23 minutes - 16 MBIn this episode I wrap up the Performance Time show with some commentary around my blog "Wrapping up 13 years as a Performance Engineer" (https://www.linkedin.com/pulse/wrapping-up-13-years-performance-engineering-stephen-townshend/)
Performance Time Episode 27: Oh No! Not Documentation!
January 17, 2022 19:00 - 12 minutes - 8.46 MBCan documentation be... fun? How is it relevant to SRE?
Performance Time Episode 26: Chicken or the Egg? SLI's and SLO's and which comes first?
January 10, 2022 19:00 - 9 minutes - 6.54 MBThis week we look at whether to identify SLI's or SLO's first - and take a look at how both approaches might look like with a ridiculous example. The Google Site Reliability Book is free here: https://sre.google/books/ (I bought and listened to the audiobook on Audible). The intro and background music is "Elevator Music Lofi" by Oleksii Kaplunskyi.
Performance Time Episode 25: Defining Service Level Indicators
December 06, 2021 18:00 - 15 minutes - 10.5 MBWhat are SLI's, SLO's, and SLA's? Why do they matter? How can you identify SLI's for your products? The intro and background music is "Elevator Music Lofi" by Oleksii Kaplunskyi.
Performance Time Episode 24: Kubernetes Explained
November 15, 2021 19:00 - 9 minutes - 6.42 MBIn this episode I attempt to explain Kubernetes and what it's all about.
Performance Time Episode 23: Docker
October 26, 2021 07:00 - 8 minutes - 5.93 MBIn this episode I try and explain Docker and containers as simply as possible.
Performance Time Episode 22: Signals Amongst the Noise
October 04, 2021 08:00 - 8 minutes - 5.93 MBHow do you know what to alert on? And are there things we shouldn't alert on? What about dashboards? In this episode we talk about locating signals amongst the noise of monitoring and logging data in our organisations.
Performance Time Episode 21: The Four Golden Signals
September 12, 2021 08:00 - 11 minutes - 8.14 MBThis week we talk about the Four Golden Signals, a starting point when choosing what to monitor and alert on in your software platforms.
Performance Time Episode 20: What is SRE?
September 05, 2021 08:00 - 11 minutes - 8.14 MBIn this episode I share my first week working as an SRE and try to explain what it's all about.
Performance Time Episode 19: Not As Broken
August 10, 2021 08:00 - 8 minutes - 5.82 MBIn this update I give a quick update on how things are going and where the podcast is going next.
Performance Time Episode 18: Broken
July 05, 2021 06:00 - 17 minutes - 12 MBIn this episode I share an experience of work related anxiety I recently went through. Shout out to Joey Hendricks for reviewing this content before it went live.
Performance Time Episode 17: Engineer to Advocate
June 28, 2021 08:00 - 10 minutes - 7.29 MBWhat are the challenges when performance engineers are asked to move away from hands on technical work, into advocating and enabling others? Shout out to Sajeesh Nair and Ben Rowan who's LinkedIn comments I refer to.