SRE / DevOps / Kubernetes Weekly Collection#49(Week 1, 2021)

  • In this blog post series, I collect the following 3 Weekly Mailing List I subscribe to, leave some comments as an aide-memoire and useful links.

DEVOPS WEEKLY ISSUE #523 January 3rd, 2021
SRE Weekly Issue #251 January 3rd, 2021
KubeWeekly #245 January 8th, 2021

DEVOPS WEEKLY ISSUE #523 January 3rd, 2021


A good discussion of the benefits of compression. Starting out with a nice case study of migrating log data, bit expanding to other data transfer use cases. Particularly interesting because of the numbers and showing the financial savings on cloud services.

  • The title is “How we compress Pub/Sub messages and more, saving a load of money”.

Working to reduce the time taken for CI/CD pipelines is nearly always a good investment of time. This post covers a few areas you can likely optimise.


A great introduction to message queues, with a detailed look at various software options including AWS and GCP services, Kafka, NSQ and NATs.

  • The title is “The Big Little Guide to Message Queues”.

A quick introduction to Kyverno, an open source policy based tool for Kubernetes. In this example we’re shown how to easily automate adding labels to namespaces.

  • The title is “Auto-labeling Kubernetes resources with Kyverno”.

A strong argument for strong schemas when defining APIs.

  • The title is “API design is stuck in the past”.


Buf is providing a toolkit for making working with Protobuf APIs much easier, both for producers and consumers. Code generation as well as built-in linting and breaking change detection.

  • The Github page of “Buf”. It is being developed by Buf, which suggests schema-driven development on the above article.

Zap describes itself as a simple cross-platform configuration management and orchestration tool. Store reusable plans in HCL and run a variety of tasks against remote machines.

  • The GitHub page for “Zap”, a simple cross-platform orchestration and configuration management tool.

Clutch provides a platform for runtime changes to infrastructure. Out of the box it has lots of features, but it’s mainly about making it easy to build custom developer dashboards with extensions.

  • The Web page of the open source Web UI and API platform “Clutch”. It is designed to simplify, accelerate, and mitigate common debugging, maintenance, and operational tasks.

SRE Weekly Issue #251 January 3rd, 2021


Writing Runbook Documentation When You’re An SRE

Tips and tricks for writing effective runbook documentation when you aren’t a technical writer

I like the discussion of the “Curse of Knowledge” cognitive bias.

Taylor Barnett — Transposit

  • The author found that there are two main reasons that engineers don’t want to write documentation: 1. There isn’t an incentive structure for doing the work, and 2. they are unsure of how to write good documentation.

SLO — From Nothing to… Production

Here’s one engineer’s SLO journey.

My main focus is on how I educated myself about SLOs and how applied this to my organization.

Ioannis Georgoulas

  • As the title suggests, the author learned about SLO(Service Level Objectives) in a few months from scratch, and shared the efforts applied to the production environment in the following major items.
    ○ Prepare yourself
    ○ Where to start
    ○ Take ownership and be an SLO advocate
    ○ Build the framework
    ○ Summary

How to sell SLOs to Engineering Directors

This blog is a redacted internal memo that aimed to familiarize SLOs with its audience, explain the value of an SLO culture, and describe how we would implement and roll them out.

Thomas Césaré-Herriau — Brex

  • Since I have covered it in last week’s DEVOPS WEEKLY# 522, so I will skip it.

Why I’ve Been Merging Microservices Back Into The Monolith At InVision

Why would you do this? It’s all about Conway’s Law.

Ben Nadel

  • Since I have covered it in last week’s DEVOPS WEEKLY# 522, so I will skip it.

Incident Phenomena: Shorthand Names, à la Danny Ocean

The folks at Adaptive Capacity Labs have seen a few patterns crop up over and over in their post-incident reviews. How many of these have you seen before?

John Allspaw — Adaptive Capacity Labs

  • It explains what it named the patterns observed at the time of incidents.

Home Alone: a Post-Incident Review

Lots of complex contributing factors led to the main character being left behind in the movie Home Alone… so let’s treat it like a production incident!

Fred Hebert

  • The author wrote an incident investigation the way he would do them for work issues for “Home Alone” as an incident that Kevin was behind, and he had stuck at home to fight burglars.

Making sense of what happened is hard

This one includes a complex timeline showing the interplay of two pairs of bugs, where one in each pair masked the other.

Lorin Hochstein

  • The author has taken half of Dr. Hannah Harvey’s course, “The Art of Storytelling’’ and is trying to explain obstacles with a focus on oral storytelling. A diagram is used in the article because the content is complicated, but it seems that he would continue to make efforts to improve the technique of oral storytelling. I also want to improve my storytelling skills too.
    ○ “Now I just need to figure out how to tell this as a story without the benefit of a diagram.”

KubeWeekly #245 January 8th, 2021

The Headlines

Editor’s pick of the highlights from the past week.

2020 CNCF Annual Report

The Cloud Native Computing Foundation (CNCF) annual report for 2020 is now available. The report highlights the growth of the community, events, projects, and more, over the past year.

As CNCF celebrated its fifth birthday in 2020, it achieved greater engagement through membership growth, event attendance growth, increased end user participation, and broad industry commentary.

  • An introductory article on CNCF’s “2020 CNCF Annual Report”.

Kubernetes Security Essentials Course Now Available

Today Linux Foundation Training & Certification and the Cloud Native Computing Foundation are announcing the availability of their newest training course, LFS260 — Kubernetes Security Essentials. The course provides skills and knowledge on a broad range of best practices for securing container-based applications and Kubernetes platforms during build, deployment and runtime. It is also a great way to prepare to take the recently launched Certified Kubernetes Security Specialist (CKS) certification exam.

  • The training course of “Kubernetes Security Essentials (LFS260)” is released. I can now access what I purchased on Cyber ​​Monday to take the CKS exam this year.

The Technical

Tutorials, tools, and more that take you on a deep dive into the code.

Self-hosting Kubernetes is your Raspberry Pi

Alex Ellis, OpenFaas

  • It explains how to build Kubernetes clusters using Raspberry Pi 4s for self-hosting APIs, websites, and functions and publish them on the Internet to provide traffic to your users.

The Level Up Hour (S1E20): Kubernetes and Docker Deprecation

Langdon White, Scott McCarty, and Chris Short, Red Hat

  • Red Hat’s Twitch video series “The Level Up Hour”. It explains the matter in the title.

Switching on the cluster insights using Headlamp

Saiyam Pathak, Civo

  • An introductory article on Kinvolk’s new open source, easy-to-use and extensible Kubernetes web UI “Headlamp”.

Build a Prometheus Dashboard for K3s with Wio Terminal

Janakiram MSV, The New Stack

  • A tutorial as the title suggests, using Seeed Studio’s Arduino-compatible microcontroller and a compact device “Wio Terminal” with a 2.4-inch LCD.

Tutorial: Host a Local Podman Image Registry

Jack Wallen, The New Stack

  • The content of the title is explained with the background and the following points and procedures.
    ○ Registry vs. Repository vs. Tag
    ○ Create the Local Registry
    ○ Push Your First Image to the New Registry

What happens when you create a Pod in Kubernetes? (Video)

Salman Iqbal, Data Science Campus

  • The content of the title is explained carefully in the YouTube video.

Run Kubernetes Production Environment on EC2 Spot Instances With Zero Downtime: A Complete Guide

Kfir Schneider, Riskified

  • As the title suggests, it aims to show you how to use AWS EC2 Spot Instances to significantly reduce the cost of your k8s cluster and give you the confidence you need to run on Spot Instances with highly available workloads in production.

The Editorial

Articles, announcements, and morethatgive you a high-level overview of challenges and features.

All About Calico

Alex Pollitt, Tigera Saiyam Pathak, Civo

  • It explained carefully for 70 minutes on YouTube video. It talks from Kubernetes Networking, and gets into Calico and explains network technologies/considerations, Calico’s Vision, and more.

Red Hat OpenShift supports both Windows and Linux containers

Steven J. Vaughan-Nichols, ZDNet

  • It introduces Red Hat’s latest OpenShift Kubernetes feature, which will be available in early 2021 to run and manage both Linux and Windows containers from a single platform.

Top Considerations when Evaluating an Ingress Controllers for Kubernetes

Harry Tsiligiannis, ReleaseOps

  • The following points are described as important considerations to conduct ○ the decision-making process to avoid costly mistakes.
    ○ 1) Traffic protocol support
    ○ 2) Client management
    ○ 3) Traffic routing
    ○ 4) Resiliency
    ○ 5) Load balancing algorithms
    ○ 6) Authentication
    ○ 7) Observability
    ○ 8) Kubernetes Integration
    ○ 9) Traffic routing
    ○ 10) Interface
    ○ Tip: Use multiple ingress controllers to fill in the gaps
    ○ Final thoughts

8 Kubernetes insights for 2021

Scott McCarty, Red Hat

  • As the title suggests, it has insights on the following eight points. It’s better to be numbered to distinguish the numbers in the text.
  1. Basic kubectl and Helm commands for beginners by Jessica Cherry

Infrastructure Engineering — The Kubernetes Way

Vignesh T V, Timecampus

  • The second article in a two-part series on Kubernetes and its ecosystem. We’re digging deeper into the infrastructure one by one.

Upcoming CNCF Online Programs

We have expanded our webinar program to Online Programs! Stay tuned for the content release schedule.

  • The “Upcoming CNCF webinars” section has been changed to “CNCF Online Programs”, and the just the latest Webinar List has been expanded/changed from 2021. Below are some of the points I am interested in.

How about those articles? Do you have any interest in any?

Actually, I have some contents which I can not digest at this stage, I’ll make use of this aide-memoire and links for catching-up for myself too.

Bye now!!

Yoshiki Fujiwara

An infra engineer in Tokyo, Japan. Grew up in Athens, Greece(1986–1992). #Network, #Kubernetes, #GCP, #Certified AWS SAP

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store