Kafka for Operations & SRE

A focused 3-day training course for operators, DevOps engineers, and SRE teams who manage Apache Kafka in production. This hands-on course covers the full operational lifecycle — from cluster sizing and monitoring to disaster recovery, security hardening, and enterprise automation.

Every concept is immediately practiced in hands-on labs using real-world scenarios from enterprise Kafka deployments. Our trainers are active engineers who deploy and operate Kafka platforms processing billions of events per day.

Available on-site at your location or online — in English or German.

Course Overview

Target Audience

Operators, DevOps engineers, and SRE teams who manage or plan to manage Apache Kafka clusters in production. Suitable for teams adopting Kafka as well as operators looking to deepen their existing operational expertise.

Duration & Format

3 days | 40% theory + 60% hands-on labs | Maximum 10 participants per session for individual attention and meaningful guidance.

Prerequisites

Participants should be comfortable with Linux administration, networking basics, and containers and container orchestration (Docker, Kubernetes). Experience operating distributed systems is helpful. No prior Kafka experience is required.

Customizable Content

We adapt the agenda to your Kafka distribution, infrastructure, and operational maturity. Running Confluent Platform? Using Strimzi on Kubernetes? Deploying on bare metal? We tailor the content accordingly.

60% Hands-On Practice

Every concept is immediately applied in hands-on labs. No death by slides: you set up, test, and debug real Kafka clusters throughout the course.

Taught by Production Engineers

Your trainers build and operate Kafka platforms in production every day. Real-world war stories, not textbook theory — learn from engineers who’ve solved the problems you’ll face.

Vendor-Independent

We offer neutral expertise, free from vendor lock-in. Our focus is on open-source Apache Kafka — not on selling a specific vendor’s product.

Flexible On-Site

Remote or at your company — we come to you. Maximum 10 participants for hands-on, personalized guidance.

German or English

You decide the language. All materials are available in both German and English. 40% knowledge transfer, 60% hands-on practice.

Course Agenda

Day 1: Operating Kafka

We start with the foundations every Kafka operator needs — how the cluster works under the hood and how to set it up for reliable, observable production workloads.

Focus:

  • How do we set up a Kafka cluster properly? Installation options — bare metal, VMs, containers, and Kubernetes — with configuration best practices for production
  • How big does our cluster need to be? Cluster sizing for storage, memory, CPU, and network — so you provision confidently instead of guessing
  • How do we know if our cluster is healthy? Monitoring with Prometheus and Grafana, the metrics that actually matter, and alerting rules that catch problems before users do

Day 2: Problem Resolution

We tackle the scenarios that keep operators up at night — broker failures, data loss risks, security breaches, and networking headaches.

Focus:

  • What happens when a broker goes down? Disaster recovery strategies, cross-datacenter replication, and RPO/RTO planning so you can sleep soundly
  • How do we keep Kafka secure? TLS encryption, authentication mechanisms, and ACL patterns — practical security that does not slow your team down
  • How do we handle networking and load balancing? Listener configuration, multi-network setups, and partition rebalancing to keep traffic flowing evenly

Day 3: Production Implementation

We bring everything together into an enterprise-ready operational model — automation, upgrades, and the practices that separate ad-hoc operations from professional platform engineering.

Focus:

  • How do we automate Kafka operations? Infrastructure as code, GitOps workflows, and self-service topic provisioning for development teams
  • How do we upgrade without downtime? Rolling upgrade strategies, version compatibility, and pre-flight checks that keep your cluster safe
  • What does an enterprise Kafka platform look like? Reference architectures, multi-tenancy, capacity management, and incident response playbooks

Where We Deliver

We deliver Apache Kafka training on-site across Europe and remotely worldwide. Based in Switzerland, our engineers bring years of production expertise directly to your team — whether you’re standing up your first Kafka cluster or hardening an existing platform for enterprise-grade operations.

Our training is not generic classroom material. Every example, lab, and discussion is drawn from real enterprise Kafka deployments in regulated industries.

On-Site Across Europe

Switzerland, Germany, and Austria (the DACH region), as well as the rest of Europe. Our engineers travel to your location for hands-on, in-person training with your team.

Remote for US & Worldwide

Same depth and interactivity via video conference. Ideal for distributed teams across time zones — no compromise on quality.

Swiss-Based: Kafka Training in Switzerland

We are based in Switzerland and deliver Kafka training locally in German or English. Local expertise, international reach.

Getting Started

From inquiry to confirmed training — straightforward and fast.

Contact Us

Tell us your preferred dates, team size, and any specific topics you want to emphasize. We respond within one business day.

Tailored Agenda

We review your Kafka environment and team background, then propose a customized training agenda. If you use specific tools, infrastructure, or cloud providers, we incorporate them into the labs.

Schedule & Confirm

We finalize dates and logistics (on-site or online) and handle any procurement or legal requirements. Scheduling is flexible: weekdays, either consecutive or split across weeks.

Training Delivery

Three days of intensive, hands-on Apache Kafka operations training delivered by a senior Acosom engineer. Your team leaves with practical skills they can apply immediately.

Ready to upskill your operations team? Contact us to schedule your Apache Kafka operations training — custom dates, tailored content, delivered by production engineers.

Book Kafka Operations Training