Introduction to Apache Kafka

Apache Kafka is a real-time data pipeline processor. It high-scalability, fault tolerance, execution speed, and fluid integrations are some of the key hallmarks that make it an integral part of many Enterprise Data architectures. In this lab intensive two day course, students will learn how to use Kafka to build streaming solutions.

Retail Price: $1,895.00

Next Date: 06/06/2024

Course Days: 2

Enroll in Next Date

Request Custom Course

Course Objectives

This “skills-centric” course is about 50% hands-on lab and 50% lecture, coupling the most current techniques with the soundest industry practices. Throughout the course students will be led through a series of progressively advanced topics, where each topic consists of lecture, group discussion, comprehensive hands-on lab exercises, and lab review.

Working in a hands-on learning environment, students will explore

  • Overview of Streaming technologies
  • Kafka concepts and architecture
  • Programming using Kafka API
  • Kafka Streams
  • Monitoring Kafka
  • Tuning / Troubleshooting Kafka


Course Prerequisites

This in an Introductory and beyond level course is geared for experienced Java developers seeking to be proficient in Apache Kafka.  Attendees should be experienced developers who are comfortable with Java, and have reasonable experience working with databases. Students should also be able to navigate Linux command line, and who have basic knowledge of Linux editors (such as VI / nano) for editing code.

Course Agenda


Please note that this list of topics is based on our standard course offering, evolved from typical industry uses and trends. We’ll work with you to tune this course and level of coverage to target the skills you need most.  

  1. Introduction to Streaming Systems
  • Fast data
  • Streaming architecture
  • Lambda architecture
  • Message queues
  • Streaming processors
  1. Introduction to Kafka
  • Architecture
  • Comparing Kafka with other queue systems (JMS / MQ)
  • Kaka concepts : Messages, Topics, Partitions, Brokers, Producers, commit logs
  • Kafka & Zookeeper
  • Producing messages
  • Consuming messages (Consumers, Consumer Groups)
  • Message retention
  • Scaling Kafka
  • Labs : Getting Kafka up and running; Using Kafka utilities
  1. Programming With Kafka
  • Configuration parameters
  • Producer API (Sending messages to Kafka)
  • Consumer API (consuming messages from Kafka)
  • Commits , Offsets, Seeking
  • Schema with Avro
  • Lab : Writing Kafka clients in Java; Benchmarking Producer APIs
  1. Kafka Streams
  • Streams overview and architecture
  • Streams use cases and comparison with other platforms
  • Learning Kafka Streaming concepts (KStream, KTable, KStore)
  • KStreaming operations (transformations, filters, joins, aggregations)
  • Labs: Kafka Streaming labs
  1. Administering Kafka
  • Hardware / Software requirements
  • Deploying Kafka
  • Configuration of brokers / topics / partitions / producers / consumers
  • Security: How secure Kafka cluster, and secure client communications (SASL, Kerberos)
  • Monitoring : monitoring tools
  • Capacity Planning : estimating usage and demand
  • Trouble shooting : failure scenarios and recovery
  1. Monitoring and Instrumenting Kafka
  • Monitoring Kafka
  • Instrumenting with Metrics library
  • Labs; Monitor Kafka cluster
  • Instrument Kafka applications and monitor their performance

Optional Case Study / Workshop (Time-Permitting)

Students will build an end-to-end application simulating web traffic and send metrics to Grafana

Course Dates Course Times (EST) Delivery Mode GTR
6/6/2024 - 6/7/2024 10:00 AM - 6:00 PM Virtual gauranteed to run course date Enroll
8/8/2024 - 8/9/2024 10:00 AM - 6:00 PM Virtual Enroll
10/3/2024 - 10/4/2024 10:00 AM - 6:00 PM Virtual Enroll
12/5/2024 - 12/6/2024 10:00 AM - 6:00 PM Virtual Enroll