Categories: Kafka, Kafka Streams
Course Length: 2 Days

Apache Kafka is a framework implementation of a software bus using stream-processing. It is an open-source software platform developed by the Apache Software Foundation written in Scala and Java. The project aims to provide a unified, high-throughput, low-latency platform for handling real-time data feeds. It is also a part of an entire ecosystem in the Confluent Platform that includes Schema Registry, Connect, Streams, and KSQLDB.

This class aims to discuss what is Kafka, how Kafka is set up, and the different components of Kafka, its architecture. We discuss how to program Producer and Consumer.

  • Introduction

    • What is it?

    • Why do you need it?

    • What are the benefits?

  • Basic Components

    • Broker

    • Zookeeper

    • Messages

    • Producer

    • Consumer

    • Controller

    • Group Coordinator

    • Group Leader

  • Important Concepts

    • Replicas & High Watermarks

    • Ack

    • Retention

    • Rebalancing

  • Programming

    • Command Line Utilities

    • Producer

    • Consumer

  • Streaming

    • Streaming Introduction

    • Parallel Processing

    • KStream

    • KTable