Kafka Internals: How does Kafka store the data?

A quick recap of the basic terminology of Kafka can be found here. The fundamental unit that Kafka manages is called a message. Message is simply a byte array without any restriction of the format. As you would know, producers produce data to a topic. Each topic is further divided into multiple partitions. Messages are... Continue Reading →

Introduction to Apache Kafka Concepts

What is Apache Kafka? Apache Kafka is a distributed streaming platform. This essentially means that Kafka has following capabilities: Publish and subscribe to the stream of records just like how message queue (e.g. RabbitMQ) works Store streams of records for specified time. Kakfa is not a replacement for database or a logging platform. However, kafka... Continue Reading →

Up ↑