Kafka – Everything that you should know before interview

Introduction to Apache Kafka Concepts Read complete post Apache Kafka – Set up your first Kafka Producer and Consumer Read complete post Kafka Internals: How does Kafka store the data? Read complete post Reliable Data Delivery In Kafka Read complete post Troubleshooting Under Replicated Kafka Partitions Read complete post Kafka Broker Metrics And Their Debugging... Continue Reading →

Kafka Broker Metrics And Their Debugging

If you are new to Kafka, please read the first three posts of the series given below. Else dive in.  Introduction to Kafka Kafka Internals Reliable Data Delivery in Kafka Troubleshooting Under Replicated Kafka Partitions If you are preparing for an interview, this post contains most of the things that you should know about Kafka.... Continue Reading →

Troubleshooting Under Replicated Kafka Partitions

There are two types of replica: Leader replica and Follower replica. Let’s say that there are three replicas of a partition. One of them, should be a leader. All the requests from producers and consumers would pass to the leader in order to guarantee consistency. All the replicas other than the leader are called follower.... Continue Reading →

Kafka Internals: How does Kafka store the data?

A quick recap of the basic terminology of Kafka can be found here. The fundamental unit that Kafka manages is called a message. Message is simply a byte array without any restriction of the format. As you would know, producers produce data to a topic. Each topic is further divided into multiple partitions. Messages are... Continue Reading →

Apache Kafka Introductory Tutorial – Set up your first Kafka Producer and Consumer

Prerequisite A basic understanding about Apache Kafka. Please go through the introduction once, should you need it. Installation Download the Kafka from here. Untar the tar file that is downloaded. tar -xzf kafka_2.11-2.1.0.tgz Run The Server cd kafka_2.11-2.1.0 Start the zookeeper first bin/zookeeper-server-start.sh config/zookeeper.properties Zookeeper would be running at 2181 port. You can check that... Continue Reading →

Introduction to Apache Kafka Concepts

What is Apache Kafka? Apache Kafka is a distributed streaming platform. This essentially means that Kafka has following capabilities: Publish and subscribe to the stream of records just like how message queue (e.g. RabbitMQ) works Store streams of records for specified time. Kakfa is not a replacement for database or a logging platform. However, kafka... Continue Reading →

Up ↑