Monday, September 2, 2019

Kafka, Spark, and Elasticsearch (architecture comparison)





                         | Kafka             | Spark                      | Elasticsearch
Horizontal scale         | Topic/Partition   | RDD/Partition              | Index/Shard (default 5 primary shards per index before 7.0; 1 since 7.0)
Resource/cluster manager | ZooKeeper         | YARN, Mesos, or standalone | Master Node
Cluster node             | Broker            | Worker                     | Node
Execution unit           | Partition Leader  | Executor                   | Lucene Index
Core                     |                   | RDD                        | Inverted index
Applications             | Producer/Consumer | Driver                     | Search
Replication              | Follower (ISR)    |                            | Replica shard
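
The horizontal-scale row rests on the same idea in all three systems: a record is assigned to one of N partitions or shards, typically by hashing a key and taking the result modulo N. The sketch below only illustrates that idea. It uses CRC32 as a stand-in hash, whereas the real systems use their own hash functions (Kafka's default partitioner and Elasticsearch routing use Murmur variants), so the numbers it produces will not match theirs. The function name `assign_partition` and the sample keys are hypothetical.

```python
import zlib


def assign_partition(key: str, num_partitions: int) -> int:
    """Map a key to a partition/shard index by hash-mod (illustrative only)."""
    if num_partitions <= 0:
        raise ValueError("num_partitions must be positive")
    # CRC32 is a stand-in hash (an assumption for this sketch); it is NOT
    # the hash that Kafka, Spark, or Elasticsearch actually use.
    return zlib.crc32(key.encode("utf-8")) % num_partitions


# The same key always maps to the same partition, so all records for
# one key end up in the same place.
for key in ["user-1", "user-2", "user-3"]:
    print(key, "->", assign_partition(key, 5))
```

Because the result depends on N, changing the number of partitions or shards generally changes where existing keys map, which is one reason the count is usually chosen up front.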


Kafka:



Spark:


Elasticsearch:
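
The table lists the inverted index as the core structure in Elasticsearch (through Lucene). As a rough sketch of the concept only, and not Lucene's actual data format, an inverted index maps each term to the set of document IDs that contain it. The names `build_inverted_index` and `search`, the whitespace tokenizer, and the sample documents are all made up for illustration.

```python
from collections import defaultdict
from typing import Dict, Set


def build_inverted_index(docs: Dict[int, str]) -> Dict[str, Set[int]]:
    """Map each lowercased, whitespace-separated term to the IDs of documents containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return dict(index)


def search(index: Dict[str, Set[int]], query: str) -> Set[int]:
    """Return IDs of documents that contain every term in the query (AND semantics)."""
    terms = query.lower().split()
    if not terms:
        return set()
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result


# Hypothetical sample documents.
docs = {
    1: "Kafka stores a partitioned log",
    2: "Spark processes a partitioned RDD",
    3: "Elasticsearch stores an inverted index",
}
index = build_inverted_index(docs)
print(search(index, "partitioned"))
print(search(index, "stores inverted"))
```

A lookup touches only the entries for the query terms rather than scanning every document, which is the property that makes this structure suitable for search.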





