kishoreg commented on issue #4400: what is the difference between Pinot and Druid
URL: https://github.com/apache/incubator-pinot/issues/4400#issuecomment-509098996

Real-time Ingest - Druid supports two modes: push (Tranquility) and pull (Kafka Indexing Service). Push has the limitation that late-arriving events (those arriving after a certain window) are dropped. I am not sure whether this limitation also applies to the pull mode.
https://druid.apache.org/docs/latest/ingestion/stream-ingestion.html

Pinot supports ingesting data directly from Kafka. We support two modes: the high-level Kafka consumer and the low-level Kafka consumer. With the low-level consumer you can get exactly-once semantics. Pinot does not drop late-arriving events.

Segment - Pinot and Druid each have their own custom columnar format to store data. At a high level, a segment contains a forward index (mostly dictionary encoded) and an inverted index (optional). Pinot has an additional indexing scheme known as the star-tree index, which can speed up aggregations.
https://engineering.linkedin.com/blog/2019/06/star-tree-index--powering-fast-aggregations-on-pinot

See slides 21, 28, 35 for a performance comparison between Pinot and Druid.
https://www.slideshare.net/KishoreGopalakrishna/building-real-time-analytics-applications-using-pinot-a-linkedin-case-study
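
To make the segment description above a bit more concrete, here is a minimal Python sketch of a dictionary-encoded forward index and an inverted index for a single column. It is only a toy illustration of the general idea, not the actual on-disk format of Pinot or Druid; the function names build_column_indexes and rows_matching are hypothetical.

```python
from bisect import bisect_left
from collections import defaultdict


def build_column_indexes(values):
    """Build toy indexes for one column (illustrative only, not Pinot's format)."""
    # Dictionary: sorted distinct values; a value's position is its dictionary id.
    dictionary = sorted(set(values))
    value_to_id = {value: dict_id for dict_id, value in enumerate(dictionary)}

    # Forward index: row number -> dictionary id.
    forward_index = [value_to_id[value] for value in values]

    # Inverted index: dictionary id -> row numbers that hold that value.
    inverted_index = defaultdict(list)
    for row, dict_id in enumerate(forward_index):
        inverted_index[dict_id].append(row)

    return dictionary, forward_index, dict(inverted_index)


def rows_matching(dictionary, inverted_index, value):
    """Return the rows where the column equals `value`, using the inverted index."""
    # Binary search in the sorted dictionary to find the id of `value`.
    dict_id = bisect_left(dictionary, value)
    if dict_id == len(dictionary) or dictionary[dict_id] != value:
        return []  # value never occurs in this column
    return inverted_index[dict_id]


if __name__ == "__main__":
    country = ["us", "in", "us", "uk", "in", "us"]
    dictionary, forward_index, inverted_index = build_column_indexes(country)
    print(dictionary)
    print(forward_index)
    print(rows_matching(dictionary, inverted_index, "us"))
```

With the inverted index, an equality filter reads one list of row numbers instead of scanning every row of the forward index, which is why the inverted index is worth its extra storage on frequently filtered columns.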