kishoreg commented on issue #4400: what is the difference between Pinot and 
Druid
URL: 
https://github.com/apache/incubator-pinot/issues/4400#issuecomment-509098996
 
 
   Real-time Ingest
   - Druid supports two modes: Push (Tranquility) and Pull (Kafka Indexing 
Service). Push has the limitation that late-arriving events (those arriving 
after a certain window) are dropped. I am not sure whether this limitation 
also applies to the pull mode.  
https://druid.apache.org/docs/latest/ingestion/stream-ingestion.html
   
   Pinot supports ingesting data directly from Kafka. We support two modes: 
the high-level Kafka consumer or the low-level Kafka consumer. With the 
low-level consumer you can get exactly-once semantics. Pinot does not drop 
late-arriving events.
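
To make the late-event difference concrete, here is a minimal sketch. It is not taken from Tranquility or Pinot; the `Event` type, the function names, and the `window_seconds` parameter are all hypothetical. It only contrasts a window-gated push path with a pull path that consumes every record in log order.

```python
from dataclasses import dataclass


@dataclass
class Event:
    timestamp: int  # event time, in seconds (hypothetical unit)
    value: int


def windowed_push_ingest(events, now, window_seconds):
    """Accept only events whose event time is within window_seconds of now.

    Illustrates the window limitation described above; everything outside
    the window is dropped.
    """
    accepted, dropped = [], []
    for event in events:
        if abs(now - event.timestamp) <= window_seconds:
            accepted.append(event)
        else:
            dropped.append(event)
    return accepted, dropped


def offset_pull_ingest(events):
    """Consume every event in log order, regardless of its event time."""
    return list(events)


events = [Event(timestamp=1000, value=1), Event(timestamp=400, value=2)]
accepted, dropped = windowed_push_ingest(events, now=1000, window_seconds=300)
pulled = offset_pull_ingest(events)
```

In this toy run the second event is 600 seconds old, so the windowed path drops it while the pull path keeps both events.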
   
   Segment
   - Pinot and Druid each have their own custom columnar format for storing 
data. At a high level, a segment contains a forward index (mostly 
dictionary-encoded) and an optional inverted index. Pinot has an additional 
indexing scheme, the star-tree index, which can speed up aggregations. 
https://engineering.linkedin.com/blog/2019/06/star-tree-index--powering-fast-aggregations-on-pinot
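
As a rough illustration of the segment layout described above, the sketch below builds a dictionary-encoded forward index and an inverted index for a single column. It is a simplified in-memory model, not the on-disk format of either Pinot or Druid, and the function names are hypothetical. The star-tree index is not covered here.

```python
import bisect


def build_column_indexes(values):
    """Build a sorted dictionary, a forward index, and an inverted index.

    dictionary: sorted list of distinct values (position = dictionary id)
    forward:    one dictionary id per document, in document order
    inverted:   dictionary id -> list of document ids containing that value
    """
    dictionary = sorted(set(values))
    value_to_id = {value: i for i, value in enumerate(dictionary)}
    forward = [value_to_id[value] for value in values]
    inverted = {}
    for doc_id, dict_id in enumerate(forward):
        inverted.setdefault(dict_id, []).append(doc_id)
    return dictionary, forward, inverted


def docs_matching(dictionary, inverted, value):
    """Return the document ids whose column equals value, via the inverted index."""
    dict_id = bisect.bisect_left(dictionary, value)
    if dict_id == len(dictionary) or dictionary[dict_id] != value:
        return []
    return inverted.get(dict_id, [])


country = ["us", "in", "us", "uk"]
dictionary, forward, inverted = build_column_indexes(country)
us_docs = docs_matching(dictionary, inverted, "us")
```

A filter such as `country = 'us'` can then be answered from the inverted index alone, without scanning the forward index.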
   
   
   See slides 21, 28, and 35 for a performance comparison between Pinot and 
Druid. 
https://www.slideshare.net/KishoreGopalakrishna/building-real-time-analytics-applications-using-pinot-a-linkedin-case-study
   
   
   

