This looks very interesting! Can't wait to see some how-tos for getting the server nodes and Kafka pipelines set up. I'd be very interested in trying this once it's set up.
Thanks!

On Tue, Jan 3, 2017 at 2:35 AM, WeiWan <[email protected]> wrote:

> IndexR is a distributed, columnar storage system based on HDFS, which
> focuses on fast analysis of both massive static (historical) data and
> rapidly ingested realtime data. IndexR is designed for OLAP.
>
> Fast analysis on large datasets
> Realtime ingestion with zero delay for queries
> Deep integration with the Hadoop ecosystem
> Hardware efficiency
> Highly available, scalable, manageable and simple
> Adapted to popular query engines like Apache Drill, Apache Hive, etc.
>
> And now it is open source.
>
> Project: https://github.com/shunfei/indexr
> Wiki: https://github.com/shunfei/indexr/wiki
>
> IndexR was originally developed by Sunteng Tech. The project started a
> year ago and has now been deployed to several production systems in our
> company. The whole cluster consumes over 30 billion events each day in
> realtime from Kafka. The largest table contains over 10 billion rows
> (after rollup) and is growing rapidly. The latency of most
> statistical/analytical queries is less than 3 seconds in real-world
> production environments.
>
> Currently it is mainly used as a Drill and Hive storage plugin. It
> should be quite easy to master.
>
> We hope IndexR will be of use to you, and that you will help make it
> better.
>
> Regards
> Flow Wei
