See inline.

On Dec 5, 2014, at 3:37 AM, Ajay <[email protected]> wrote:

> Hello,
> 
> I work for an eCommerce company. Currently we are looking at building a Data 
> warehouse platform as described below:
> 
> DW as a Service
>     |
> REST API
>     |
> SQL On No SQL (Drill/Pig/Hive/Spark/Storm)
>     |
> No SQL databases (One or more. May be RDBMS directly too)
>     | (Bulk load)
> My SQL Database    
> 
> I wish to get a few clarifications on Apache Drill as follows:
> 
> 1) Can we use Storm on No SQL or do we need to mix them with  Pig/Hive/Spark 
> or any other for any reason?
        Not sure what you mean here.
> 2) Can Storm be used as a query interface for Business Intelligence, Analytics 
> and Reporting?
        The general pattern I have seen is to run Storm for 
aggregation/filtering and load the results into a data store that BI people are 
most familiar with (Hive). If you want Storm itself as the query interface, you 
might want to look into the Pig on Storm proposal: 
https://cwiki.apache.org/confluence/display/PIG/Pig+on+Storm+Proposal
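        To make the aggregate/filter-then-load pattern concrete, here is a 
minimal sketch in plain Python. It does not use the Storm API; the event fields 
(ts, category, amount), the function name aggregate_orders and the min_amount 
threshold are all made up for illustration. In a real topology the loop body 
would live in a bolt and the resulting rows would be bulk loaded into the store.

```python
from collections import defaultdict


def aggregate_orders(events, min_amount=0.0):
    """Filter out small orders, then sum amounts per (minute, category).

    Hypothetical helper: field names and threshold are assumptions.
    """
    totals = defaultdict(float)
    for event in events:
        if event["amount"] < min_amount:
            continue  # filtering step: drop events below the threshold
        minute = event["ts"] // 60  # bucket epoch seconds into minutes
        totals[(minute, event["category"])] += event["amount"]
    # Flatten into rows shaped for a bulk load into a warehouse table.
    return sorted((m, c, t) for (m, c), t in totals.items())


events = [
    {"ts": 0, "category": "books", "amount": 12.0},
    {"ts": 30, "category": "books", "amount": 8.0},
    {"ts": 61, "category": "toys", "amount": 5.0},
    {"ts": 62, "category": "toys", "amount": 0.5},
]
rows = aggregate_orders(events, min_amount=1.0)
```

The BI side then only ever queries the pre-aggregated rows, not the raw stream.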
> 3) Does Storm support only Hadoop and HBase? We may use 
> Cassandra/MongoDB/Couchbase as well.
        Out of the box, Storm comes with connectors for Kafka, HDFS and HBase. 
There are also open source connectors for many other systems, for example 
Cassandra: https://github.com/hmsonline/storm-cassandra. And you can easily add 
bolts/spouts for any storage engine that you can’t find in open source.
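        As a rough illustration of what such a custom sink does, here is a 
sketch in plain Python of a bolt-like class that buffers tuples and writes them 
in batches. It is not the Storm API: BatchingSinkBolt, write_batch and 
batch_size are hypothetical names, and write_batch stands in for whatever client 
call your storage engine provides.

```python
class BatchingSinkBolt:
    """Hypothetical bolt-like sink: buffers tuples and writes them in batches."""

    def __init__(self, write_batch, batch_size=3):
        self.write_batch = write_batch  # callable standing in for a store client
        self.batch_size = batch_size
        self.buffer = []

    def execute(self, tup):
        # Called once per incoming tuple; flush when the buffer is full.
        self.buffer.append(tup)
        if len(self.buffer) >= self.batch_size:
            self.flush()

    def flush(self):
        # Write whatever is buffered as one batch, then reset the buffer.
        if self.buffer:
            self.write_batch(list(self.buffer))
            self.buffer.clear()


written = []  # stands in for the storage engine
bolt = BatchingSinkBolt(written.append, batch_size=2)
for tup in [("a", 1), ("b", 2), ("c", 3)]:
    bolt.execute(tup)
bolt.flush()  # write the final partial batch
```

Batching is a design choice here, not a requirement; a sink that writes one 
tuple at a time is simpler but usually puts more load on the store.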
> 4) Does Storm support RDBMS too? Can we have a single interface to pull 
> data from multiple data sources?
        Not out of the box, but I am sure there are open source connectors for 
this.
> 5) Any recommendations(not limited to usage of Storm) for our specific  
> requirement described above.
        I am not entirely sure where in your pipeline you intend to use Storm. 
If it is meant as the SQL-on-NoSQL layer, Storm might not be a good fit right 
now, as I don’t think Pig on Storm is available in open source yet.
> 
> Thanks
> Ajay
> 
> Note : I have posted a similar post on the Drill and Spark User list as well 
> as I am not sure which one best fits for our usecase.

