Hi Tom

Slightly off-topic but have you ever worked with drill? We did some tests
with a 200GB and 100MB dataset in an hdfs cluster and the performance we're
seeing is so bad drill is unusable for us..

Some initial debugging revealed that drill isn't able to distribute the
workload over the cluster. The entire query runs on one server... Have you
been able to get better performance out of it?



Kind regards
Merlijn

Op dinsdag 17 mei 2016 heeft Tom Barber <[email protected]> het
volgende geschreven:
> Okay so I've been asking around as you all know and we're considering
this apache specific Juju Charms page so I figured it would be useful to
roundup which communities I have spoken to who have shown definite interest
in collaboration.
> We have:
> Apache Bigtop (we all know about)
> Apache Zeppelin (we all know about)
> Apache Karaf
> Apache Nutch
> Apache OODT
> Apache Joshua (Incubating)
> Apache Kylin
> I'm sure there will be more, and probably some I've just forgotten about
or other people spoke to, but I think thats a pretty good start.
> As me and Kevin also discussed Drill is also a pretty important one from
a personal perspective as it offers the best (IMHO) route to getting SQL
over a bunch of your NOSQL charms with minimal effort, which then helps
Saiku and any other BI tooling you guys get into the platform. Its great
having all the big data stuff, but we need ways for end users to get this
stuff back out!
>
> Tom
> --------------
> Director Meteorite.bi - Saiku Analytics Founder
> Tel: +44(0)5603641316
> (Thanks to the Saiku community we reached our Kickstart goal, but you can
always help by sponsoring the project)
-- 
Juju mailing list
[email protected]
Modify settings or unsubscribe at: 
https://lists.ubuntu.com/mailman/listinfo/juju

Reply via email to