Re: Drill & approximate query

2015-04-23 Thread Jacques Nadeau
Uli, To add to what Ted said, we'll be adding some items in this vein in the medium term. That being said, we would be more than happy to accept contributions of ideas, design or code if you were so inclined on any of these things. Sometimes, the quickest way to get these things is to contribute

Re: Drill & approximate query

2015-04-23 Thread Ted Dunning
Uli, I think that the current plans include approximate operators for some aggregations, but not anything on the level, say, BlinkDB. That said, Drill's optimizer could easily have rules that allow you to explicitly down-sample data to different degrees and then have queries choose between versio

Drill & approximate query

2015-04-23 Thread Uli Bethke
Is approximate query on the roadmap/radar for Apache Drill (similar to what we have in BlinkDB)? I see the following benefit of this feature: When performing data discovery, the analyst can often trade off raw speed against accuracy. Data discovery tools such as Datameer etc. work on a statisti