Uli,
To add to what Ted said, we'll be adding some items in this vein in the
medium term. That being said, we would be more than happy to accept
contributions of ideas, design or code if you were so inclined on any of
these things. Sometimes, the quickest way to get these things is to
contribute
Uli,
I think that the current plans include approximate operators for some
aggregations, but not anything on the level, say, BlinkDB.
That said, Drill's optimizer could easily have rules that allow you to
explicitly down-sample data to different degrees and then have queries
choose between versio
Is approximate query on the roadmap/radar for Apache Drill (similar to
what we have in BlinkDB)?
I see the following benefit of this feature:
When performing data discovery, the analyst can often trade off raw
speed against accuracy.
Data discovery tools such as Datameer etc. work on a statisti