Re: [EXT] Re: Food for thought about intra-document operation

2017-09-26 Thread Damien Profeta
data. -Aman On Mon, Sep 25, 2017 at 6:10 AM, Damien Profeta <damien.prof...@amadeus.com> wrote: Hello, A few format handled by Drill enable to work with document, meaning nested and repeated structure instead of just tables. Json and Parquet are the two that come to my mind right now. Do

Food for thought about intra-document operation

2017-09-25 Thread Damien Profeta
Hello, A few format handled by Drill enable to work with document, meaning nested and repeated structure instead of just tables. Json and Parquet are the two that come to my mind right now. Document modeling is a great way to express complex object and is used a lot in my company. Drill is

[jira] [Created] (DRILL-5797) Use more often the new parquet reader

2017-09-15 Thread Damien Profeta (JIRA)
Damien Profeta created DRILL-5797: - Summary: Use more often the new parquet reader Key: DRILL-5797 URL: https://issues.apache.org/jira/browse/DRILL-5797 Project: Apache Drill Issue Type

[jira] [Created] (DRILL-5796) Filter pruning for multi rowgroup parquet file

2017-09-15 Thread Damien Profeta (JIRA)
Damien Profeta created DRILL-5796: - Summary: Filter pruning for multi rowgroup parquet file Key: DRILL-5796 URL: https://issues.apache.org/jira/browse/DRILL-5796 Project: Apache Drill Issue

[jira] [Created] (DRILL-5795) Filter pushdown for parquet handles multi rowgroup file

2017-09-15 Thread Damien Profeta (JIRA)
Damien Profeta created DRILL-5795: - Summary: Filter pushdown for parquet handles multi rowgroup file Key: DRILL-5795 URL: https://issues.apache.org/jira/browse/DRILL-5795 Project: Apache Drill

Switch between new parquet reader and old one

2017-09-13 Thread Damien Profeta
Hi, I was looking at the code that read the parquet file and noticed there is a switch 'isComplex' to choose if it is possible to use the new reader or if we have to use the old one. The switch is based on the columns of the files (complex type or repetition level) but it doesn't care about