Thanks for the confirmation. Created DRILL-4515 <https://issues.apache.org/jira/browse/DRILL-4515> to fix the documentation.

On Thu, Mar 17, 2016 at 1:45 AM, Jacques Nadeau <[email protected]> wrote:

> Yes.
>
> - Drill attempts to split files on block boundaries when running on HDFS
>   and MapRFS.
> - Drill doesn't currently split files that are sourced from the local file
>   system.
>
> --
> Jacques Nadeau
> CTO and Co-Founder, Dremio
>
> On Wed, Mar 16, 2016 at 4:02 AM, Abdel Hakim Deneche <[email protected]> wrote:
>
> > In this documentation page:
> >
> > http://drill.apache.org/docs/text-files-csv-tsv-psv/
> >
> > We can read the following:
> >
> > > Using a distributed file system, such as HDFS, instead of a local file
> > > system to query the files also improves performance because currently
> > > Drill *does not split* files on block splits.
> >
> > Should it actually read: Drill *does split* files on block splits?
> >
> > --
> > Abdelhakim Deneche
> > Software Engineer
> > <http://www.mapr.com/>

--
Abdelhakim Deneche
Software Engineer
<http://www.mapr.com/>
