Hi Mathieu,
That issue has been resolved in Drill-1.2 snapshot.
(jira issue: https://issues.apache.org/jira/browse/DRILL-3718)

If you would like to try it out, you can download the source code from
github and build it. Or you could wait for the next official release :)

On Thu, Sep 17, 2015 at 7:16 AM, Jim Scott <[email protected]> wrote:

> While I am not going to tackle your specific question regarding using the
> delimited file reader, I will say that the 1.2 build of Drill has support
> for Apache HTTPd log format parsing. You only have to supply the format
> pattern that was used to create the logs and it will parse the records
> properly.
>
> On Thu, Sep 17, 2015 at 8:43 AM, Mathieu Agneray <
> [email protected]>
> wrote:
>
> > Hy,
> >
> > I'm having an issue with Drill file format.
> > I have a CSV file that has space delimiter (apache2 web server logs) and
> > double quotes for text area.
> > So I have configured my csv file format like this:
> >
> > "csv": {
> >       "type": "text",
> >       "extensions": [
> >         "csv"
> >       ],
> >       "escape": "\\",
> >       "comment": "\u0000",
> >       "delimiter": " "
> >     }
> >
> > and it doesn't work well.
> >
> > A line look like this:
> > XXX.XXX.XXX.XXX 200 "GET / ... etc" "USER AGENT"
> >
> > Instead of giving me (4 columns):
> > ["XXX.XXX.XXX.XXX", "200", "GET / ... etc", "USER AGENT"]
> >
> > I'm having this response (3columns):
> > ["XXX.XXX.XXX.XXX", "200", "GET / ... etc\" \"USER AGENT\""]
> >
> > But if I edit the file with comma delimiter a the configuration, it's
> > working fine.
> > Is there a problem within the code for space delimiter?
> >
> > Thanks
> >
> > Mathieu Agneray
> >
>
>
>
> --
> *Jim Scott*
> Director, Enterprise Strategy & Architecture
> +1 (347) 746-9281
> @kingmesal <https://twitter.com/kingmesal>
>
> <http://www.mapr.com/>
> [image: MapR Technologies] <http://www.mapr.com>
>
> Now Available - Free Hadoop On-Demand Training
> <
> http://www.mapr.com/training?utm_source=Email&utm_medium=Signature&utm_campaign=Free%20available
> >
>

Reply via email to