Re: Fixed-width files

2018-02-20 Thread Paul Rogers
Hi Flavio, Great question! I've not yet experimented with the solution myself, but I believe that the plugin can be placed into a jar, along with the needed Drill config file, and then placed into the jars/3rd-party directory if you keep your config information in the Drill product directory.

RE: Fixed-width files

2018-02-20 Thread Kunal Khatua
I agree... Using Andries' solution in combination with a view is probably the best approach. -Original Message- From: Flavio Pompermaier [mailto:pomperma...@okkam.it] Sent: Tuesday, February 20, 2018 1:47 PM To: user@drill.apache.org Subject: Re: Fixed-width files Actually what I'd

Re: Fixed-width files

2018-02-20 Thread Flavio Pompermaier
+--+ | 1| 12 | 123 | +--+--+--+ 1 row selected (0.587 seconds) 0: jdbc:drill:schema=dfs> Thanks, Arjun From: Kunal Khatua <kkha...@mapr.com> Sent: Wednesday, February 21, 2018 12:37 AM To: user@drill.apac

Re: Fixed-width files

2018-02-20 Thread Arjun kr
| 123 | +--+--+--+ 1 row selected (0.587 seconds) 0: jdbc:drill:schema=dfs> Thanks, Arjun From: Kunal Khatua <kkha...@mapr.com> Sent: Wednesday, February 21, 2018 12:37 AM To: user@drill.apache.org Subject: RE: Fixed-width files This might be a better opt

RE: Fixed-width files

2018-02-20 Thread Kunal Khatua
...@mapr.com] Sent: Tuesday, February 20, 2018 7:39 AM To: user@drill.apache.org Subject: Re: Fixed-width files You can also try and see if you can just use the CSV plugin to read a line as columns[0] and then use the substr function to pull out the fields in the line. https

Re: Fixed-width files

2018-02-20 Thread Andries Engelbrecht
You can also try and see if you can just use the CSV plugin to read a line as columns[0] and then use the substr function to pull out the fields in the line. http://drill.apache.org/docs/string-manipulation/#substr Here is a simple example Simple csv file [test]$ cat test.csv col1col2col3

Re: RE: Fixed-width files

2018-02-20 Thread Flavio Pompermaier
For the moment I've created an improvement issue about this: https://issues.apache.org/jira/browse/DRILL-6170 On Tue, Feb 20, 2018 at 9:23 AM, Flavio Pompermaier wrote: > Thanks Paul for this suggestion, I think I'm going to give it a try. > Once I've created my

Re: RE: Fixed-width files

2018-02-20 Thread Flavio Pompermaier
Thanks Paul for this suggestion, I think I'm going to give it a try. Once I've created my EasyFormatPlugin where should I put the produced jar? in which folder within jars directory? On Tue, Feb 20, 2018 at 2:57 AM, Paul Rogers wrote: > It may be that by "fixed width

Re: RE: Fixed-width files

2018-02-19 Thread Paul Rogers
It may be that by "fixed width text", Flavio means a file in which the text columns are of fixed width: kind of like old-school punch cards. Drill has no reader for this use case, but if you are a Java programmer, you can create one. See Drill Pull Request #1114 [1] for one example of a regex

Re: Fixed-width files

2018-02-19 Thread Flavio Pompermaier
Do you have any real example of this (apart the one reported at [1])? [1] https://drill.apache.org/docs/text-files-csv-tsv-psv/ On Mon, Feb 19, 2018 at 9:52 PM, Kunal Khatua wrote: > As long as you have delimiters, you should be able to import it as a > regular CSV file.

RE: Fixed-width files

2018-02-19 Thread Kunal Khatua
As long as you have delimiters, you should be able to import it as a regular CSV file. Using views that define the fixed-width nature should help operators downstream work more efficiently. -Original Message- From: Flavio Pompermaier [mailto:pomperma...@okkam.it] Sent: Monday,