[
https://issues.apache.org/jira/browse/DRILL-3718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chun Chang closed DRILL-3718.
-----------------------------
Assignee: Chun Chang (was: Sean Hsuan-Yi Chu)
verified fix.
Have the following tsv data file:
{noformat}
"quote" no quote
another no quote ""another with quote""
{noformat}
and drill can scan the file:
{noformat}
0: jdbc:drill:schema=dfs.drillTestDirDropTabl> select columns[0], columns[1]
from dfs.tmp.`drill-3718.tsv`;
+-------------------+-------------------------+
| EXPR$0 | EXPR$1 |
+-------------------+-------------------------+
| quote | no quote |
| another no quote | "another with quote"
|
+-------------------+-------------------------+
{noformat}
> quotes in .tsv trigger exception
> ---------------------------------
>
> Key: DRILL-3718
> URL: https://issues.apache.org/jira/browse/DRILL-3718
> Project: Apache Drill
> Issue Type: Bug
> Components: Storage - Text & CSV
> Reporter: Sean Hsuan-Yi Chu
> Assignee: Chun Chang
> Fix For: 1.2.0
>
>
> Given a simple tsv file as below
> {code}
> "a" a
> a a
> a
> {code}
> After getting the first quote, the TextReader would just keep going down the
> entire files, as opposed to stopping at the second quote.
> This will trigger an exception
> {code}
> org.apache.drill.common.exceptions.UserRemoteException: SYSTEM ERROR:
> TextParsingException: Error processing input: Cannot use newline character
> within quoted string, line=2, char=12. Content parsed: [ ]
> Fragment 0:0
> {code}
> which complains at having newline in the quote.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)