[
https://issues.apache.org/jira/browse/TAJO-1374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14353975#comment-14353975
]
ASF GitHub Bot commented on TAJO-1374:
--------------------------------------
Github user hyunsik commented on the pull request:
https://github.com/apache/tajo/pull/400#issuecomment-77974324
The patch looks nice to me. Of course, your work is useful.
It would be great if this feature is applied to DelimitedTextFile too
because DelimitedTextFile is new replacement to CSVFile. DelimitedTextFile's
performance is really great. According to some benchmark result, it can parse
more than 500MB CSV data sets per second. It also can boost up query response
times in many cases, especially I/O intensive workloads.
I think that the work for DelimitedTextFile does not need to be done in
this issue. We can do in another jira.
Anyway, could you fix some test failure? It still has one test failure.
> Support multi-bytes delimiter for CSV file
> ------------------------------------------
>
> Key: TAJO-1374
> URL: https://issues.apache.org/jira/browse/TAJO-1374
> Project: Tajo
> Issue Type: Improvement
> Components: storage
> Reporter: Navis
> Assignee: Navis
> Priority: Minor
>
> Supports multi-character / non-ascii delimiter for CSV file.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)