[ 
https://issues.apache.org/jira/browse/JENA-699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14132957#comment-14132957
 ] 

Andy Seaborne commented on JENA-699:
------------------------------------

Thank you for the contribution. Looks good.

The CSV parser in Jena carries an element of "technical debt".  There needed
to be a parser to get through the W3C SPARQL-WG tests (W3C process: 2
implementations needed). All that was required was strict RFC 4180, though
UTF-8 handling was on my list of good things to have.  There are a lot of CSV
parsers around, but given the licenses, wanting just a parser rather than an
object model for handling CSV-ish data, and needing correct handling of ""
escapes, it became easier to write one, Apache Commons CSV not being out of
snapshot at the time. Given that Jena has a parsing infrastructure, cooking up
a CSV parser was the easiest thing to do at the time, but that does accumulate
some "technical debt".  Real CSV is a whole lot messier than the RFC defines,
and a solid basis for CSV parsing is better than the current one. The parser
API should not be seen as fixed; it did its job at the time but it's not
really a public contract. The new jena-csv is better off with a solid parser.

> Replace the CSV/TSV parsing with Apache Commons CSV
> ---------------------------------------------------
>
>                 Key: JENA-699
>                 URL: https://issues.apache.org/jira/browse/JENA-699
>             Project: Apache Jena
>          Issue Type: Improvement
>            Reporter: Andy Seaborne
>            Priority: Minor
>         Attachments: JENA-699.patch
>
>
> When Apache Commons CSV is released, use that and remove the current parsers 
> in favour of a properly written and designed component.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
