[
https://issues.apache.org/jira/browse/CSV-293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17875151#comment-17875151
]
Gary D. Gregory commented on CSV-293:
-------------------------------------
Hello [~paulmillar]
Thank you for taking the time to dig up your old laptop and detailing the
background for this issue.
{quote}
RML mapper is written in Java and uses Commons CSV (or open csv?)
{quote}
RML mapper currently depends on:
{noformat}
[INFO] | +- com.opencsv:opencsv:jar:5.8:compile
[INFO] | +- org.simpleflatmapper:sfm-csv:jar:8.2.3:compile
[INFO] | | +- org.simpleflatmapper:lightning-csv:jar:8.2.3:compile
[INFO] | | | \- org.apache.commons:commons-csv:jar:1.9.0:compile
{noformat}
{quote}
However, I've worked with other data sources that suffered from having multiple
null-like String values, so I think supporting multiple null values in CSVW
makes sense.
{quote}
CSVW sounds like the initial HTTP RFC which just captured the state of affairs
at some point in time, instead of a design from scratch.
{quote}
Therefore, given the support for Stream, I don't think there's a need to update
the Commons CSV API to support an application injecting some kind of lambda.
{quote}
Right, that makes sense.
> Add support for multiple null String values
> -------------------------------------------
>
> Key: CSV-293
> URL: https://issues.apache.org/jira/browse/CSV-293
> Project: Commons CSV
> Issue Type: Improvement
> Components: Parser
> Reporter: Paul Millar
> Priority: Minor
>
> The [CSVW namespace|https://www.w3.org/ns/csvw] provides metadata describing
> a CSV file. One element of this is the ability to associate certain certain
> values with the {{null}} value, as recorded by the [csvw:null
> property|https://www.w3.org/ns/csvw#property-definitions].
> This definition corresponds (broadly) to the "null String" concept (see
> [org.apache.commons.csv.CSVFormat#setNullString|http://commons.apache.org/proper/commons-csv/apidocs/org/apache/commons/csv/CSVFormat.Builder.html#setNullString-java.lang.String-]),
> with one noticeable difference: {{CSVFormat}} supports only a single "null
> String" value while CSVW, through {{csvw:null}}, supports multiple Strings.
> In order to fully support CSVW, it would be helpful if {{CSVFormat}} were to
> be updated to allow multiple null String values.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)