[
https://issues.apache.org/jira/browse/JENA-827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andy Seaborne closed JENA-827.
------------------------------
Resolution: Fixed
Fix Version/s: Jena 2.12.2
Assignee: Andy Seaborne
Switch to syntax only parsing.
> Include all ISO 639-3 languages
> -------------------------------
>
> Key: JENA-827
> URL: https://issues.apache.org/jira/browse/JENA-827
> Project: Apache Jena
> Issue Type: Improvement
> Components: RDF/XML
> Affects Versions: Jena 2.12.1
> Reporter: Stian Soiland-Reyes
> Assignee: Andy Seaborne
> Priority: Minor
> Fix For: Jena 2.12.2
>
> Original Estimate: 48h
> Remaining Estimate: 48h
>
> {code}
> WARN 2014-12-05 14:21:24,085
> (com.hp.hpl.jena.rdf.model.impl.RDFDefaultErrorHandler:47) -
> http://www.w3.org/ns/oa#(line 42 column 36):
> {W116}
> ISO-639 does not define language: 'vls'.
> {code}
> http://www.w3.org/ns/oa.rdf says
> {code}
> <dc:creator xml:lang="vls">Herbert Van de Sompel</dc:creator>
> {code}
> but it does.. http://www-01.sil.org/iso639-3/documentation.asp?id=vls
> The complete list of ISO639-3 is not included in
> https://github.com/apache/jena/blob/master/jena-core/src/main/java/com/hp/hpl/jena/rdfxml/xmlinput/lang/Iso639.java
> - only ISO639-2 and ISO639-3.
> The new lists can be found at http://www-01.sil.org/iso639-3/download.asp -
> e.g. http://www-01.sil.org/iso639-3/iso-639-3.tab (UTF-8 although browser
> disagrees).
> I can work on the script to update this. One question is if Iso639.java needs
> a new field for the identifier for all those languages which are not in -1
> and -2 (e.g. "vls"). Another is if we should include the proper UTF-8 names
> of the languages to get the accents correct, e.g.
> {quote}
> bbj I L Ghomálá'
> {quote}
> I'm not sure if the permissions are compatible with Apache license:
> {quote}
> ISO 639-3 Code Tables Terms of Use
> The ISO 639-3 code set may be downloaded and incorporated into software
> products, web-based systems, digital devices, etc., either commercial or
> non-commercial, provided that:
> attribution is given www.sil.org/iso639-3/ as the source of the codes;
> the identifiers of the code set are not modified or extended except as
> may be privately agreed using the Private Use Area (range qaa to qtz), and
> then such extensions shall not be distributed publicly;
> the product, system, or device does not provide a means to redistribute
> the code set.
> {quote}
> the last bit might mean we should not include the *.tab files directly - but
> would the listing in Iso6539.java consitute a "means to redistribute the code
> set"?
> Is "the identifiers of the code set are not modified" compatible with Apache
> License which presumably allows you to modify anything?
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)