afs commented on PR #3405:
URL: https://github.com/apache/jena/pull/3405#issuecomment-3240031592

   Jena 5.4.0 introduced `jena-langtag`.
   
   The RDF 1.1 spec says "language tags MAY be converted to lower case. The 
value space of language tags is always in lower case."  This leads to 
differences across systems (when are we in "value space" for a language tag?) 
but also differences between Jena dataset implementations.
   
   Now RDF 1.2 Concepts says "MUST be treated accordingly, that is, in a 
case-insensitive manner."
   
   Jena went with the algorithm in [RFC 5646 section 
2.1.1](https://datatracker.ietf.org/doc/html/rfc5646#section-2.1.1), without 
regitry access, based on some previous user feedback. Language tag are now 
parsed or created as case-normalized following RFC 5646 form (lang is 
lowercase, region is uppercase) then comparison. The original language tag is 
not preserved.
   
   The WG did a survey of systems: it found systems providing lower case and 
systems providing the algorithm in RFC 5646 section 2.1.1. Jena went with the 
algorithm in RFC 5646 based on some previous user feedback.
   
   There is an [entry in the RDF 1.2 change 
log](https://www.w3.org/TR/rdf12-concepts/#:~:text=Implementations%20were%20previously%20allowed%20to%20normalize%20language%20tags%20to%20lower%20case%2C).
   
   For initial text direction, the strings are `ltr` or `rtl` and lower case is 
required.
   
   CHANGES for jena 5.4.0:
   
   > ==  New artifact
   > 
   > Jena 5.4.0 introduces a new module jena-langtag for language tag parsing 
   > in compliance with RFC 5646. Language tag validation is strengthened. 
   > When parsing, language tag violations are still treated as warnings.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: pr-unsubscr...@jena.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: pr-unsubscr...@jena.apache.org
For additional commands, e-mail: pr-h...@jena.apache.org

Reply via email to