Add language identification support for Norwegian Bokmål and Norwegian Nynorsk
------------------------------------------------------------------------------
Key: TIKA-491
URL: https://issues.apache.org/jira/browse/TIKA-491
Project: Tika
Issue Type: New Feature
Components: languageidentifier
Affects Versions: 0.7
Reporter: Jan Høydahl
Currently there is one Norwegian language profile in Tika - "no". We need to
distinguish between the two official Norwegian languages defined by ISO 639-1
codes "nb" and "nn". Those codes are recommended used instead of the common
"no" tag.
Proposed solved by removing the current language profile no.ngp and replacing
it with two new ones for nb and nn.
We must also add tests for Norwegian
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.