GitHub user rvesse commented on the pull request:
https://github.com/apache/jena/pull/97#issuecomment-153874598
Looks good to me.
One open question: how does this interact with past work on
language-specific indexing and multi-lingual indexing in general?
It's been a while since I poked around Lucene, but I seem to remember that
it was often necessary to use alternative analysers, particularly for
languages with compound words, non-Latin alphabets, symbolic alphabets,
etc. This shouldn't be a requirement for merging this work; I just wanted to
check that the current design won't preclude supporting this in the future.
Looking over the code, it looks like it should be relatively easy to add new
analysers and filters as needed, but I just wanted to make sure I had
understood the code correctly.