On 05/05/2015 02:04 PM, amiara514 wrote:
Github user amiara514 commented on the pull request:
https://github.com/apache/jena/pull/53#issuecomment-99072949
Ah, I see. But this still doesn't help for cases where there are small
differences between literals within the same language, for example singular/plural
forms that get stemmed by the analyzer, or variations in capitalization.
It's exactly that!
So, I push the hash solution which cover all previous cases.
For the other issue (with conjonctive query), maybe deletion have to be
managed with an updateDocument ?
Aside, hope it's useful:
Note that current jena-text doesn't /do/ conjunctive query but has enough
hooks to /enable/ conjunctive query, as is done in our ppd-index code
at https://github.com/epimorphics/ppd-text-index in TextDocProducerBatch.
When it does deletions it uses the IndexWriter's deleteDocuments() method
to brutally remove all the documents associated with the current subject
and then puts back ones that are still in the dataset.
:end Aside
Chris
--
If I were you, I would go to the crackpots. /They Shall Have Stars/
Epimorphics Ltd, http://www.epimorphics.com
Registered address: Court Lodge, 105 High Street, Portishead, Bristol BS20 6PT
Epimorphics Ltd. is a limited company registered in England (number 7016688)