[ https://issues.apache.org/jira/browse/JENA-1645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16709803#comment-16709803 ]
ASF GitHub Bot commented on JENA-1645:
--------------------------------------
GitHub user DrBAXA opened a pull request:
https://github.com/apache/jena/pull/503
JENA-1645: Use uri predicate in concrete subject query.
Adds the subject URI as a predicate in the Lucene search when the query has
a concrete subject.
The new method on the TextIndex interface is a default method that falls
back to the previous implementation.
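
The default-method fallback described above could look roughly like the sketch below. It is an illustration only, not the code in the pull request: SketchTextIndex, LegacyIndex and SubjectAwareIndex are hypothetical names, the in-memory map stands in for a Lucene index, and the real TextIndex method signatures are not reproduced here.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for a text index interface (not Jena's TextIndex).
interface SketchTextIndex {
    // Pre-existing method: search by text only, returning matching subject URIs.
    List<String> query(String queryString, int limit);

    // Newly added method. Being a default method, existing implementations
    // keep compiling; the fallback ignores the subject URI and delegates to
    // the old text-only search.
    default List<String> query(String subjectUri, String queryString, int limit) {
        return query(queryString, limit);
    }
}

// An implementation written before the new method existed.
class LegacyIndex implements SketchTextIndex {
    // Subject URI -> indexed text; a toy replacement for a real index.
    protected final Map<String, String> docs = new LinkedHashMap<>();

    void add(String uri, String text) {
        docs.put(uri, text);
    }

    @Override
    public List<String> query(String queryString, int limit) {
        List<String> hits = new ArrayList<>();
        for (Map.Entry<String, String> e : docs.entrySet()) {
            if (hits.size() >= limit) {
                break;
            }
            if (e.getValue().contains(queryString)) {
                hits.add(e.getKey());
            }
        }
        return hits;
    }
}

// An implementation that overrides the new method and uses the subject URI
// as an extra constraint, so at most one document is examined.
class SubjectAwareIndex extends LegacyIndex {
    @Override
    public List<String> query(String subjectUri, String queryString, int limit) {
        List<String> hits = new ArrayList<>();
        String text = docs.get(subjectUri);
        if (limit > 0 && text != null && text.contains(queryString)) {
            hits.add(subjectUri);
        }
        return hits;
    }
}
```

With two documents that both contain "report" and a limit of 1, the legacy fallback returns only the first hit and can miss the requested subject, while the overriding implementation returns exactly that subject.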
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/DrBAXA/jena master
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/jena/pull/503.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #503
----
commit 52d959c7a654b03e525fad214b027b6ac6aba2b2
Author: vdanyliuk <vasyl.danyliuk@...>
Date: 2018-12-05T09:10:49Z
JENA-1645: Use uri predicate in concrete subject query.
----
> Poor performance with full text search (Lucene)
> -----------------------------------------------
>
> Key: JENA-1645
> URL: https://issues.apache.org/jira/browse/JENA-1645
> Project: Apache Jena
> Issue Type: Question
> Components: Jena
> Affects Versions: Jena 3.9.0
> Reporter: Vasyl Danyliuk
> Priority: Major
>
> Situation: about half a million documents (emails, actually) indexed by
> Lucene; searching for emails by sender/receiver and some text.
> If the text filter is placed at the start of the SPARQL query, it executes
> once, but for very common words there are a lot of results (100,000+), which
> leads to poor performance; limiting the result count may end up with missed
> results.
> If the text search is placed as the last condition, it executes once for each
> subject already found. That's completely OK, but the text search completely
> ignores the subject URI.
> I found two methods in the TextQueryPF class: variableSubject(...) for the
> first case, and concreteSubject(...) for the second one.
> The question is: why can't the subject URI be used as a constraint in the text
> search?
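
One way the subject URI could be used as a constraint is to AND it onto the text query in Lucene's classic query-parser syntax. The sketch below only builds such a query string in plain Java. SubjectConstraint and its methods are hypothetical helpers, the field name "uri" is an assumption about how the index is configured, and whether the resulting clause matches depends on how that field is analyzed.

```java
// Hypothetical helper; not part of Jena or Lucene.
final class SubjectConstraint {
    // Characters that have special meaning in the classic query-parser syntax.
    private static final String SPECIAL = "\\+-!():^[]\"{}~*?|&/";

    private SubjectConstraint() {
    }

    // Backslash-escape every special character so the URI is read as one term.
    static String escape(String s) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            if (SPECIAL.indexOf(c) >= 0) {
                sb.append('\\');
            }
            sb.append(c);
        }
        return sb.toString();
    }

    // Combine the user's text query with a required match on the URI field.
    // uriField is an assumed field name supplied by the caller.
    static String constrain(String textQuery, String uriField, String subjectUri) {
        return "(" + textQuery + ") AND " + uriField + ":" + escape(subjectUri);
    }
}
```

For example, SubjectConstraint.constrain("report", "uri", "http://example.org/mail/1") produces a query that can only match the document for that one subject, so a result limit no longer risks cutting it off.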
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)