[ 
https://issues.apache.org/jira/browse/SOLR-9480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16474425#comment-16474425
 ] 

Hoss Man commented on SOLR-9480:
--------------------------------

Updated patch...

bq. ... some bug where the existing refinement logic isn't picking up the 
function contributions of some shards when the doc count is 0 ...

I tracked down the problems I was seeing to SOLR-12343 -- knowning that bug 
exists, I've made the test "self regulate" itself to avoid it: first it picks a 
random sort option, and then if the sort is one that _might_ result in that bug 
causing incorect stat values, i force the limit option to be big enough that 
all posible field values will be refined & returned to the client.  This let me 
relax a *lot* of the other constraints I had put on the test based on earlier 
misdiagnoses of what was causing these types of problems.

{quote}
* right now the redundent fore & back "size" values (which are the same for 
every slot/bucket) are returned for every bucket ... i'd like to try and figure 
out if i can put that data in the facet "context" to reduce the shard response 
size.
...
* figuring out what/how/where to put info in the facetDebug output
{quote}

...neither of these really seem possible at the moment for similar/overlapping 
reasons...

# There are currently no "per bucket" facet debug information, everything is 
reported at the "Per-Facet" level
# AggValueSources/SlotAccs currently don't know what their "key" is in the 
parent facet/context ...
#* which means there is no "safe" key for the accumulator to use if it tried to 
put data directly into the (parent) facet response, or if it tried to put any 
stat specific debug info in the FacetDebugInfo Map

(This situation of "what is my instances key/name?" is something we probably 
want to solve eventually, if not for optimizing the response size and/or adding 
debuging info then for the "making the queries optional, and inheriting them 
from "ancestor" function instances higher up the tree" type functionality i 
mentioned in my previous comment)

...so i went ahead and removed those nocommits from the patch.



> Graph Traversal for Significantly Related Terms (Semantic Knowledge Graph)
> --------------------------------------------------------------------------
>
>                 Key: SOLR-9480
>                 URL: https://issues.apache.org/jira/browse/SOLR-9480
>             Project: Solr
>          Issue Type: Improvement
>      Security Level: Public(Default Security Level. Issues are Public) 
>            Reporter: Trey Grainger
>            Priority: Major
>         Attachments: SOLR-9480.patch, SOLR-9480.patch, SOLR-9480.patch
>
>
> This issue is to track the contribution of the Semantic Knowledge Graph Solr 
> Plugin (request handler), which exposes a graph-like interface for 
> discovering and traversing significant relationships between entities within 
> an inverted index.
> This data model has been described in the following research paper: [The 
> Semantic Knowledge Graph: A compact, auto-generated model for real-time 
> traversal and ranking of any relationship within a 
> domain|https://arxiv.org/abs/1609.00464], as well as in presentations I gave 
> in October 2015 at [Lucene/Solr 
> Revolution|http://www.slideshare.net/treygrainger/leveraging-lucenesolr-as-a-knowledge-graph-and-intent-engine]
>  and November 2015 at the [Bay Area Search 
> Meetup|http://www.treygrainger.com/posts/presentations/searching-on-intent-knowledge-graphs-personalization-and-contextual-disambiguation/].
> The source code for this project is currently available at 
> [https://github.com/careerbuilder/semantic-knowledge-graph], and the folks at 
> CareerBuilder (where this was built) have given me the go-ahead to now 
> contribute this back to the Apache Solr Project, as well.
> Check out the Github repository, research paper, or presentations for a more 
> detailed description of this contribution. Initial patch coming soon.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to