Re: Getting a list of matching terms and offsets

2016-06-06 Thread Justin Lee
Thank you very much! That JIRA entry led me to https://issues.apache.org/jira/browse/SOLR-4722, which still works against Solr 6 with a couple of modifications and should serve as the basis for what I want to do. You saved me a bunch of work, so thanks very much. (Also, it is always nice to

Re: Getting a list of matching terms and offsets

2016-06-05 Thread Ahmet Arslan
Hi Lee, Maybe you can find a useful starting point at https://issues.apache.org/jira/browse/SOLR-1397 Please consider contributing once you have something working. Ahmet On Sunday, June 5, 2016 10:37 PM, Justin Lee wrote: Thanks, yea, I looked at debug query too.

Re: Getting a list of matching terms and offsets

2016-06-05 Thread Justin Lee
Thanks, yea, I looked at debug query too. Unfortunately the output of debug query doesn't quite do it. For example, if you use a wildcard query, it will simply explain the score associated with that wildcard query, not the actual matching token. In other words, if you search for "hour*" and the

Re: Getting a list of matching terms and offsets

2016-06-05 Thread Ahmet Arslan
Well, debug query has the list of tokens that caused the match. If I am not mistaken, I read an example about span queries and spans that listed the positions of the matches. I cannot find the example at the moment. Ahmet On Sunday, June 5, 2016 9:10 PM, Justin Lee

Re: Getting a list of matching terms and offsets

2016-06-05 Thread Justin Lee
Thanks for the responses Alex and Ahmet. The TermVector component was the first thing I looked at, but what it gives you is offset information for every token in the document. I'm trying to get a list of tokens that actually match the search query, and unless I'm missing something, the
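
One possible workaround for the problem described above is to filter the per-document term vector on the client side, keeping only the terms that match the query pattern. The sketch below is an illustration under stated assumptions, not something proposed in the thread: `matching_terms` is a hypothetical helper, the `term_offsets` dictionary is hand-written sample data rather than the actual Solr response format, and Python's `fnmatchcase` only approximates Lucene wildcard syntax (it also treats `[...]` as a character class).

```python
from fnmatch import fnmatchcase


def matching_terms(term_offsets, pattern):
    """Keep only the terms whose text matches a shell-style wildcard pattern.

    term_offsets maps an indexed term to a list of (start, end) character
    offsets. This shape is an assumption for illustration only.
    """
    return {
        term: offsets
        for term, offsets in term_offsets.items()
        if fnmatchcase(term, pattern)
    }


# Hand-written sample data standing in for one document's term vector.
term_offsets = {
    "hour": [(0, 4)],
    "hourly": [(10, 16)],
    "minute": [(20, 26)],
}

hits = matching_terms(term_offsets, "hour*")
print(hits)
```

This compares the pattern against indexed terms, so any analysis applied at index time (lowercasing, stemming) affects which terms can match.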

Re: Getting a list of matching terms and offsets

2016-06-05 Thread Ahmet Arslan
Hi, Maybe org.apache.lucene.search.spans.TermSpans? On Sunday, June 5, 2016 7:59 AM, Alexandre Rafalovitch wrote: It sounds like TermVector component's output: https://cwiki.apache.org/confluence/display/solr/The+Term+Vector+Component Perhaps with additional flags

Re: Getting a list of matching terms and offsets

2016-06-04 Thread Alexandre Rafalovitch
It sounds like TermVector component's output: https://cwiki.apache.org/confluence/display/solr/The+Term+Vector+Component Perhaps with additional flags enabled (e.g. tv.offsets and/or tv.positions). Regards, Alex. Newsletter and resources for Solr beginners and intermediates:
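
As a rough illustration of the flags mentioned above, the following sketch builds a request URL that asks the Term Vector Component for offsets and positions. The host, the core name `mycore`, the field name `text`, and the helper `build_term_vector_url` are all hypothetical. The `/tvrh` handler path is an assumption as well: it only works if a request handler with the Term Vector Component is configured under that name, and the field must be indexed with term vectors, positions, and offsets enabled.

```python
from urllib.parse import urlencode


def build_term_vector_url(base_url, query, field):
    """Build a Solr request URL with term vector offsets and positions enabled."""
    params = {
        "q": query,
        "fl": "id",
        "tv": "true",            # turn on the Term Vector Component
        "tv.fl": field,          # field to return term vectors for
        "tv.offsets": "true",    # character offsets of each term
        "tv.positions": "true",  # token positions of each term
        "wt": "json",
    }
    return base_url + "?" + urlencode(params)


# Hypothetical host, core, and field names.
url = build_term_vector_url(
    "http://localhost:8983/solr/mycore/tvrh", "text:hour*", "text"
)
print(url)
```

The response lists offsets and positions for every term in each returned document, so any narrowing to the terms that actually matched the query has to happen separately.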