[ 
https://issues.apache.org/jira/browse/SOLR-4722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17625091#comment-17625091
 ] 

Enea Mansutti edited comment on SOLR-4722 at 10/27/22 12:51 PM:
----------------------------------------------------------------

Inspired by this work I made a plugin that returns the position of the token in 
the snippet: <em pos="18">word</em>

I have tested it with Solr 8.11.2, you can check github for project 
[positions-solr-highlighter|https://github.com/eneam/positions-solr-highlighter]
 

It can be easily modified to return the offsets also


was (Author: JIRAUSER297584):
Inspired by this work I made a plugin that returns the position of the token in 
the snippet: <em pos="18">word</em>

I have tested it with Solr 8.11.2, you can check github for project 
[positions-solr-highlighter|https://github.com/eneam/positions-solr-highlighter]
 

It can be easily modified to return also the offsets

> Highlighter which generates a list of query term position(s) for each item in 
> a list of documents, or returns null if highlighting is disabled.
> -----------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: SOLR-4722
>                 URL: https://issues.apache.org/jira/browse/SOLR-4722
>             Project: Solr
>          Issue Type: New Feature
>          Components: highlighter
>    Affects Versions: 4.3, 6.0
>            Reporter: Tricia Jenkins
>            Priority: Minor
>         Attachments: PositionsSolrHighlighter.java, SOLR-4722.patch, 
> SOLR-4722.patch, solr-positionshighlighter.jar
>
>
> As an alternative to returning snippets, this highlighter provides the (term) 
> position for query matches.  One usecase for this is to reconcile the term 
> position from the Solr index with 'word' coordinates provided by an OCR 
> process.  In this way we are able to 'highlight' an image, like a page from a 
> book or an article from a newspaper, in the locations that match the user's 
> query.
> This is based on the FastVectorHighlighter and requires that termVectors, 
> termOffsets and termPositions be stored.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to