[
https://issues.apache.org/jira/browse/SOLR-4722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13822537#comment-13822537
]
Simon Rosenthal commented on SOLR-4722:
---------------------------------------
Great patch !
I'd like to use the code as the basis for a component which will simply return
term positions for each query term - no need for having highlighting enabled as
a prerequisite, or to return term offsets - this is a text mining project where
we'll be running queries in batch mode and storing this information externally.
Can you think of any gotchas I might encounter ?
> Highlighter which generates a list of query term position(s) for each item in
> a list of documents, or returns null if highlighting is disabled.
> -----------------------------------------------------------------------------------------------------------------------------------------------
>
> Key: SOLR-4722
> URL: https://issues.apache.org/jira/browse/SOLR-4722
> Project: Solr
> Issue Type: New Feature
> Components: highlighter
> Affects Versions: 4.3, 5.0
> Reporter: Tricia Jenkins
> Priority: Minor
> Attachments: SOLR-4722.patch, solr-positionshighlighter.jar
>
>
> As an alternative to returning snippets, this highlighter provides the (term)
> position for query matches. One usecase for this is to reconcile the term
> position from the Solr index with 'word' coordinates provided by an OCR
> process. In this way we are able to 'highlight' an image, like a page from a
> book or an article from a newspaper, in the locations that match the user's
> query.
> This is based on the FastVectorHighlighter and requires that termVectors,
> termOffsets and termPositions be stored.
--
This message was sent by Atlassian JIRA
(v6.1#6144)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]