Hmm maybe more insights on the use case would be useful. It looks like what 
David says about metadata could make sense in your scenario depending on the 
requirements...



> Am 24.05.2020 um 13:20 schrieb Serkan KAZANCI <ser...@kazanci.com.tr>:
> 
> Thanks Jörn for the answer,
> 
> I use post tool to index html documents, so the html tags are stripped when 
> indexed and stored. The remaining text is mapped to the field content by 
> default. 
> 
> hl.fragsize=0 works perfect for the indexed document, but I can only display 
> highlighted text-only version of html document because the html tags are 
> stripped.
> 
> So is it possible to index and store the html document without stripping the 
> html tags, so that when the document is displayed with hl.fragsize=0 
> parameter, it is displayed as original html document?
> 
> Or
> 
> Is it possible to give a whole html document as a parameter to the Unified 
> highlighter so that output is also a highlighted html document?
> 
> Or 
> 
> Do you have a better idea to highlight the keywords of the whole html 
> document? 
> 
> Thanks,
> 
> Serkan
> 
> -----Original Message-----
> From: Jörn Franke [mailto:jornfra...@gmail.com] 
> Sent: Sunday, May 24, 2020 1:22 PM
> To: solr-user@lucene.apache.org
> Subject: Re: highlighting a whole html document using Unified highlighter
> 
> hl.fragsize=0
> 
> https://lucene.apache.org/solr/guide/8_5/highlighting.html
> 
> 
> 
>> Am 24.05.2020 um 11:49 schrieb Serkan KAZANCI <ser...@kazanci.com.tr>:
>> 
>> Hi,
>> 
>> 
>> 
>> I use solr to search over a million html documents, when a document is
>> searched and displayed, I want to highlight the keywords that are used to
>> find and access the document.
>> 
>> 
>> 
>> Unified highlighter is fast, accurate and supports different languages but
>> only highlights passages with given parameters.
>> 
>> 
>> 
>> How can I highlight a whole html document using Unified highlighter? I have
>> written a php code but it cannot do the complex word stemming functions.
>> 
>> 
>> 
>> 
>> 
>> Thanks,
>> 
>> 
>> 
>> Serkan
>> 
> 

Reply via email to