[ 
https://issues.apache.org/jira/browse/SOLR-10299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16069939#comment-16069939
 ] 

Jan Høydahl edited comment on SOLR-10299 at 6/30/17 12:05 PM:
--------------------------------------------------------------

A downside of building the search index from HTML is that you get search hits 
from the left-hand TOC, the menu header and the footer navigation, so a 
search for any word in the TOC, such as "analysers", will retrieve *all* pages 
(with the best on top, though). If we let our Ant build generate the index, a 
workaround is to introduce a new build target {{build-site-notoc}} or similar 
that produces clean HTML without the TOC, menu bar and footer.

*UPDATE* I tested this: I built the site using {{_layouts/default_print.html}} 
and uploaded the index. We now get much better precision! The index also shrank 
from 605k to 375k, since the URL prefix is now in index.html and not in 
search-index.js :)
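
For illustration only, here is a minimal Python sketch of a different route to 
the same goal: instead of rendering the pages with a separate layout, drop the 
page chrome while extracting the text to index. This is not what was tested 
above. The tag names in SKIP_TAGS and the helper name extract_text are 
assumptions; the real Ref Guide templates may mark the TOC, menu and footer 
with other elements or class names.

```python
from html.parser import HTMLParser

# Assumption: the TOC, menu and footer are wrapped in these elements.
# script/style are skipped too, since their content is never searchable text.
SKIP_TAGS = {"nav", "header", "footer", "script", "style"}


class ContentTextExtractor(HTMLParser):
    """Collects page text, ignoring anything nested inside SKIP_TAGS."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._skip_depth = 0  # > 0 while inside at least one skipped element
        self._chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and self._skip_depth == 0:
            self._chunks.append(text)

    def text(self):
        return " ".join(self._chunks)


def extract_text(html):
    """Hypothetical helper: return only the indexable body text of a page."""
    parser = ContentTextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()


if __name__ == "__main__":
    page = (
        "<html><body><nav><ul><li>Analysers</li></ul></nav>"
        "<div><h1>Tokenizers</h1><p>Body text</p></div>"
        "<footer>Footer links</footer></body></html>"
    )
    print(extract_text(page))
```

With a filter like this, a word that appears only in the navigation no longer 
matches every page. The trade-off versus a dedicated layout is that the filter 
has to be kept in sync with the site templates.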


> Provide search for online Ref Guide
> -----------------------------------
>
>                 Key: SOLR-10299
>                 URL: https://issues.apache.org/jira/browse/SOLR-10299
>             Project: Solr
>          Issue Type: Sub-task
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: documentation
>            Reporter: Cassandra Targett
>
> The POC to move the Ref Guide off Confluence did not address providing 
> full-text search of the page content. Not because it's hard or impossible, 
> but because there were plenty of other issues to work on.
> The current HTML page design provides a title index, but to replicate the 
> current Confluence experience, the online version(s) need to provide a 
> full-text search experience.


