https://bugzilla.wikimedia.org/show_bug.cgi?id=32871

       Web browser: ---
             Bug #: 32871
           Summary: Search indexes limited to first 100k words?
           Product: MediaWiki extensions
           Version: any
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: normal
          Priority: Unprioritized
         Component: Lucene Search
        AssignedTo: [email protected]
        ReportedBy: [email protected]
                CC: [email protected]
    Classification: Unclassified


There's a suggestion currently at
http://en.wikipedia.org/wiki/Wikipedia:Village_pump_%28technical%29#Web_scraping_tool_for_article_research_.28list_expansion.29
that the search index covers only the first 100k words of a page.

If so, content near the bottom of a very long page is left out of the index
and cannot be found through search.

If this restriction exists, could it be lifted so that the full text of
every page is indexed?
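
To illustrate the concern, here is a minimal sketch of a toy inverted index
with a per-page token cap. It is not the Lucene Search extension's code; the
function build_index and its max_tokens parameter are hypothetical names
made up for this example, and the cap of 5 stands in for whatever limit (if
any) the real indexer applies.

```python
from collections import defaultdict


def build_index(pages, max_tokens=None):
    """Build a toy inverted index mapping each word to the pages containing it.

    max_tokens is a hypothetical per-page cap: when set, only the first
    max_tokens words of each page are indexed and the rest are ignored.
    """
    index = defaultdict(set)
    for title, text in pages.items():
        tokens = text.lower().split()
        if max_tokens is not None:
            tokens = tokens[:max_tokens]  # everything past the cap is dropped
        for token in tokens:
            index[token].add(title)
    return index


# A page whose only distinctive word sits at the very end.
pages = {"Long list": "alpha " * 5 + "zebra"}

capped = build_index(pages, max_tokens=5)
full = build_index(pages)

print("zebra" in capped)  # False: the word falls beyond the cap
print("zebra" in full)    # True: the whole page was indexed
```

With the cap in place, a search for the last word on the page finds nothing
even though the page contains it, which is the behaviour described above.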

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
You are on the CC list for the bug.

_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
