https://bugzilla.wikimedia.org/show_bug.cgi?id=63729

            Bug ID: 63729
           Summary: CirrusSearch: Remove as much non-sentence stuff as
                    possible from article text
           Product: MediaWiki extensions
           Version: unspecified
          Hardware: All
                OS: All
            Status: NEW
          Severity: normal
          Priority: Unprioritized
         Component: CirrusSearch
          Assignee: [email protected]
          Reporter: [email protected]
                CC: [email protected], [email protected],
                    [email protected]
       Web browser: ---
   Mobile Platform: ---

The snippets we generate actually contain stuff from within tables, image
captions, and headings.  These don't look great.  If we could smash those into
another field then the snippets would be nicer.  We could also use the sentence
fragmenter in the experimental highlighter.

Note: we already do this for the headings.  We should do it for tables and
infoboxes and stuff.  Maybe we should do it for a css class as well.

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to