https://bugzilla.wikimedia.org/show_bug.cgi?id=63729
Bug ID: 63729
Summary: CirrusSearch: Remove as much non-sentence stuff as
possible from article text
Product: MediaWiki extensions
Version: unspecified
Hardware: All
OS: All
Status: NEW
Severity: normal
Priority: Unprioritized
Component: CirrusSearch
Assignee: [email protected]
Reporter: [email protected]
CC: [email protected], [email protected],
[email protected]
Web browser: ---
Mobile Platform: ---
The snippets we generate actually contain stuff from within tables, image
captions, and headings. These don't look great. If we could smash those into
another field then the snippets would be nicer. We could also use the sentence
fragmenter in the experimental highlighter.
Note: we already do this for the headings. We should do it for tables and
infoboxes and stuff. Maybe we should do it for a css class as well.
--
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l