On Fri, Apr 27, 2012 at 11:02 AM, Chad <[email protected]> wrote:
> On Fri, Apr 27, 2012 at 1:59 PM, Yaron Koren <[email protected]> wrote: > > Are there any MediaWiki extensions that allow for searching through the > > text of uploaded files, such as Word documents, PDFs, etc. - whether it's > > part of the standard search results or in a separate interface? Or is > > anyone working on such a thing? > > > > Aren't the contents of PDFs indexed in Lucene? > Nope, they're extracted and stuffed in metadata but not yet stored for search -- see https://bugzilla.wikimedia.org/show_bug.cgi?id=21061 and related. I have seem a couple one-off extensions for indexing .doc files or such, but don't recall specifically what they are; should be floating somewhere on www.mediawiki.org but I don't know how up to date or reliable they are. -- brion _______________________________________________ MediaWiki-l mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
