Hi, On Sat, Oct 23, 2010 at 2:18 PM, Ista Pouss <[email protected]> wrote: > - Is there a tika parser for mediawiki > (http://www.mediawiki.org/wiki/MediaWiki) ?
No. MediaWiki uses a database backend instead of a special file format for storing data, so you'd need to use something like the ManifoldCF (http://incubator.apache.org/connectors/) to extract information from a MediaWiki installation. > - Is it possible to write in the supported formats with tika ? No. Tika only supports extracting information from documents, not writing them. You can use the underlying parser libraries like POI or PDFBox directly to produce documents in a specific format. BR, Jukka Zitting
