OK I think I have these issues fixed... However: I was unable to get Maven to respect UTF-8 encoding of the *.apt sources: no matter what I tried (the various encoding configuration options in pom.xml) it would always replace characters with U+FFFD. So I did the workaround instead: it turns out you can specify \UXXXX in *.apt.
However: I don't have the necessary karma to commit to http://svn.apache.org/repos/asf/tika/site. Dave can you please fix that? Thanks! Mike McCandless http://blog.mikemccandless.com On Wed, Sep 5, 2012 at 9:10 AM, Michael McCandless <[email protected]> wrote: > Thanks Benson, I'll fix. > > Looks like a number of other links are also not working ... when we do > {{{api/org/apache/tika/...}}} we apparently must make that > {{{./api/org/apache/tika/...}}} instead (ie add the ./ prefix). Maven > prints a warning when it's wrong ... > > I'm also trying to fix the broken UTF-8 encoding, eg see Jan Høydahl > in http://tika.apache.org/1.2/index.html: the ø is replaced with the > unicode replacement char (U+FFFD)... > > Mike McCandless > > http://blog.mikemccandless.com > > On Tue, Sep 4, 2012 at 1:55 PM, Benson Margulies <[email protected]> > wrote: >> http://svn.apache.org/repos/asf/tika/site >> >> >> >> On Tue, Sep 4, 2012 at 1:22 PM, Michael McCandless >> <[email protected]> wrote: >>> We should fix that. >>> >>> How can I update the web site...? Is this documented somewhere...? >>> >>> I see that http://wiki.apache.org/tika/ReleaseProcess refers to >>> src/site/src/documentation/content/xdocs/index.xml but that doesn't >>> exist in svn (I see a bunch of .apt sources that seem to correspond to >>> what's live on the site). Can someone provide some pointers...? >>> >>> Mike McCandless >>> >>> http://blog.mikemccandless.com >>> >>> >>> On Sun, Sep 2, 2012 at 7:53 AM, Benson Margulies <[email protected]> >>> wrote: >>>> On this page [1] >>>> >>>> >>>> the first link under 'Getting Started' points back to the page it is >>>> on, rather than to a page that actually documents how to use tika. >>>> >>>> >>>> >>>> http://tika.apache.org/1.2/parser_guide.html#gettingstarted.html
