On 26.10.2010 19:03, Ken Krugler wrote:
>
> On Oct 21, 2010, at 12:28pm, Jukka Zitting wrote:
>
>> Hi,
>>
>> We're planning to release Jackrabbit 2.2 at the end of November, and
>> it would be great to have Tika 0.8 out by then for use as a
>> dependency. Ideally I'd like to see 0.8 out within the next few weeks.
>> Chris, are you in for another release? I can also cut the release if
>> you're busy.
>>
>> I guess the only big blocker we have are the references to custom
>> Maven repositories. If we can't get the dependencies to Maven Central
>> soon, I propose that we push the related code to a separate sandbox
>> component that we only distribute as source for now.
>>
>> Beyond that I'd like to come up with some mechanism by which the cool
>> container-aware detectors that Nick added could be better integrated
>> with our default detectors. And I should have a new PDFBox release out
>> by the end of this week for use in Tika 0.8.
>>
>> Anything else we should look for in the release?
>
> Nothing else that I can think of.
>
> I just committed a fix for
> https://issues.apache.org/jira/browse/TIKA-394, which was bugging me.
>
> -- Ken
>
https://issues.apache.org/jira/browse/TIKA-539

I reported this some time ago on the Tika user mailing list. My use case is to retrieve a page with HttpClient and use the information in the HTTP header to guide the metadata extraction.

Regards,
Reinhard
