Robert Burrell Donkin wrote:
> Jukka Zitting wrote:
>> Hi,
>>
>> On Sun, May 17, 2009 at 9:03 AM, Robert Burrell Donkin
>> <[email protected]> wrote:
>>> IMHO it makes sense to factor out an interface, retain the existing
>>> implementation and create a separate module. this will allow assembler
>>> who don't want to use tika to create applications that don't use it.
>> How about using the org.apache.tika.detect.Detector interface (see below)?
>>
>> Tika comes with default implementations of the interface, but it
>> should be straightforward to implement the interface also based on
>> alternative implementations.
> 
> i see this as a stepping stone. tika already supports most of the
> heuristics rat uses so IMHO it would make more sense to feed back rules
> upstream (either into the default typer, or a variant tuned for
> development).
> 
> a couple of issues that suggest that this might be better than jumping
> to tika right away:
> 
> 1. in terms of interface reuse ATM tika trunk doesn't offer a minimal api[1]
> 2. the latest release (tika 0.2) is not modular

correction 0.3

- robert

Reply via email to