Yes, GZIP compression would do the job for me, but having plain folders
and files as output would be even better.

How complicated would it be to add an option to the Tika source to output
folders and files directly, without packing them into an archive format?
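
In the meantime, unpacking on the client side might be a workable stopgap.
A minimal sketch in Python, assuming the /all response is the uncompressed
TAR variant; the function name unpack_tar_stream is made up for
illustration and is not part of Tika:

```python
import os
import shutil
import tarfile


def unpack_tar_stream(stream, dest_dir):
    """Write the regular files of an uncompressed TAR byte stream into dest_dir.

    Returns the names of the members that were written.
    """
    os.makedirs(dest_dir, exist_ok=True)
    dest_root = os.path.realpath(dest_dir)
    written = []
    # Mode "r|" reads a non-seekable stream (such as piped output) sequentially.
    with tarfile.open(fileobj=stream, mode="r|") as archive:
        for member in archive:
            if not member.isfile():
                continue  # ignore directories, links and special entries
            target = os.path.realpath(os.path.join(dest_root, member.name))
            # Skip entries whose path would land outside dest_dir.
            if os.path.commonpath([dest_root, target]) != dest_root:
                continue
            os.makedirs(os.path.dirname(target), exist_ok=True)
            with archive.extractfile(member) as src, open(target, "wb") as out:
                shutil.copyfileobj(src, out)
            written.append(member.name)
    return written
```

Passing sys.stdin.buffer as the stream would let this sit at the end of a
curl pipe, so the archive never has to be saved to disk first.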


On Thu, Aug 7, 2014 at 3:31 PM, Sergey Beryozkin <[email protected]>
wrote:

> By the way, would a default GZIP compression suit ?
> If yes we can have it done even without the extra CXF changes.
>
> Sergey
>
>
> On 07/08/14 16:15, Sergey Beryozkin wrote:
>
>> Hi
>>
>> I can try to enhance a CXF GzipOutInterceptor (at CXF level) to use a
>> compressing Deflater in GZIP compatible mode. The server will react to a
>> client accepting GZIP and compress the out payloads.
>>
>> I think it would be a good idea to have a Tika server war module
>> introduced so that users can easily add custom out/in filters to the
>> JAX-RS endpoint.
>>
>> I guess we can do it for 1.7
>> Sergey
>>
>>
>>
>> On 07/08/14 15:44, Bratislav Stojanovic wrote:
>>
>>> Hi,
>>>
>>> I'm trying to get text, metadata and attachments all in one request
>>> using tika-server (JAX-RS), but
>>> the only thing I can get as an output is either uncompressed ZIP or TAR.
>>>
>>> Is there any way to :
>>>
>>> - set compression level? Having uncompressed ZIP/TAR with resources
>>> actually occupies more space than having plain __METADATA__ , __TEXT__
>>> and other files because of additional ZIP/TAR headers. If I decide to
>>> use ZIP/TAR I would like to save some hd space.
>>>
>>> - or use a simple folder instead of an output file, with all extracted
>>> resources inside? I would prefer this because I would not have to
>>> decompress the output to reach the extracted resources.
>>>
>>> Basically, I would like to specify compression or folder in this
>>> command :
>>>
>>> curl -T example.doc http://localhost:9998/all > outputFolder
>>>
>>> I haven't found any related info on
>>> http://wiki.apache.org/tika/TikaJAXRS or mailing list archives, so
>>> please help :)
>>>
>>> --
>>> Bratislav Stojanovic, M.Sc.
>>>
>>
>>


-- 
Bratislav Stojanovic, M.Sc.
