Yes, GZIP compression will do the job for me...but having a plain folders and files as an output is even better.
How complicate is to update/add option in tika source to output folders and files directly without packing it into any file format? On Thu, Aug 7, 2014 at 3:31 PM, Sergey Beryozkin <[email protected]> wrote: > By the way, would a default GZIP compression suit ? > If yes we can have it done even without the extra CXF changes. > > Sergey > > > On 07/08/14 16:15, Sergey Beryozkin wrote: > >> Hi >> >> I can try to enhance a CXF GzipOutInterceptor (at CXF level) to use a >> compressing Deflater in GZIP compatible mode. The server will react to a >> client accepting GZIP and compress the out payloads. >> >> I think it would be a good idea to have a Tika server war module >> introduced for users easily add custom out/in filters to the JAX-RS >> endpoint. >> >> I guess we can do it for 1.7 >> Sergey >> DEFLATER >> >> >> >> On 07/08/14 15:44, Bratislav Stojanovic wrote: >> >>> Hi, >>> >>> I'm trying to get text, metadata and attachments all in one request >>> using tika-server (JAX-RS), but >>> the only thing I can get as an output is either uncompressed ZIP or TAR. >>> >>> Is there any way to : >>> >>> - set compression level? Having uncompressed ZIP/TAR with resources >>> actually occupies more space than having plain __METADATA__ , __TEXT__ >>> and other files because of additional ZIP/TAR headers. If I decide to >>> use ZIP/TAR I would like to save some hd space. >>> >>> - or use a simple folder instead of output file with all extracted >>> resources inside? This is desired >>> for me because I don't have to decompress output to reach the extracted >>> resources >>> >>> Basically, I would like to specify compression or folder in this >>> command : >>> >>> curl -T example.doc http://localhost:9998/all > outputFolder >>> >>> I haven't found any related info on >>> http://wiki.apache.org/tika/TikaJAXRS or mailing list archives, so >>> please help :) >>> >>> -- >>> Bratislav Stojanovic, M.Sc. >>> >> >> -- Bratislav Stojanovic, M.Sc.
