After testing a little bit the unrar flags I have some conclusions: *You can’t extract individual file(not even with the kb flag). *The only way you can extract each part is if they have all the other parts in the same directory.
So my question now if there is a way to upload bunch of files(represents all the parts of one big file)to Tika and during the extraction process they will be located at the same directory so the extraction would be successful? On Tue, 2 May 2023 at 22:32 Tim Allison <[email protected]> wrote: > K. Let's discuss and plot and plan: > https://issues.apache.org/jira/browse/TIKA-4026 > > On Tue, May 2, 2023 at 3:18 PM שי ברק <[email protected]> wrote: > > > > I found the solution for it on the web but > > haven’t tried it yet(probably will run it tomorrow). > > In my opinion you first should try to unrar regularly(with no flags) and > if you get that kind of exception then you add the -kb flag and try to > unrar again . > > Because honestly I don’t know how it behaves on regular rar files when > you add that flag :) > > > > > > On Tue, 2 May 2023 at 22:07 Tim Allison <[email protected]> wrote: > >> > >> fix -> "on regular rar files". > >> > >> On Tue, May 2, 2023 at 2:59 PM Tim Allison <[email protected]> wrote: > >> > > >> > If adding the -kb flag solves your problem, we can add it > >> > (optionally?). Do we want to add it always or does it need to be > >> > configurable on a per-file parse? If we add it generally, will it > >> > change behavior on regular unrar files? > >> > > >> > On Tue, May 2, 2023 at 2:46 PM שי ברק <[email protected]> wrote: > >> > > > >> > > I’m not sure I’m able to do so because the way I use the Tika > server in my code. > >> > > But I do know that when you use the unrar command, you can specify > the -kb flag for those kinds of files, is there anything you can do about > it so I can pass that param in my Tika-config or something like that? > >> > > > >> > > On Tue, 2 May 2023 at 21:03 Tim Allison <[email protected]> > wrote: > >> > >> > >> > >> Unfortunately, Tika doesn't have a good way to handle multifile > inputs > >> > >> for one logical file. :( Can you preprocess to concatenate or > >> > >> similar? That may be a silly question...I'm not that familiar with > >> > >> the rar format. > >> > >> > >> > >> On Tue, May 2, 2023 at 1:58 PM שי ברק <[email protected]> > wrote: > >> > >> > > >> > >> > Hey, > >> > >> > > >> > >> > I’m trying to extract splitting RAR file. > >> > >> > I have 15 portions of the one big file. > >> > >> > For each file I get the following exception: > >> > >> > “You need to start extracting from a previous volume to unpack”. > >> > >> > > >> > >> > I know that “unrar” enables to extract these kinds of files but > I’m wondering how Tika handles this and if there’s something to do about it? > >> > >> > >
