I'am looking to index ms word and pdf using uploading data with solr cell using apache tika; I just hope use tika to detect corrupt files before indexing and get a list of corrupted file. if its possible. I try runing java -jar tika-app.jar <input_dir> <output_dir> I get in the output_dir all the files of <input_dir> in format xml and all the corrupt file with size 0ko (empty)
- detect corrupt file and build a list of them before in... kostali hassan
- RE: detect corrupt file and build a list of them ... Allison, Timothy B.
- RE: detect corrupt file and build a list of t... Allison, Timothy B.
- RE: detect corrupt file and build a list ... Allison, Timothy B.
- Re: detect corrupt file and build a l... kostali hassan
- RE: detect corrupt file and buil... Allison, Timothy B.
- Re: detect corrupt file and ... kostali hassan
- Re: detect corrupt file ... kostali hassan
- RE: detect corrupt file ... Allison, Timothy B.
- Re: detect corrupt file ... kostali hassan
- RE: detect corrupt file ... Allison, Timothy B.
