Hi, thank you for you answer. I'll check the issue with PDF files.
Best regards, Matteo On 2018/02/05 10:12:10, Nick Burch <[email protected]> wrote: > On Mon, 5 Feb 2018, Matteo Alessandroni wrote: > > I'm using Apache Tika to detect a file Mime Type from its base64 > > rapresentation. Unfortunately I don't have other info about the file > > (e.g. extension). > > > > and it gives me "text/plain" for JSON and PDF files, but I would like to > > obtain a more specific information: "application/json", > > "application/pdf" etc... > > You can't detect JSON files from mime magic alone - json doesn't have > anything unique at the start, just lots of possible different things which > also occur in other formats too > > Tika can detect a PDF from the magic bytes at the start just fine. Make > sure you're actually decoding the base64 representation properly > > Nick >
