You can use Tika yourself (recommended); see how I did it in the fsriver project. Alternatively, you can use the mapper attachment plugin, which uses Tika behind the scenes but gives you less control, IMHO.
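To give an idea of the first option, here is a minimal sketch in Python. It is not how fsriver does it (fsriver calls Tika from Java); this swaps in a Tika server reached over HTTP so the example needs only the standard library. The Tika URL, the field names `filename` and `content`, and the helper names are all assumptions for illustration, and the Elasticsearch document URL is left to the caller because its shape depends on your version and index layout.

```python
import json
import urllib.request

# Assumption: a Tika server is running locally on its default port.
TIKA_URL = "http://localhost:9998/tika"


def extract_text(path, tika_url=TIKA_URL):
    """Send the raw file bytes to Tika and return the extracted plain text."""
    with open(path, "rb") as f:
        data = f.read()
    req = urllib.request.Request(
        tika_url,
        data=data,
        method="PUT",
        headers={"Accept": "text/plain"},
    )
    with urllib.request.urlopen(req) as resp:
        return resp.read().decode("utf-8")


def build_document(filename, text):
    """Build the JSON body to index. Field names are illustrative only."""
    return {"filename": filename, "content": text.strip()}


def index_document(doc_url, doc):
    """PUT the document to Elasticsearch at a caller-supplied document URL."""
    body = json.dumps(doc).encode("utf-8")
    req = urllib.request.Request(
        doc_url,
        data=body,
        method="PUT",
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return resp.status


def index_file(path, doc_url):
    """Extract text with Tika, then index it as an ordinary text field."""
    text = extract_text(path)
    return index_document(doc_url, build_document(path, text))
```

Because you build the document yourself, you decide which extracted fields to keep and how to map them, which is the extra control you give up with the plugin.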
About versions: Elasticsearch does not keep old versions around. If you need them, you have to manage that yourself.

HTH

--
David ;-)
Twitter: @dadoonet / @elasticsearchfr / @scrutmydocs

On 16 Jan 2014, at 20:42, ZenMaster80 <[email protected]> wrote:

> - Is there any literature on how to index pdf documents and binary formats
> like images?
> - Versioning question: If I update an already indexed document, I believe ES
> will update the version number. I am wondering if it keeps the previous
> document, what if I needed access to the previous document?
> --
> You received this message because you are subscribed to the Google Groups
> "elasticsearch" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/elasticsearch/a9e8f331-c4bd-4a4c-be5a-b91e4f2f0e26%40googlegroups.com.
> For more options, visit https://groups.google.com/groups/opt_out.

--
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/F03436DE-657A-4D2C-A8E3-83E4B4D12523%40pilato.fr.
