Hi,

I would like to know the best strategy/standards to follow for indexing
multiple document types thru SOLR.

In other words, let us say we have a file upload form thru which user woudl
upload the files of different types (text, html, xml, word docs,excel
sheets, pdf, jpg, gif..etc)
Once we save the files into the hard disk at server side, we need to
initiate the SOLR indexing.

What would be the best strategy to achieve this and what are the libraries
to be used for different content/document types.

So far used pdfbox to read pdf files. Please suggest for all the possible
content/document types

Best Regards,
Kranti K K Parisa

Reply via email to