Hi, I would like to know the best strategy/standards to follow for indexing multiple document types thru SOLR.
In other words, let us say we have a file upload form thru which user woudl upload the files of different types (text, html, xml, word docs,excel sheets, pdf, jpg, gif..etc) Once we save the files into the hard disk at server side, we need to initiate the SOLR indexing. What would be the best strategy to achieve this and what are the libraries to be used for different content/document types. So far used pdfbox to read pdf files. Please suggest for all the possible content/document types Best Regards, Kranti K K Parisa