Sorry, that my question was not clear. Initially when indexed pdf files it showed the data within this pdf in the contents field.as follows:(this is output for initially indexed documents) <str name="contents"> Cloud ctured As tale in size as well as complexity. We need a cloud based system that will solve this problem. Provide interfaces to registeP CSS Client Measurements Benchmarkinse times by varying Number of documents fromnds to millions Nuervers from 1 to 5 Storage and search options as discussed abo </str>
But for newly indexed documents, the contents field is empty, Actually coding.pdf is of 3mb size, but as shown in the output the contents of this pdf are not extracted, indexing extracts the metadata,but not the contents of the file, the contents field is empty, <str name="contents"></str> what is the reason for this? Is is because of some jar missing? -- View this message in context: http://lucene.472066.n3.nabble.com/using-extract-handler-data-not-extracted-tp4110850p4110873.html Sent from the Solr - User mailing list archive at Nabble.com.