Hi, I posted this issue when working with dspace 4.x, but after installing the last version, the problem remains.
When running the index-discovery command, just a small subset of the documents are indexed and shown in the list of titles, authors and subjects. Checking the solr logs, there are several entries similar to this one: 2015-12-17 01:50:58,857 ERROR org.dspace.discovery.SolrServiceImpl @ Error while writing item to discovery index: 12968/7898 message:Expected mime type application/octet-stream but got text/html. ... org.dspace.solr.filters.LocalHostRestrictionFilter.doFilter(LocalHostRestrictionFilter.java:50) </pre></p><p><b>root cause</b> <pre>java.lang.NoClassDefFoundError: org/apache/poi/poifs/filesystem/NPOIFSFileSystem org.apache.tika.parser.microsoft.POIFSContainerDetector.detect(POIFSContainerDetector.java:143) org.apache.tika.detect.CompositeDetector.detect(CompositeDetector.java:61) org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:113) org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(ExtractingDocumentLoader.java:219) org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:74) org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:135) org.apache.solr.core.RequestHandlers$LazyRequestHandlerWrapper.handleRequest(RequestHandlers.java:246) org.apache.solr.core.SolrCore.execute(SolrCore.java:1967) org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:777) org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:418) org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:207) org.dspace.solr.filters.LocalHostRestrictionFilter.doFilter(LocalHostRestrictionFilter.java:50) It seems that solr is not able of indexing some files (some of them seem to be non-searchable PDFs). But at least document metadata (title,author, abstract, etc.) should be indexed. Any idea on this issue? Thanks in advance, Ricardo -- You received this message because you are subscribed to the Google Groups "DSpace Technical Support" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/dspace-tech. For more options, visit https://groups.google.com/d/optout.
