Which collection are you trying to index? Is it localDocs or books? You can also try running through the steps in Exercise 1 at the link above to post data to techproducts; in general it should work end to end.
solr-7.7.0:$ bin/post -c techproducts example/exampledocs/*

Which documents do you

On Mon, Sep 23, 2019 at 3:31 PM Pasle Choix <pasle.ch...@welynx.ca> wrote:

> Thank you Susheel,
>
> Here is what I see
> from /opt/solr-7.7.2/server/solr/configsets/_default/conf/solrconfig.xml:
>
> <requestHandler name="/update/extract"
>                 startup="lazy"
>                 class="solr.extraction.ExtractingRequestHandler" >
>   <lst name="defaults">
>     <str name="lowernames">true</str>
>     <str name="fmap.meta">ignored_</str>
>     <str name="fmap.content">_text_</str>
>   </lst>
> </requestHandler>
>
> Is there anything wrong with it and how to fix it?
>
> Thank you.
>
> Pasle Choix
>
> On Mon, Sep 23, 2019 at 2:09 PM Susheel Kumar <susheel2...@gmail.com>
> wrote:
>
>> Not sure which configuration you are using but double check
>> solrconfig.xml to have entries like below and have below sr_mv_txt below in
>> schema.xml for storing and indexing.
>>
>> <requestHandler name="/update/extract"
>>                 startup="lazy"
>>                 class="solr.extraction.ExtractingRequestHandler" >
>>   <lst name="defaults">
>>     <str name="lowernames">true</str>
>>     <str name="fmap.meta">ignored_</str>
>>     <str name="fmap.content">sr_mv_txt</str>
>>   </lst>
>> </requestHandler>
>>
>> Thnx
>>
>> On Thu, Sep 19, 2019 at 11:02 PM PasLe Choix <paslecho...@gmail.com>
>> wrote:
>>
>>> I am on Solr 7.7, according to the official document:
>>> https://lucene.apache.org/solr/guide/7_7/solr-tutorial.html
>>> Although it is mentioned Post Tool can index a directory of files, and
>>> can handle HTML, PDF, Office formats like Word, however no example
>>> working command is given.
>>>
>>> ./bin/post -c localDocs ~/Documents
>>> Error:<p>Problem accessing /solr/books/update. Reason:
>>> <pre> Not Found</pre></p>
>>>
>>> or if I directly upload a pdf as Document through Admin GUI, I will get
>>> Unsupported ContentType: application/pdf Not in: [application/xml,
>>> application/csv, application/json, text/json, text/csv, text/xml,
>>> application/javabin]
>>>
>>> Can anyone please share the correct way to index on pdf/doc/docx, etc.?
>>> through both Admin GUI and command line.
>>>
>>> Thank you very much.
>>>
>>> Pasle Choix
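
For the PDF/Word question in the original message, here is a minimal sketch of sending a single file straight to the /update/extract handler quoted above, rather than to plain /update (which is the handler that rejects application/pdf). It assumes Solr is listening on localhost:8983, that the target collection exists and has the extraction handler enabled, and that the document id and file path are placeholders; none of these values come from the thread.

```shell
#!/bin/sh
# Sketch only: post one rich document (PDF, Word, ...) to the
# ExtractingRequestHandler at /update/extract.
# Assumptions: base URL, collection name, id, and file path are placeholders.

SOLR_URL="${SOLR_URL:-http://localhost:8983/solr}"

# Build the extract URL for a collection ($1) and a document id ($2).
# literal.id sets the document's id field; commit=true makes it searchable at once.
extract_url() {
    printf '%s/%s/update/extract?literal.id=%s&commit=true\n' "$SOLR_URL" "$1" "$2"
}

# Upload a file ($3) as multipart form data to collection $1 with id $2.
post_file() {
    curl "$(extract_url "$1" "$2")" -F "myfile=@$3"
}

# Example call, left commented out so sourcing this file has no side effects:
# post_file localDocs doc1 "$HOME/Documents/sample.pdf"
```

If this request succeeds while bin/post still reports "Not Found", the collection name passed to -c is the first thing to check, since the error above mentions /solr/books/update even though the command names localDocs.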