Which collection are you trying to index into? Is it localDocs or books?

You can also run through the steps in Exercise 1 at the link above to post data
to techproducts; in general it should work end to end.

solr-7.7.0:$ bin/post -c techproducts example/exampledocs/*
Which documents do you
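
For rich documents such as PDF or Word files, a minimal sketch of sending a
single file to the /update/extract handler with curl might look like the
following. It assumes Solr is listening on http://localhost:8983/solr, that the
collection already exists, and that the extraction handler is enabled for it.
The helper names extract_url and index_file, the SOLR_URL variable, and the
document id are made up for illustration.

```shell
#!/usr/bin/env bash
# Sketch only: SOLR_URL, the collection name, and the document id are
# assumptions; adjust them to your own installation.
SOLR_URL="${SOLR_URL:-http://localhost:8983/solr}"

# Build the /update/extract URL for a collection and a document id.
# The id is not URL-encoded here, so keep it to simple characters.
extract_url() {
  local collection="$1" doc_id="$2"
  printf '%s/%s/update/extract?literal.id=%s&commit=true\n' \
    "$SOLR_URL" "$collection" "$doc_id"
}

# Send one binary file (PDF, Word, ...) to the extraction handler
# as a multipart upload.
index_file() {
  local collection="$1" doc_id="$2" file="$3"
  curl --fail --silent --show-error \
    "$(extract_url "$collection" "$doc_id")" -F "myfile=@${file}"
}
```

Usage would be something like `bin/solr create -c localDocs` to create the
collection if it does not exist yet, followed by
`index_file localDocs doc1 ~/Documents/report.pdf` (report.pdf is a
hypothetical file name).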

On Mon, Sep 23, 2019 at 3:31 PM Pasle Choix <pasle.ch...@welynx.ca> wrote:

> Thank you Susheel,
>
> Here is what I see
> from /opt/solr-7.7.2/server/solr/configsets/_default/conf/solrconfig.xml:
>
>   <requestHandler name="/update/extract"
>                   startup="lazy"
>                   class="solr.extraction.ExtractingRequestHandler" >
>     <lst name="defaults">
>       <str name="lowernames">true</str>
>       <str name="fmap.meta">ignored_</str>
>       <str name="fmap.content">_text_</str>
>     </lst>
>   </requestHandler>
>
> Is there anything wrong with it, and if so, how do I fix it?
>
> Thank you.
>
> Pasle Choix
>
>
>
> On Mon, Sep 23, 2019 at 2:09 PM Susheel Kumar <susheel2...@gmail.com>
> wrote:
>
>> I'm not sure which configuration you are using, but double-check that
>> solrconfig.xml has an entry like the one below, and that the sr_mv_txt
>> field is defined in schema.xml so the content is stored and indexed.
>>
>> <requestHandler name="/update/extract"
>>                 startup="lazy"
>>                 class="solr.extraction.ExtractingRequestHandler" >
>>   <lst name="defaults">
>>     <str name="lowernames">true</str>
>>     <str name="fmap.meta">ignored_</str>
>>     <str name="fmap.content">sr_mv_txt</str>
>>   </lst>
>> </requestHandler>
>>
>>
>> Thanks
>>
>>
>> On Thu, Sep 19, 2019 at 11:02 PM PasLe Choix <paslecho...@gmail.com>
>> wrote:
>>
>>> I am on Solr 7.7. The official documentation
>>> (https://lucene.apache.org/solr/guide/7_7/solr-tutorial.html)
>>> mentions that the Post Tool can index a directory of files and can
>>> handle HTML, PDF, and Office formats like Word, but it gives no working
>>> example command.
>>>
>>> ./bin/post -c localDocs ~/Documents
>>> Error: <p>Problem accessing /solr/books/update. Reason:
>>> <pre>    Not Found</pre></p>
>>>
>>> Or, if I upload a PDF directly as a document through the Admin GUI, I get:
>>> Unsupported ContentType: application/pdf Not in: [application/xml,
>>> application/csv, application/json, text/json, text/csv, text/xml,
>>> application/javabin]
>>>
>>> Can anyone please share the correct way to index pdf/doc/docx files, etc.,
>>> through both the Admin GUI and the command line?
>>>
>>> Thank you very much.
>>>
>>>
>>> Pasle Choix
>>>
>>
