Jianer:

Getting your head around the configs is, indeed, "exciting" at times.

I just wanted to caution you that using ExtractingRequestHandler
puts the Tika parsing load on the Solr server. That doesn't scale,
because the same machine that's serving queries and indexing
is _also_ parsing potentially very large files. It may not matter
if you only do it occasionally, but if you're going to index a large
number of files and/or do this continuously, you probably
want to move the parsing off Solr. Here's an example that also pulls
from a DB, but the DB bits can be removed easily.

https://lucidworks.com/blog/2012/02/14/indexing-with-solrj/
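
To make the "parse on the client, send Solr only the extracted fields"
idea concrete, here is a minimal sketch. It is not the code from the
post above: it uses only the JDK (11+) and posts JSON to the regular
/update handler. The TextExtractor interface, the content_txt field
name, and the URL in the usage line are assumptions for illustration.
The extractor here just reads plain text; a real parser such as Tika
would run in its place, on the client machine rather than on Solr.

```java
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

public class ClientSideIndexer {

    /** Hypothetical hook: where a real parser (e.g. Tika) would run, off the Solr server. */
    interface TextExtractor {
        String extract(Path file) throws IOException;
    }

    /** Escapes a string so it can be embedded in a JSON string literal. */
    public static String jsonEscape(String s) {
        StringBuilder sb = new StringBuilder(s.length() + 16);
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            switch (c) {
                case '"':  sb.append("\\\""); break;
                case '\\': sb.append("\\\\"); break;
                case '\n': sb.append("\\n");  break;
                case '\r': sb.append("\\r");  break;
                case '\t': sb.append("\\t");  break;
                default:
                    if (c < 0x20) {
                        // Remaining control characters become 4-digit hex escapes.
                        sb.append(String.format("\\u%04x", (int) c));
                    } else {
                        sb.append(c);
                    }
            }
        }
        return sb.toString();
    }

    /** Builds one Solr document. "content_txt" is an assumed field name; use one your schema defines. */
    public static String toJsonDoc(String id, String text) {
        return "{\"id\":\"" + jsonEscape(id) + "\",\"content_txt\":\"" + jsonEscape(text) + "\"}";
    }

    /** Wraps already-serialized documents in the JSON array that /update accepts. */
    public static String toJsonBatch(List<String> docs) {
        return "[" + String.join(",", docs) + "]";
    }

    /** Posts a batch to the given /update URL and returns the HTTP status code. */
    public static int postBatch(HttpClient http, String updateUrl, String body)
            throws IOException, InterruptedException {
        HttpRequest request = HttpRequest.newBuilder(URI.create(updateUrl))
                .header("Content-Type", "application/json")
                .POST(HttpRequest.BodyPublishers.ofString(body, StandardCharsets.UTF_8))
                .build();
        return http.send(request, HttpResponse.BodyHandlers.ofString()).statusCode();
    }

    public static void main(String[] args) throws Exception {
        Path dir = Path.of(args[0]);   // directory of files to index
        String updateUrl = args[1];    // the collection's /update URL

        // Stand-in extractor: reads the file as plain text. Swap in a real parser here.
        TextExtractor extractor = file -> Files.readString(file);

        List<Path> files;
        try (Stream<Path> listing = Files.list(dir)) {
            files = listing.filter(Files::isRegularFile).collect(Collectors.toList());
        }

        List<String> docs = new ArrayList<>();
        for (Path file : files) {
            docs.add(toJsonDoc(file.toString(), extractor.extract(file)));
        }

        int status = postBatch(HttpClient.newHttpClient(), updateUrl, toJsonBatch(docs));
        System.out.println("Solr responded with HTTP " + status);
    }
}
```

Assuming a collection named cloud-test on localhost, it could be run as:
java ClientSideIndexer.java /path/to/files "http://localhost:8983/solr/cloud-test/update?commit=true"
The point of the design is that Solr only ever receives small,
already-extracted fields, so the heavy parsing can be moved to (and
scaled on) as many client machines as you like.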

Best,
Erick

On Sun, Dec 20, 2015 at 9:29 PM, 周建二 <zhoujia...@ict.ac.cn> wrote:
> Hi Shawn, thanks for your reply. :)
>
>
> The 404 occurred because the /update/extract handler is not defined in my
> collection's solrconfig.xml file: I had uploaded basic_configs/conf to
> ZooKeeper. When I upload sample_techproducts_configs instead, everything
> works.
>
>
> I am new to Solr. I am now going to learn schema.xml and
> solrconfig.xml, and try to build my own config for my dataset based on
> basic_configs.
>
>
> Thanks again.
> Jianer
>
>
>> -----Original Message-----
>> From: "Shawn Heisey" <apa...@elyograg.org>
>> Sent: Sunday, December 20, 2015
>> To: solr-user@lucene.apache.org
>> Cc:
>> Subject: Re: Some problems when upload data to index in cloud environment
>>
>> On 12/18/2015 6:16 PM, 周建二 wrote:
>> > I am building a SolrCloud production environment. My Solr version is
>> > 5.3.1. The environment consists of three nodes running CentOS 6.5. First I
>> > set up a ZooKeeper ensemble on the three nodes, then ran Solr on
>> > the three nodes, and finally created a collection consisting of three
>> > shards, each with two replicas. After that, we can see the cloud
>> > structure on the Solr Admin page.
>>
>> <snip>
>>
>> > <body><h2>HTTP ERROR 404</h2>
>> >
>> > <p>Problem accessing /solr/cloud-test/update/extract. Reason:
>>
>> One of two problems is likely:  Either there is no collection named
>> "cloud-test" on your cloud, or the /update/extract handler is not
>> defined in that collection's solrconfig.xml file.  The active version of
>> this file lives in zookeeper when you're running SolrCloud.
>>
>> If you're sure a collection with this name exists, how exactly did you
>> create it?  Was it built with one of the sample configs or with a config
>> that you built yourself?
>>
>> Of the three configsets included with the Solr download,
>> data_driven_schema_configs and sample_techproducts_configs contain the
>> /update/extract handler.  The configset named basic_configs does NOT
>> contain the handler.
>>
>> Thanks,
>> Shawn
>>
>
>
>
