Re: WELCOME to solr-user@lucene.apache.org

2019-10-20 Thread Erick Erickson
In short, nothing that’s maintained as part of the Apache project. There may be 
commercial products, but I haven’t had occasion to look for one.

Best,
Erick

> On Oct 20, 2019, at 7:42 AM, Wasim S Kazi  wrote:
> 
> Good day
> 
> I would like to get some info or confirmation about configuring Solr 8+ to 
> get content from WCM (Websphere Content Management)
> 
> Essentially, we have manually index data from WCM into Solr and this all 
> works fine. We want to now automate this process, so checking is there is any 
> well established integration method between WCM and Solr. This integration 
> should allow content being indexed automatically, or periodically without 
> human intervention.
> 
> Regards
> Wasim Kazi
> 
> -Original Message-
> From: solr-user-h...@lucene.apache.org 
> Sent: Sunday, October 20, 2019 2:39 PM
> To: Wasim S Kazi 
> Subject: WELCOME to solr-user@lucene.apache.org
> 
> Hi! This is the ezmlm program. I'm managing the solr-user@lucene.apache.org 
> mailing list.
> 
> I'm working for my owner, who can be reached at 
> solr-user-ow...@lucene.apache.org.
> 
> Acknowledgment: I have added the address
> 
>   wasim.s.k...@za.ey.com
> 
> to the solr-user mailing list.
> 
> Welcome to solr-user@lucene.apache.org!
> 
> Please save this message so that you know the address you are subscribed 
> under, in case you later want to unsubscribe or change your subscription 
> address.
> 
> 
> --- Administrative commands for the solr-user list ---
> 
> I can handle administrative requests automatically. Please do not send them 
> to the list address! Instead, send your message to the correct command 
> address:
> 
> To subscribe to the list, send a message to:
>   
> 
> To remove your address from the list, send a message to:
>   
> 
> Send mail to the following for info and FAQ for this list:
>   
>   
> 
> Similar addresses exist for the digest list:
>   
>   
> 
> To get messages 123 through 145 (a maximum of 100 per request), mail:
>   
> 
> To get an index with subject and author for messages 123-456 , mail:
>   
> 
> They are always returned as sets of 100, max 2000 per request, so you'll 
> actually get 100-499.
> 
> To receive all messages with the same subject as message 12345, send a short 
> message to:
>   
> 
> The messages should contain one line or word of text to avoid being treated 
> as sp@m, but I will ignore their content.
> Only the ADDRESS you send to is important.
> 
> You can start a subscription for an alternate address, for example 
> "john@host.domain", just add a hyphen and your address (with '=' instead of 
> '@') after the command word:
> 
> 
> To stop subscription for this address, mail:
> 
> 
> In both cases, I'll send a confirmation message to that address. When you 
> receive it, simply reply to it to complete your subscription.
> 
> If despite following these instructions, you do not get the desired results, 
> please contact my owner at solr-user-ow...@lucene.apache.org. Please be 
> patient, my owner is a lot slower than I am ;-)
> 
> --- Enclosed is a copy of the request I received.
> 
> Return-Path: 
> Received: (qmail 96582 invoked by uid 99); 20 Oct 2019 11:38:52 -
> Received: from pnap-us-west-generic-nat.apache.org (HELO 
> spamd1-us-west.apache.org) (209.188.14.142)
>by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 20 Oct 2019 11:38:52 +
> Received: from localhost (localhost [127.0.0.1])
>by spamd1-us-west.apache.org (ASF Mail Server at 
> spamd1-us-west.apache.org) with ESMTP id 81232C0C8E
>for 
> ;
>  Sun, 20 Oct 2019 11:38:51 + (UTC)
> X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org
> X-Spam-Flag: NO
> X-Spam-Score: -4.8
> X-Spam-Level:
> X-Spam-Status: No, score=-4.8 tagged_above=-999 required=6.31
>tests=[HTML_FONT_LOW_CONTRAST=0.001, HTML_MESSAGE=0.2,
>KAM_SHORT=0.001, RCVD_IN_DNSWL_HI=-5, SPF_HELO_PASS=-0.001,
>SPF_PASS=-0.001] autolearn=disabled
> Received: from mx1-he-de.apache.org ([10.40.0.8])
>by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, 
> port 10024)
>with ESMTP id Kbk25gxC2elm
>for 
> ;
>Sun, 20 Oct 2019 11:38:50 + (UTC)
> Received-SPF: Pass (mailfrom) identity=mailfrom; client-ip=199.49.1.52; 
> helo=em01.ey.com; envelope-from=wasim.s.k...@za.ey.com; receiver=
> Received: from em01.ey.com (em01.ey.com [199.49.1.52])
>by mx1-he-de.apache.org (ASF Mail Server at mx1-he-de.apache.org) with 
> ESMTPS id 86E307DDFA
>for 
> ;
>  Sun, 20 Oct 2019 11:38:49 + (UTC)
> IronPort-SDR: 
> 0i+SrmLgncBfCsgon

RE: WELCOME to solr-user@lucene.apache.org

2019-10-20 Thread Wasim S Kazi
Good day

I would like to get some info or confirmation about configuring Solr 8+ to get 
content from WCM (Websphere Content Management)

Essentially, we have manually index data from WCM into Solr and this all works 
fine. We want to now automate this process, so checking is there is any well 
established integration method between WCM and Solr. This integration should 
allow content being indexed automatically, or periodically without human 
intervention.

Regards
Wasim Kazi

-Original Message-
From: solr-user-h...@lucene.apache.org 
Sent: Sunday, October 20, 2019 2:39 PM
To: Wasim S Kazi 
Subject: WELCOME to solr-user@lucene.apache.org

Hi! This is the ezmlm program. I'm managing the solr-user@lucene.apache.org 
mailing list.

I'm working for my owner, who can be reached at 
solr-user-ow...@lucene.apache.org.

Acknowledgment: I have added the address

   wasim.s.k...@za.ey.com

to the solr-user mailing list.

Welcome to solr-user@lucene.apache.org!

Please save this message so that you know the address you are subscribed under, 
in case you later want to unsubscribe or change your subscription address.


--- Administrative commands for the solr-user list ---

I can handle administrative requests automatically. Please do not send them to 
the list address! Instead, send your message to the correct command address:

To subscribe to the list, send a message to:
   

To remove your address from the list, send a message to:
   

Send mail to the following for info and FAQ for this list:
   
   

Similar addresses exist for the digest list:
   
   

To get messages 123 through 145 (a maximum of 100 per request), mail:
   

To get an index with subject and author for messages 123-456 , mail:
   

They are always returned as sets of 100, max 2000 per request, so you'll 
actually get 100-499.

To receive all messages with the same subject as message 12345, send a short 
message to:
   

The messages should contain one line or word of text to avoid being treated as 
sp@m, but I will ignore their content.
Only the ADDRESS you send to is important.

You can start a subscription for an alternate address, for example 
"john@host.domain", just add a hyphen and your address (with '=' instead of 
'@') after the command word:


To stop subscription for this address, mail:


In both cases, I'll send a confirmation message to that address. When you 
receive it, simply reply to it to complete your subscription.

If despite following these instructions, you do not get the desired results, 
please contact my owner at solr-user-ow...@lucene.apache.org. Please be 
patient, my owner is a lot slower than I am ;-)

--- Enclosed is a copy of the request I received.

Return-Path: 
Received: (qmail 96582 invoked by uid 99); 20 Oct 2019 11:38:52 -
Received: from pnap-us-west-generic-nat.apache.org (HELO 
spamd1-us-west.apache.org) (209.188.14.142)
by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 20 Oct 2019 11:38:52 +
Received: from localhost (localhost [127.0.0.1])
by spamd1-us-west.apache.org (ASF Mail Server at 
spamd1-us-west.apache.org) with ESMTP id 81232C0C8E
for 
;
 Sun, 20 Oct 2019 11:38:51 + (UTC)
X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org
X-Spam-Flag: NO
X-Spam-Score: -4.8
X-Spam-Level:
X-Spam-Status: No, score=-4.8 tagged_above=-999 required=6.31
tests=[HTML_FONT_LOW_CONTRAST=0.001, HTML_MESSAGE=0.2,
KAM_SHORT=0.001, RCVD_IN_DNSWL_HI=-5, SPF_HELO_PASS=-0.001,
SPF_PASS=-0.001] autolearn=disabled
Received: from mx1-he-de.apache.org ([10.40.0.8])
by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 
10024)
with ESMTP id Kbk25gxC2elm
for 
;
Sun, 20 Oct 2019 11:38:50 + (UTC)
Received-SPF: Pass (mailfrom) identity=mailfrom; client-ip=199.49.1.52; 
helo=em01.ey.com; envelope-from=wasim.s.k...@za.ey.com; receiver=
Received: from em01.ey.com (em01.ey.com [199.49.1.52])
by mx1-he-de.apache.org (ASF Mail Server at mx1-he-de.apache.org) with 
ESMTPS id 86E307DDFA
for 
;
 Sun, 20 Oct 2019 11:38:49 + (UTC)
IronPort-SDR: 
0i+SrmLgncBfCsgonKDgt+Ll+5TCuN/hbDHsUS1V98D3LWk4dgqQE9qJPrbcZyYjLWRYXieztn
 Fjky8vaAREXw==
X-IronPort-AV: E=Sophos;i="5.67,319,1566864000";
   d="gif'147?scan'147,208,217,147";a="240843155"
Received: from unknown (HELO DERUSRMPEXTP02.ey.net) ([10.151.33.58])
  by defrakaeyip01.eurw.ey.net with ESMTP; 20 Oct 2019 11:38:42 +
ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none;  
b=Em+4qSC0AqZ4Ei+nYLvNi3BwVnwrjtXdFD2W5lnj3CNDBO0x9JJBOn5yWMUj4JNnCnhg4R524D5O+lX6dYrYut/tTe09g0pnRemmla9J7icpboVqK6i5gXJLHLFA9dERNQwRDieNKqKEkei0eIbCzLMJeVld1lvj7CJiXIZPZIySU5hHZI7N5+Q9i1eb4GRYxATio7ibfxNknvf3/2298wyUhY9EuQEEuTWNrylkhMtQORgdlgv+mEdpzGJO+FaiG0fv1MQ0TO8JcgybSjJ14hG7xYlhkGEO39qzV7Q9EDbsPwJuupwZg/r4XAIIZ0Bjc0f7YX11S2BhnV8mdm+T+A==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d

Re: WELCOME to solr-user@lucene.apache.org

2018-06-25 Thread Erick Erickson
First, understand that this list is maintained by volunteers, so
answers aren't guaranteed.

If you require dedicated support there are various organizations that
provide same, but
you'll have to contact them.

That said, the community is quite responsive, just post questions to
solr-user like this
one.

Best,
Erick

On Sun, Jun 24, 2018 at 11:35 PM, Srinivas Muppu (US)
 wrote:
> Hi Solr Team,
>
> We are facing Solr System Configuration issues which needs help. Please let
> us know whom to post our Questions/Queries.
>
> Thanks,
> Srinivas
>
> On Mon, Jun 25, 2018 at 2:22 AM,  wrote:
>
>> Hi! This is the ezmlm program. I'm managing the
>> solr-user@lucene.apache.org mailing list.
>>
>> I'm working for my owner, who can be reached
>> at solr-user-ow...@lucene.apache.org.
>>
>> Acknowledgment: I have added the address
>>
>>    srinivas.mu...@pwc.com
>>
>> to the solr-user mailing list.
>>
>> Welcome to solr-user@lucene.apache.org!
>>
>> Please save this message so that you know the address you are
>> subscribed under, in case you later want to unsubscribe or change your
>> subscription address.
>>
>>
>> --- Administrative commands for the solr-user list ---
>>
>> I can handle administrative requests automatically. Please
>> do not send them to the list address! Instead, send
>> your message to the correct command address:
>>
>> To subscribe to the list, send a message to:
>>
>>
>> To remove your address from the list, send a message to:
>>
>>
>> Send mail to the following for info and FAQ for this list:
>>
>>
>>
>> Similar addresses exist for the digest list:
>>
>>
>>
>> To get messages 123 through 145 (a maximum of 100 per request), mail:
>>
>>
>> To get an index with subject and author for messages 123-456 , mail:
>>
>>
>> They are always returned as sets of 100, max 2000 per request,
>> so you'll actually get 100-499.
>>
>> To receive all messages with the same subject as message 12345,
>> send a short message to:
>>
>>
>> The messages should contain one line or word of text to avoid being
>> treated as sp@m, but I will ignore their content.
>> Only the ADDRESS you send to is important.
>>
>> You can start a subscription for an alternate address,
>> for example "john@host.domain", just add a hyphen and your
>> address (with '=' instead of '@') after the command word:
>> 
>>
>> To stop subscription for this address, mail:
>> 
>>
>> In both cases, I'll send a confirmation message to that address. When
>> you receive it, simply reply to it to complete your subscription.
>>
>> If despite following these instructions, you do not get the
>> desired results, please contact my owner at
>> solr-user-ow...@lucene.apache.org. Please be patient, my owner is a
>> lot slower than I am ;-)
>>
>> --- Enclosed is a copy of the request I received.
>>
>> Return-Path: 
>> Received: (qmail 84164 invoked by uid 99); 25 Jun 2018 06:22:12 -
>> Received: from pnap-us-west-generic-nat.apache.org (HELO
>> spamd1-us-west.apache.org) (209.188.14.142)
>> by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 25 Jun 2018 06:22:12
>> +
>> Received: from localhost (localhost [127.0.0.1])
>> by spamd1-us-west.apache.org (ASF Mail Server at
>> spamd1-us-west.apache.org) with ESMTP id 63CB9CA4A5
>> for > pwc@lucene.apache.org>; Mon, 25 Jun 2018 06:22:12 + (UTC)
>> X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org
>> X-Spam-Flag: NO
>> X-Spam-Score: -1
>> X-Spam-Level:
>> X-Spam-Status: No, score=-1 tagged_above=-999 required=6.31
>> tests=[HTML_MESSAGE=2, KAM_BADIPHTTP=2, KAM_SHORT=0.001,
>> NORMAL_HTTP_TO_IP=0.001, RCVD_IN_DNSWL_HI=-5, SPF_HELO_PASS=-0.001,
>> SPF_PASS=-0.001] autolearn=disabled
>> Received: from mx1-lw-us.apache.org ([10.40.0.8])
>> by localhost (spamd1-us-west.apache.org [10.40.0.7])
>> (amavisd-new, port 10024)
>> with ESMTP id NuBVNjDIIyqW
>> for > pwc@lucene.apache.org>;
>> Mon, 25 Jun 2018 06:22:10 + (UTC)
>> Received: from lxsmpr20.pwc.com (lxsmpr20.pwc.com [155.201.248.112])
>> by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org)
>> with ESMTPS id 500895F1B4
>> for > pwc@lucene.apache.org>; Mon, 25 Jun 2018 06:22:10 + (UTC)
>> Received: from mail-vk0-f71.google.com (m

Re: WELCOME to solr-user@lucene.apache.org

2018-06-25 Thread Srinivas Muppu (US)
Hi Solr Team,

We are facing Solr System Configuration issues which needs help. Please let
us know whom to post our Questions/Queries.

Thanks,
Srinivas

On Mon, Jun 25, 2018 at 2:22 AM,  wrote:

> Hi! This is the ezmlm program. I'm managing the
> solr-user@lucene.apache.org mailing list.
>
> I'm working for my owner, who can be reached
> at solr-user-ow...@lucene.apache.org.
>
> Acknowledgment: I have added the address
>
>srinivas.mu...@pwc.com
>
> to the solr-user mailing list.
>
> Welcome to solr-user@lucene.apache.org!
>
> Please save this message so that you know the address you are
> subscribed under, in case you later want to unsubscribe or change your
> subscription address.
>
>
> --- Administrative commands for the solr-user list ---
>
> I can handle administrative requests automatically. Please
> do not send them to the list address! Instead, send
> your message to the correct command address:
>
> To subscribe to the list, send a message to:
>
>
> To remove your address from the list, send a message to:
>
>
> Send mail to the following for info and FAQ for this list:
>
>
>
> Similar addresses exist for the digest list:
>
>
>
> To get messages 123 through 145 (a maximum of 100 per request), mail:
>
>
> To get an index with subject and author for messages 123-456 , mail:
>
>
> They are always returned as sets of 100, max 2000 per request,
> so you'll actually get 100-499.
>
> To receive all messages with the same subject as message 12345,
> send a short message to:
>
>
> The messages should contain one line or word of text to avoid being
> treated as sp@m, but I will ignore their content.
> Only the ADDRESS you send to is important.
>
> You can start a subscription for an alternate address,
> for example "john@host.domain", just add a hyphen and your
> address (with '=' instead of '@') after the command word:
> 
>
> To stop subscription for this address, mail:
> 
>
> In both cases, I'll send a confirmation message to that address. When
> you receive it, simply reply to it to complete your subscription.
>
> If despite following these instructions, you do not get the
> desired results, please contact my owner at
> solr-user-ow...@lucene.apache.org. Please be patient, my owner is a
> lot slower than I am ;-)
>
> --- Enclosed is a copy of the request I received.
>
> Return-Path: 
> Received: (qmail 84164 invoked by uid 99); 25 Jun 2018 06:22:12 -
> Received: from pnap-us-west-generic-nat.apache.org (HELO
> spamd1-us-west.apache.org) (209.188.14.142)
> by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 25 Jun 2018 06:22:12
> +
> Received: from localhost (localhost [127.0.0.1])
> by spamd1-us-west.apache.org (ASF Mail Server at
> spamd1-us-west.apache.org) with ESMTP id 63CB9CA4A5
> for  pwc@lucene.apache.org>; Mon, 25 Jun 2018 06:22:12 + (UTC)
> X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org
> X-Spam-Flag: NO
> X-Spam-Score: -1
> X-Spam-Level:
> X-Spam-Status: No, score=-1 tagged_above=-999 required=6.31
> tests=[HTML_MESSAGE=2, KAM_BADIPHTTP=2, KAM_SHORT=0.001,
> NORMAL_HTTP_TO_IP=0.001, RCVD_IN_DNSWL_HI=-5, SPF_HELO_PASS=-0.001,
> SPF_PASS=-0.001] autolearn=disabled
> Received: from mx1-lw-us.apache.org ([10.40.0.8])
> by localhost (spamd1-us-west.apache.org [10.40.0.7])
> (amavisd-new, port 10024)
> with ESMTP id NuBVNjDIIyqW
> for  pwc@lucene.apache.org>;
> Mon, 25 Jun 2018 06:22:10 + (UTC)
> Received: from lxsmpr20.pwc.com (lxsmpr20.pwc.com [155.201.248.112])
> by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org)
> with ESMTPS id 500895F1B4
> for  pwc@lucene.apache.org>; Mon, 25 Jun 2018 06:22:10 + (UTC)
> Received: from mail-vk0-f71.google.com (mail-vk0-f71.google.com
> [209.85.213.71])
> by lxsmpr20.nam.pwcinternal.com (8.16.0.21/8.16.0.21) with ESMTPS
> id w5P6M3MF054491
> (version=TLSv1.2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128
> verify=OK)
> for  pwc@lucene.apache.org>; Mon, 25 Jun 2018 02:22:03 -0400
> Received: by mail-vk0-f71.google.com with SMTP id j123-v6so5886670vkc.4
> for  pwc@lucene.apache.org>; Sun, 24 Jun 2018 23:22:03 -0700 (PDT)
> X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
> d=1e100.net; s=20161025;
> h=x-gm-message-state:mime-version:in-reply-to:references:from:date
>  :message-id:subject:to;
> bh=+MKXiCktrcuycddIpUqd9ljQ2oLqYBsgU3qPgb6oZ2M=;
> b=q4Vku4HdqSxx2NyQ1G2GtPG7ahk5icEeT8jaTkyyVNW+
> yq9o1oxQoQnsDV

Re: WELCOME to solr-user@lucene.apache.org

2011-05-24 Thread Lord Khan Han
Hi ,

   Can I limit the terms that the HighlightComponent uses. My query is
generally long and I want specific ones to be highlighted and the rest is
not highlighted. Is there an option like the SpellCheckComponent. it uses q
unless spellcheck.q if specified. Is  a hl.q parameter possible?


Or any other tricky way to workaround ..


PS: I need this tomorrow (hopefully) to show my boss insisting some other
stupid well known  commercial search engines..


Regards


Re: WELCOME to solr-user@lucene.apache.org

2010-11-12 Thread Solr User
Ahmet,

Thanks for the reply.

select/?q=built+to+lastdefType=dismaxqf=searchFields^0.2+title^20debugQuery=on

For some reason if I use title field in my query I don't get any results.

I am copying all searchable fields into searchFields field. So I am able to
search only in the searchFields field not in any other fields.

I request you all to clarify if anything wrong with my schema.xml. The
schema.xml is at the bottom of this email.

I am not able to get the boosting working on the title field. Please help me
here too.

Thanks,
Solr User

On Thu, Nov 11, 2010 at 5:11 PM, Ahmet Arslan iori...@yahoo.com wrote:

 There are several mistakes in your approach:

 copyField just copies data. Index time boost is not copied.

 There is no such boosting syntax. /select?q=Eachtitle^9fl=score

 You are searching on your default field.

 This is not your cause of your problem but omitNorms=true disables index
 time boosts.

 http://wiki.apache.org/solr/DisMaxQParserPlugin can satisfy your need.


 --- On Thu, 11/11/10, Solr User solr...@gmail.com wrote:

  From: Solr User solr...@gmail.com
  Subject: Re: WELCOME to solr-user@lucene.apache.org
  To: solr-user@lucene.apache.org
  Date: Thursday, November 11, 2010, 11:54 PM
  Eric,
 
  Thank you so much for the reply and apologize for not
  providing all the
  details.
 
  The following are the field definitons in my schema.xml:
 
  field name=title type=string indexed=true
  stored=true
  omitNorms=false /
 
  field name=author type=string indexed=true
  stored=true
  multiValued=true omitNorms=true /
 
  field name=authortype type=string indexed=true
  stored=true
  multiValued=true omitNorms=true /
 
  field name=isbn13 type=string indexed=true
  stored=true /
 
  field name=isbn10 type=string indexed=true
  stored=true /
 
  field name=material type=string indexed=true
  stored=true /
 
  field name=pubdate type=string indexed=true
  stored=true /
 
  field name=pubyear type=string indexed=true
  stored=true /
 
  field name=reldate type=string indexed=false
  stored=true /
 
  field name=format type=string indexed=true
  stored=true /
 
  field name=pages type=string indexed=false
  stored=true /
 
  field name=desc type=string indexed=true
  stored=true /
 
  field name=series type=string indexed=true
  stored=true /
 
  field name=season type=string indexed=true
  stored=true /
 
  field name=imprint type=string indexed=true
  stored=true /
 
  field name=bisacsub type=string indexed=true
  stored=true
  multiValued=true omitNorms=true /
 
  field name=bisacstatus type=string indexed=false
  stored=true /
 
  field name=category type=string indexed=true
  stored=true
  multiValued=true omitNorms=true /
 
  field name=award type=string indexed=true
  stored=true
  multiValued=true omitNorms=true /
 
  field name=age type=string indexed=true
  stored=true /
 
  field name=reading type=string indexed=true
  stored=true /
 
  field name=grade type=string indexed=true
  stored=true /
 
  field name=path type=string indexed=false
  stored=true /
 
  field name=shortdesc type=string indexed=true
  stored=true /
 
  field name=subtitle type=string indexed=true
  stored=true
  omitNorms=true/
 
  field name=price type=float indexed=true
  stored=true/
 
  field name=searchFields type=textSpell
  indexed=true stored=true
  multiValued=true omitNorms=true/
 
  Copy Fields:
 
  copyField source=title dest=searchFields/
 
  copyField source=author dest=searchFields/
 
  copyField source=isbn13 dest=searchFields/
 
  copyField source=isbn10 dest=searchFields/
 
  copyField source=format dest=searchFields/
 
  copyField source=series dest=searchFields/
 
  copyField source=season dest=searchFields/
 
  copyField source=imprint dest=searchFields/
 
  copyField source=bisacsub dest=searchFields/
 
  copyField source=category dest=searchFields/
 
  copyField source=award dest=searchFields/
 
  copyField source=shortdesc dest=searchFields/
 
  copyField source=desc dest=searchFields/
 
  copyField source=subtitle dest=searchFields/
 
 
 
  defaultSearchFieldsearchFields/defaultSearchField
 
 
 
  Before creating the indexes I feed XML file to the Solr job
  to create index
  files. I added Boost attribute to the title field before
  creating indexes
  and an example is below:
 
  ?xml version=1.0 encoding=UTF-8
  standalone=no?adddocfield
  name=material1785440/fieldfield
  boost=10.0 name=titleEach Little
  Bird That Sings/fieldfield
  name=price16.0/fieldfield
  name=isbn100152051139/fieldfield
  name=isbn139780152051136/fieldfield
  name=formatHardcover/fieldfield
  name=pubdate2005-03-01/fieldfield
  name=pubyear2005/fieldfield
  name=reldate2005-02-22/fieldfield
  name=pages272/fieldfield
  name=bisacstatusActive/fieldfield
  name=seasonSpring
  2005/fieldfield
  name=imprintChildren's/fieldfield
  name=age8.0-12.0/fieldfield
  name=grade3-6/fieldfield
  name=authorMarla Frazee/fieldfield
  name=authortypeJacket
  Illustrator/fieldfield name=authorDeborah
  Wiles/fieldfield

Re: WELCOME to solr-user@lucene.apache.org

2010-11-12 Thread Ahmet Arslan
 select/?q=built+to+lastdefType=dismaxqf=searchFields^0.2+title^20debugQuery=on
 
 For some reason if I use title field in my query I don't
 get any results.
 
 I am copying all searchable fields into searchFields field.
 So I am able to
 search only in the searchFields field not in any other
 fields.
 
 I request you all to clarify if anything wrong with my
 schema.xml. The
 schema.xml is at the bottom of this email.
 
 I am not able to get the boosting working on the title
 field. Please help me
 here too.

Change type of your title field. It is string now. Make it solr.TextField. 
Actually you dont need cath-all copy field with dismax. 
Just change their types string to text and append them qf= parameter.


  


Re: WELCOME to solr-user@lucene.apache.org

2010-11-12 Thread Solr User
Ahmet,

In production system we are using

/spell/?q=built+to+last

so that we can check the spelling. We are not using /select?q=built+to+last

Can I use dismax with /spell?

I understood from your reply that I need to change my schema.xml and modify
the field types.

Do I need to still use the searchFields field and what do I need to specify
in the defaultSearchField tag?

searchFields is one of the field names that we provided.

Thanks,
Solr User


On Fri, Nov 12, 2010 at 10:26 AM, Ahmet Arslan iori...@yahoo.com wrote:

 
 select/?q=built+to+lastdefType=dismaxqf=searchFields^0.2+title^20debugQuery=on
 
  For some reason if I use title field in my query I don't
  get any results.
 
  I am copying all searchable fields into searchFields field.
  So I am able to
  search only in the searchFields field not in any other
  fields.
 
  I request you all to clarify if anything wrong with my
  schema.xml. The
  schema.xml is at the bottom of this email.
 
  I am not able to get the boosting working on the title
  field. Please help me
  here too.

 Change type of your title field. It is string now. Make it solr.TextField.
 Actually you dont need cath-all copy field with dismax.
 Just change their types string to text and append them qf= parameter.






Re: WELCOME to solr-user@lucene.apache.org

2010-11-12 Thread Ahmet Arslan
 /spell/?q=built+to+last
 
 so that we can check the spelling. We are not using
 /select?q=built+to+last
 
 Can I use dismax with /spell?

Yes you can.

 I understood from your reply that I need to change my
 schema.xml and modify
 the field types.

Correct. Make them full-text searchable. string type is not tokenized.

 Do I need to still use the searchFields field and what do I
 need to specify
 in the defaultSearchField tag?

Delete searchFields, you don't need it. Regarding defaultSearchField, it does 
not matter with dismax. Write any of your fields. For example title.
And play with other dismax parameters. In short dismax is the way to go if you 
are searching multiple fields.


  


Re: WELCOME to solr-user@lucene.apache.org

2010-11-11 Thread Solr User
Hi,

I have a question about boosting.

I have the following fields in my schema.xml:

1. title
2. description
3. ISBN

etc

I want to boost the field title. I tried index time boosting but it did not
work. I also tried Query time boosting but with no luck.

Can someone help me on how to implement boosting on a specific field like
title?

Thanks,
Solr User

On Thu, Nov 11, 2010 at 10:26 AM, solr-user-h...@lucene.apache.org wrote:

 Hi! This is the ezmlm program. I'm managing the
 solr-user@lucene.apache.org mailing list.

 I'm working for my owner, who can be reached
 at solr-user-ow...@lucene.apache.org.

 Acknowledgment: I have added the address

   solr...@gmail.com

 to the solr-user mailing list.

 Welcome to solr-u...@lucene.apache.org!

 Please save this message so that you know the address you are
 subscribed under, in case you later want to unsubscribe or change your
 subscription address.


 --- Administrative commands for the solr-user list ---

 I can handle administrative requests automatically. Please
 do not send them to the list address! Instead, send
 your message to the correct command address:

 To subscribe to the list, send a message to:
   solr-user-subscr...@lucene.apache.org

 To remove your address from the list, send a message to:
   solr-user-unsubscr...@lucene.apache.org

 Send mail to the following for info and FAQ for this list:
   solr-user-i...@lucene.apache.org
   solr-user-...@lucene.apache.org

 Similar addresses exist for the digest list:
   solr-user-digest-subscr...@lucene.apache.org
   solr-user-digest-unsubscr...@lucene.apache.org

 To get messages 123 through 145 (a maximum of 100 per request), mail:
   solr-user-get.123_...@lucene.apache.org

 To get an index with subject and author for messages 123-456 , mail:
   solr-user-index.123_...@lucene.apache.org

 They are always returned as sets of 100, max 2000 per request,
 so you'll actually get 100-499.

 To receive all messages with the same subject as message 12345,
 send a short message to:
   solr-user-thread.12...@lucene.apache.org

 The messages should contain one line or word of text to avoid being
 treated as s...@m, but I will ignore their content.
 Only the ADDRESS you send to is important.

 You can start a subscription for an alternate address,
 for example j...@host.domain, just add a hyphen and your
 address (with '=' instead of '@') after the command word:
 solr-user-subscribe-john=host.dom...@lucene.apache.org

 To stop subscription for this address, mail:
 solr-user-unsubscribe-john=host.dom...@lucene.apache.org

 In both cases, I'll send a confirmation message to that address. When
 you receive it, simply reply to it to complete your subscription.

 If despite following these instructions, you do not get the
 desired results, please contact my owner at
 solr-user-ow...@lucene.apache.org. Please be patient, my owner is a
 lot slower than I am ;-)

 --- Enclosed is a copy of the request I received.

 Return-Path: solr...@gmail.com
 Received: (qmail 48883 invoked by uid 99); 11 Nov 2010 15:26:44 -
 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230)
by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 Nov 2010 15:26:44
 +
 X-ASF-Spam-Status: No, hits=2.2 required=10.0

  
 tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL
 X-Spam-Check-By: apache.org
 Received-SPF: pass (nike.apache.org: domain of solr...@gmail.comdesignates 
 209.85.213.48 as permitted sender)
 Received: from [209.85.213.48] (HELO mail-yw0-f48.google.com)
 (209.85.213.48)
by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 Nov 2010 15:26:35
 +
 Received: by ywp4 with SMTP id 4so1394872ywp.35
for solr-user-sc.1289489103.apfngfdapdhadiahjfln-solrnew=gmail.com
 @lucene.apache.org; Thu, 11 Nov 2010 07:26:14 -0800 (PST)
 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
d=gmail.com; s=gamma;
h=domainkey-signature:mime-version:received:received:in-reply-to
 :references:date:message-id:subject:from:to:content-type;
bh=4KuKRrRVLjzTO4oB9/DNxMdQPfNQH2GnYznzPE6YqOo=;
b=l5lBfUYcyvipJn9SE+5j+t1XUmBjTtbyPYlRVj7jDb6G+W3NzQ21EHOowiD9rNH2L9

 gc2+6mGEZmRJOZQwpKD7SUQ2bXL9fVm7mVfS21TMAgC+ZsWQ3vvFOHXalWZa8dbtcOY7
 C23KauLY7YH1UfducfXL77J7u0/snEZl5jQ7A=
 DomainKey-Signature: a=rsa-sha1; c=nofws;
d=gmail.com; s=gamma;

  h=mime-version:in-reply-to:references:date:message-id:subject:from:to
 :content-type;
b=nb9+3a9bOHnjGO5T5BhMlW15adcafr+MPzvpgc5X5NXEUGCI05ViLho0SSoQP2Wp2i

 xp1Mfjrjw05umeKmHX23oeD5Idc2G6xgz8I3ZcJ1bUM+cD7c52cMKG2suE2VvhUHlfah
 z52rEtlqd0Q9fk/ZDWwR2DS7GoiVMRmgaWgD0=
 MIME-Version: 1.0
 Received: by 10.229.216.201 with SMTP id hj9mr877669qcb.58.1289489174123;
 Thu,
  11 Nov 2010 07:26:14 -0800 (PST)
 Received: by 10.229.66.165 with HTTP; Thu, 11 Nov 2010 07:26:14 -0800 (PST)
 In-Reply-To: 1289489103.46214.ez...@lucene.apache.org
 References: 1289489103.46214.ez...@lucene.apache.org
 

Re: WELCOME to solr-user@lucene.apache.org

2010-11-11 Thread Erick Erickson
There's not much to go on here. Boosting works,
and index time as opposed to query time boosting
addresses two different needs. Could you add some
detail? All you've really said is it didn't work, which
doesn't allow a very constructive response.

Perhaps you could review:
http://wiki.apache.org/solr/HowToContribute

Best
Erick



On Thu, Nov 11, 2010 at 10:32 AM, Solr User solr...@gmail.com wrote:

 Hi,

 I have a question about boosting.

 I have the following fields in my schema.xml:

 1. title
 2. description
 3. ISBN

 etc

 I want to boost the field title. I tried index time boosting but it did not
 work. I also tried Query time boosting but with no luck.

 Can someone help me on how to implement boosting on a specific field like
 title?

 Thanks,
 Solr User





Re: WELCOME to solr-user@lucene.apache.org

2010-11-11 Thread Solr User
Eric,

Thank you so much for the reply and apologize for not providing all the
details.

The following are the field definitons in my schema.xml:

field name=title type=string indexed=true stored=true
omitNorms=false /

field name=author type=string indexed=true stored=true
multiValued=true omitNorms=true /

field name=authortype type=string indexed=true stored=true
multiValued=true omitNorms=true /

field name=isbn13 type=string indexed=true stored=true /

field name=isbn10 type=string indexed=true stored=true /

field name=material type=string indexed=true stored=true /

field name=pubdate type=string indexed=true stored=true /

field name=pubyear type=string indexed=true stored=true /

field name=reldate type=string indexed=false stored=true /

field name=format type=string indexed=true stored=true /

field name=pages type=string indexed=false stored=true /

field name=desc type=string indexed=true stored=true /

field name=series type=string indexed=true stored=true /

field name=season type=string indexed=true stored=true /

field name=imprint type=string indexed=true stored=true /

field name=bisacsub type=string indexed=true stored=true
multiValued=true omitNorms=true /

field name=bisacstatus type=string indexed=false stored=true /

field name=category type=string indexed=true stored=true
multiValued=true omitNorms=true /

field name=award type=string indexed=true stored=true
multiValued=true omitNorms=true /

field name=age type=string indexed=true stored=true /

field name=reading type=string indexed=true stored=true /

field name=grade type=string indexed=true stored=true /

field name=path type=string indexed=false stored=true /

field name=shortdesc type=string indexed=true stored=true /

field name=subtitle type=string indexed=true stored=true
omitNorms=true/

field name=price type=float indexed=true stored=true/

field name=searchFields type=textSpell indexed=true stored=true
multiValued=true omitNorms=true/

Copy Fields:

copyField source=title dest=searchFields/

copyField source=author dest=searchFields/

copyField source=isbn13 dest=searchFields/

copyField source=isbn10 dest=searchFields/

copyField source=format dest=searchFields/

copyField source=series dest=searchFields/

copyField source=season dest=searchFields/

copyField source=imprint dest=searchFields/

copyField source=bisacsub dest=searchFields/

copyField source=category dest=searchFields/

copyField source=award dest=searchFields/

copyField source=shortdesc dest=searchFields/

copyField source=desc dest=searchFields/

copyField source=subtitle dest=searchFields/



defaultSearchFieldsearchFields/defaultSearchField



Before creating the indexes I feed XML file to the Solr job to create index
files. I added Boost attribute to the title field before creating indexes
and an example is below:

?xml version=1.0 encoding=UTF-8 standalone=no?adddocfield
name=material1785440/fieldfield boost=10.0 name=titleEach Little
Bird That Sings/fieldfield name=price16.0/fieldfield
name=isbn100152051139/fieldfield
name=isbn139780152051136/fieldfield
name=formatHardcover/fieldfield
name=pubdate2005-03-01/fieldfield name=pubyear2005/fieldfield
name=reldate2005-02-22/fieldfield name=pages272/fieldfield
name=bisacstatusActive/fieldfield name=seasonSpring
2005/fieldfield name=imprintChildren's/fieldfield
name=age8.0-12.0/fieldfield name=grade3-6/fieldfield
name=authorMarla Frazee/fieldfield name=authortypeJacket
Illustrator/fieldfield name=authorDeborah Wiles/fieldfield
name=authortypeAuthor/fieldfield name=bisacsubSocial
Issues/Friendship/fieldfield name=bisacsubSocial Issues/General (see
also headings under Family)/fieldfield
name=bisacsubGeneral/fieldfield name=bisacsubGirls amp;
Women/fieldfield name=categoryFiction/Middle Grade/fieldfield
name=categoryFiction/Award Winners/fieldfield name=categoryComing
of Age/fieldfield name=categorySocial Situations/Death amp;
Dying/fieldfield name=categorySocial
Situations/Friendship/fieldfield
name=path/assets/product/0152051139.gif/fieldfield
name=desclt;divgt;Ten-year-old Comfort Snowberger has attended 247
funerals. But that's not surprising, considering that her family runs the
town funeral home. And even though Great-uncle Edisto keeled over with a
heart attack and Great-great-aunt Florentine dropped dead--just like
that--six months later, Comfort knows how to deal with loss, or so she
thinks. She's more concerned with avoiding her crazy cousin Peach and trying
to figure out why her best friend, Declaration, suddenly won't talk to her.
Life is full of surprises. And the biggest one of all is learning what it
takes to handle them.lt;brgt; lt;brgt;Deborah Wiles has created a
unique, funny, and utterly real cast of characters in this heartfelt, and
quintessentially Southern coming-of-age novel. Comfort will charm young
readers with her wit, her warmth, and her struggles as she learns about
life, loss, and ultimately, triumph.lt;brgt;lt;/divgt;/fieldfield
name=shortdescTen-year-old Comfort Snowberger 

Re: WELCOME to solr-user@lucene.apache.org

2010-11-11 Thread Ahmet Arslan
There are several mistakes in your approach:

copyField just copies data. Index time boost is not copied.

There is no such boosting syntax. /select?q=Eachtitle^9fl=score

You are searching on your default field. 

This is not your cause of your problem but omitNorms=true disables index time 
boosts.

http://wiki.apache.org/solr/DisMaxQParserPlugin can satisfy your need.


--- On Thu, 11/11/10, Solr User solr...@gmail.com wrote:

 From: Solr User solr...@gmail.com
 Subject: Re: WELCOME to solr-user@lucene.apache.org
 To: solr-user@lucene.apache.org
 Date: Thursday, November 11, 2010, 11:54 PM
 Eric,
 
 Thank you so much for the reply and apologize for not
 providing all the
 details.
 
 The following are the field definitons in my schema.xml:
 
 field name=title type=string indexed=true
 stored=true
 omitNorms=false /
 
 field name=author type=string indexed=true
 stored=true
 multiValued=true omitNorms=true /
 
 field name=authortype type=string indexed=true
 stored=true
 multiValued=true omitNorms=true /
 
 field name=isbn13 type=string indexed=true
 stored=true /
 
 field name=isbn10 type=string indexed=true
 stored=true /
 
 field name=material type=string indexed=true
 stored=true /
 
 field name=pubdate type=string indexed=true
 stored=true /
 
 field name=pubyear type=string indexed=true
 stored=true /
 
 field name=reldate type=string indexed=false
 stored=true /
 
 field name=format type=string indexed=true
 stored=true /
 
 field name=pages type=string indexed=false
 stored=true /
 
 field name=desc type=string indexed=true
 stored=true /
 
 field name=series type=string indexed=true
 stored=true /
 
 field name=season type=string indexed=true
 stored=true /
 
 field name=imprint type=string indexed=true
 stored=true /
 
 field name=bisacsub type=string indexed=true
 stored=true
 multiValued=true omitNorms=true /
 
 field name=bisacstatus type=string indexed=false
 stored=true /
 
 field name=category type=string indexed=true
 stored=true
 multiValued=true omitNorms=true /
 
 field name=award type=string indexed=true
 stored=true
 multiValued=true omitNorms=true /
 
 field name=age type=string indexed=true
 stored=true /
 
 field name=reading type=string indexed=true
 stored=true /
 
 field name=grade type=string indexed=true
 stored=true /
 
 field name=path type=string indexed=false
 stored=true /
 
 field name=shortdesc type=string indexed=true
 stored=true /
 
 field name=subtitle type=string indexed=true
 stored=true
 omitNorms=true/
 
 field name=price type=float indexed=true
 stored=true/
 
 field name=searchFields type=textSpell
 indexed=true stored=true
 multiValued=true omitNorms=true/
 
 Copy Fields:
 
 copyField source=title dest=searchFields/
 
 copyField source=author dest=searchFields/
 
 copyField source=isbn13 dest=searchFields/
 
 copyField source=isbn10 dest=searchFields/
 
 copyField source=format dest=searchFields/
 
 copyField source=series dest=searchFields/
 
 copyField source=season dest=searchFields/
 
 copyField source=imprint dest=searchFields/
 
 copyField source=bisacsub dest=searchFields/
 
 copyField source=category dest=searchFields/
 
 copyField source=award dest=searchFields/
 
 copyField source=shortdesc dest=searchFields/
 
 copyField source=desc dest=searchFields/
 
 copyField source=subtitle dest=searchFields/
 
 
 
 defaultSearchFieldsearchFields/defaultSearchField
 
 
 
 Before creating the indexes I feed XML file to the Solr job
 to create index
 files. I added Boost attribute to the title field before
 creating indexes
 and an example is below:
 
 ?xml version=1.0 encoding=UTF-8
 standalone=no?adddocfield
 name=material1785440/fieldfield
 boost=10.0 name=titleEach Little
 Bird That Sings/fieldfield
 name=price16.0/fieldfield
 name=isbn100152051139/fieldfield
 name=isbn139780152051136/fieldfield
 name=formatHardcover/fieldfield
 name=pubdate2005-03-01/fieldfield
 name=pubyear2005/fieldfield
 name=reldate2005-02-22/fieldfield
 name=pages272/fieldfield
 name=bisacstatusActive/fieldfield
 name=seasonSpring
 2005/fieldfield
 name=imprintChildren's/fieldfield
 name=age8.0-12.0/fieldfield
 name=grade3-6/fieldfield
 name=authorMarla Frazee/fieldfield
 name=authortypeJacket
 Illustrator/fieldfield name=authorDeborah
 Wiles/fieldfield
 name=authortypeAuthor/fieldfield
 name=bisacsubSocial
 Issues/Friendship/fieldfield
 name=bisacsubSocial Issues/General (see
 also headings under Family)/fieldfield
 name=bisacsubGeneral/fieldfield
 name=bisacsubGirls amp;
 Women/fieldfield
 name=categoryFiction/Middle
 Grade/fieldfield
 name=categoryFiction/Award
 Winners/fieldfield name=categoryComing
 of Age/fieldfield name=categorySocial
 Situations/Death amp;
 Dying/fieldfield name=categorySocial
 Situations/Friendship/fieldfield
 name=path/assets/product/0152051139.gif/fieldfield
 name=desclt;divgt;Ten-year-old Comfort
 Snowberger has attended 247
 funerals. But that's not surprising, considering that her
 family runs the
 town funeral home. And even though Great-uncle Edisto

Re: WELCOME to solr-user@lucene.apache.org

2010-11-11 Thread Ramavtar Meena
Hi,

If you are looking for query time boosting on title field you can do
the following:
/select?q=title:android^10

Also unless you have a very good reason to use string for date data
(in your case pubdate and reldate), you should be using
solr.DateField.

regards,
Ram
On Fri, Nov 12, 2010 at 3:41 AM, Ahmet Arslan iori...@yahoo.com wrote:
 There are several mistakes in your approach:

 copyField just copies data. Index time boost is not copied.

 There is no such boosting syntax. /select?q=Eachtitle^9fl=score

 You are searching on your default field.

 This is not your cause of your problem but omitNorms=true disables index 
 time boosts.

 http://wiki.apache.org/solr/DisMaxQParserPlugin can satisfy your need.


 --- On Thu, 11/11/10, Solr User solr...@gmail.com wrote:

 From: Solr User solr...@gmail.com
 Subject: Re: WELCOME to solr-user@lucene.apache.org
 To: solr-user@lucene.apache.org
 Date: Thursday, November 11, 2010, 11:54 PM
 Eric,

 Thank you so much for the reply and apologize for not
 providing all the
 details.

 The following are the field definitons in my schema.xml:

 field name=title type=string indexed=true
 stored=true
 omitNorms=false /

 field name=author type=string indexed=true
 stored=true
 multiValued=true omitNorms=true /

 field name=authortype type=string indexed=true
 stored=true
 multiValued=true omitNorms=true /

 field name=isbn13 type=string indexed=true
 stored=true /

 field name=isbn10 type=string indexed=true
 stored=true /

 field name=material type=string indexed=true
 stored=true /

 field name=pubdate type=string indexed=true
 stored=true /

 field name=pubyear type=string indexed=true
 stored=true /

 field name=reldate type=string indexed=false
 stored=true /

 field name=format type=string indexed=true
 stored=true /

 field name=pages type=string indexed=false
 stored=true /

 field name=desc type=string indexed=true
 stored=true /

 field name=series type=string indexed=true
 stored=true /

 field name=season type=string indexed=true
 stored=true /

 field name=imprint type=string indexed=true
 stored=true /

 field name=bisacsub type=string indexed=true
 stored=true
 multiValued=true omitNorms=true /

 field name=bisacstatus type=string indexed=false
 stored=true /

 field name=category type=string indexed=true
 stored=true
 multiValued=true omitNorms=true /

 field name=award type=string indexed=true
 stored=true
 multiValued=true omitNorms=true /

 field name=age type=string indexed=true
 stored=true /

 field name=reading type=string indexed=true
 stored=true /

 field name=grade type=string indexed=true
 stored=true /

 field name=path type=string indexed=false
 stored=true /

 field name=shortdesc type=string indexed=true
 stored=true /

 field name=subtitle type=string indexed=true
 stored=true
 omitNorms=true/

 field name=price type=float indexed=true
 stored=true/

 field name=searchFields type=textSpell
 indexed=true stored=true
 multiValued=true omitNorms=true/

 Copy Fields:

 copyField source=title dest=searchFields/

 copyField source=author dest=searchFields/

 copyField source=isbn13 dest=searchFields/

 copyField source=isbn10 dest=searchFields/

 copyField source=format dest=searchFields/

 copyField source=series dest=searchFields/

 copyField source=season dest=searchFields/

 copyField source=imprint dest=searchFields/

 copyField source=bisacsub dest=searchFields/

 copyField source=category dest=searchFields/

 copyField source=award dest=searchFields/

 copyField source=shortdesc dest=searchFields/

 copyField source=desc dest=searchFields/

 copyField source=subtitle dest=searchFields/



 defaultSearchFieldsearchFields/defaultSearchField



 Before creating the indexes I feed XML file to the Solr job
 to create index
 files. I added Boost attribute to the title field before
 creating indexes
 and an example is below:

 ?xml version=1.0 encoding=UTF-8
 standalone=no?adddocfield
 name=material1785440/fieldfield
 boost=10.0 name=titleEach Little
 Bird That Sings/fieldfield
 name=price16.0/fieldfield
 name=isbn100152051139/fieldfield
 name=isbn139780152051136/fieldfield
 name=formatHardcover/fieldfield
 name=pubdate2005-03-01/fieldfield
 name=pubyear2005/fieldfield
 name=reldate2005-02-22/fieldfield
 name=pages272/fieldfield
 name=bisacstatusActive/fieldfield
 name=seasonSpring
 2005/fieldfield
 name=imprintChildren's/fieldfield
 name=age8.0-12.0/fieldfield
 name=grade3-6/fieldfield
 name=authorMarla Frazee/fieldfield
 name=authortypeJacket
 Illustrator/fieldfield name=authorDeborah
 Wiles/fieldfield
 name=authortypeAuthor/fieldfield
 name=bisacsubSocial
 Issues/Friendship/fieldfield
 name=bisacsubSocial Issues/General (see
 also headings under Family)/fieldfield
 name=bisacsubGeneral/fieldfield
 name=bisacsubGirls 
 Women/fieldfield
 name=categoryFiction/Middle
 Grade/fieldfield
 name=categoryFiction/Award
 Winners/fieldfield name=categoryComing
 of Age/fieldfield name=categorySocial
 Situations/Death 
 Dying/fieldfield name

Re: WELCOME to solr-user@lucene.apache.org

2009-12-08 Thread Chris Hostetter

(FYI: in the future please start a new thread with an approriate subject 
line when you ask questions -- you probably would have gotten a lot more 
responses fro people interested in Tika and SolrCell if they could tell 
that this email was about SolrCell)

: I found that Tika read the html and extract metadata like meta name=id
: content=12 from my htmls but my documents has an already an id setted by
: literal.id=10.
: 
: I tried to map the id from Tika by fmap.id=ignored_ but it ignore also my
: literal.id

H, yeah: that seems like  an odd order of operations, but it's 
documented on the wiki so evidently it's intentional...

http://wiki.apache.org/solr/ExtractingRequestHandler#Order_of_field_operations

my best sugguestions:

 * use the capture param to restrict what gets extracted (it's probably
possible to write an XPath query that selects everything *except* 
metadata[id])
 * change the name of your uniqueKey field to be something other then id 
so it's less likely to collide with a value from the document.

I also opened two Jira issues that you may want to post comments in...

https://issues.apache.org/jira/browse/SOLR-1633
https://issues.apache.org/jira/browse/SOLR-1634


-Hoss



Re: WELCOME to solr-user@lucene.apache.org

2009-12-05 Thread Raghuveer Kancherla
2 ways I can think of ...

   - ExtractingRequestHandler (this is what I am guessing you are using now)

Set extractOnly=true while making a request to the extractingRequestHandler
and get the parsed content back. Now make a post request on update request
handler with what ever fields and field values you want.


   - Use HTMLStripWhiteSpaceTokenizer factory. This article may be helpful
   to explain what I mean.
   
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.HTMLStripWhitespaceTokenizerFactory.



- Raghu



On Sat, Dec 5, 2009 at 3:44 AM, khalid y kern...@gmail.com wrote:

 Hi,

 I have a problem with solr. I'm indexing some html content and solr crash
 because my id field is multivalued.
 I found that Tika read the html and extract metadata like meta name=id
 content=12 from my htmls but my documents has an already an id setted by
 literal.id=10.

 I tried to map the id from Tika by fmap.id=ignored_ but it ignore also my
 literal.id

 I'm using solr 1.4 and tika 0.5

 Someone can explain to me how I can ignore this the Tika id metadata ??

 Thanks



Re: WELCOME to solr-user@lucene.apache.org

2009-12-05 Thread khalid y
Thanks a lot for you response !!

For the first solution :

I need to index all the content of my websites and I want just tika ignore
meta name=id because I have already an id
I'll try monday and tell you if it works

The second solution :
Are your sure Tika use the HTML Tokenizer ? I'll check

2009/12/5 Raghuveer Kancherla raghuveer.kanche...@aplopio.com

 2 ways I can think of ...

   - ExtractingRequestHandler (this is what I am guessing you are using now)

 Set extractOnly=true while making a request to the extractingRequestHandler
 and get the parsed content back. Now make a post request on update request
 handler with what ever fields and field values you want.





   - Use HTMLStripWhiteSpaceTokenizer factory. This article may be helpful
   to explain what I mean.

 http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.HTMLStripWhitespaceTokenizerFactory
 .



 - Raghu



 On Sat, Dec 5, 2009 at 3:44 AM, khalid y kern...@gmail.com wrote:

  Hi,
 
  I have a problem with solr. I'm indexing some html content and solr crash
  because my id field is multivalued.
  I found that Tika read the html and extract metadata like meta name=id
  content=12 from my htmls but my documents has an already an id setted
 by
  literal.id=10.
 
  I tried to map the id from Tika by fmap.id=ignored_ but it ignore also
 my
  literal.id
 
  I'm using solr 1.4 and tika 0.5
 
  Someone can explain to me how I can ignore this the Tika id metadata ??
 
  Thanks
 



Re: WELCOME to solr-user@lucene.apache.org

2008-01-08 Thread Jan Buelens
Hi,

We are currently using Solr as search engine.
To add an existing website to our search engine, we are investigating Nutch.

Does anyone have more information / experience about an integration between
Solr and Nutch?

Thanks in advance !


Best regards,
Jan


Re: WELCOME to solr-user@lucene.apache.org

2008-01-08 Thread Ryan McKinley

currently two approaches:

http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html
and:
https://issues.apache.org/jira/browse/NUTCH-442

I have had experience with the former... you may have more luck on the 
nutch-user list for help


ryan


Jan Buelens wrote:

Hi,

We are currently using Solr as search engine.
To add an existing website to our search engine, we are investigating Nutch.

Does anyone have more information / experience about an integration between
Solr and Nutch?

Thanks in advance !


Best regards,
Jan