Re: WELCOME to solr-user@lucene.apache.org
In short, nothing that’s maintained as part of the Apache project. There may be commercial products, but I haven’t had occasion to look for one. Best, Erick > On Oct 20, 2019, at 7:42 AM, Wasim S Kazi wrote: > > Good day > > I would like to get some info or confirmation about configuring Solr 8+ to > get content from WCM (Websphere Content Management) > > Essentially, we have manually index data from WCM into Solr and this all > works fine. We want to now automate this process, so checking is there is any > well established integration method between WCM and Solr. This integration > should allow content being indexed automatically, or periodically without > human intervention. > > Regards > Wasim Kazi > > -Original Message- > From: solr-user-h...@lucene.apache.org > Sent: Sunday, October 20, 2019 2:39 PM > To: Wasim S Kazi > Subject: WELCOME to solr-user@lucene.apache.org > > Hi! This is the ezmlm program. I'm managing the solr-user@lucene.apache.org > mailing list. > > I'm working for my owner, who can be reached at > solr-user-ow...@lucene.apache.org. > > Acknowledgment: I have added the address > > wasim.s.k...@za.ey.com > > to the solr-user mailing list. > > Welcome to solr-user@lucene.apache.org! > > Please save this message so that you know the address you are subscribed > under, in case you later want to unsubscribe or change your subscription > address. > > > --- Administrative commands for the solr-user list --- > > I can handle administrative requests automatically. Please do not send them > to the list address! Instead, send your message to the correct command > address: > > To subscribe to the list, send a message to: > > > To remove your address from the list, send a message to: > > > Send mail to the following for info and FAQ for this list: > > > > Similar addresses exist for the digest list: > > > > To get messages 123 through 145 (a maximum of 100 per request), mail: > > > To get an index with subject and author for messages 123-456 , mail: > > > They are always returned as sets of 100, max 2000 per request, so you'll > actually get 100-499. > > To receive all messages with the same subject as message 12345, send a short > message to: > > > The messages should contain one line or word of text to avoid being treated > as sp@m, but I will ignore their content. > Only the ADDRESS you send to is important. > > You can start a subscription for an alternate address, for example > "john@host.domain", just add a hyphen and your address (with '=' instead of > '@') after the command word: > > > To stop subscription for this address, mail: > > > In both cases, I'll send a confirmation message to that address. When you > receive it, simply reply to it to complete your subscription. > > If despite following these instructions, you do not get the desired results, > please contact my owner at solr-user-ow...@lucene.apache.org. Please be > patient, my owner is a lot slower than I am ;-) > > --- Enclosed is a copy of the request I received. > > Return-Path: > Received: (qmail 96582 invoked by uid 99); 20 Oct 2019 11:38:52 - > Received: from pnap-us-west-generic-nat.apache.org (HELO > spamd1-us-west.apache.org) (209.188.14.142) >by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 20 Oct 2019 11:38:52 + > Received: from localhost (localhost [127.0.0.1]) >by spamd1-us-west.apache.org (ASF Mail Server at > spamd1-us-west.apache.org) with ESMTP id 81232C0C8E >for > ; > Sun, 20 Oct 2019 11:38:51 + (UTC) > X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org > X-Spam-Flag: NO > X-Spam-Score: -4.8 > X-Spam-Level: > X-Spam-Status: No, score=-4.8 tagged_above=-999 required=6.31 >tests=[HTML_FONT_LOW_CONTRAST=0.001, HTML_MESSAGE=0.2, >KAM_SHORT=0.001, RCVD_IN_DNSWL_HI=-5, SPF_HELO_PASS=-0.001, >SPF_PASS=-0.001] autolearn=disabled > Received: from mx1-he-de.apache.org ([10.40.0.8]) >by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, > port 10024) >with ESMTP id Kbk25gxC2elm >for > ; >Sun, 20 Oct 2019 11:38:50 + (UTC) > Received-SPF: Pass (mailfrom) identity=mailfrom; client-ip=199.49.1.52; > helo=em01.ey.com; envelope-from=wasim.s.k...@za.ey.com; receiver= > Received: from em01.ey.com (em01.ey.com [199.49.1.52]) >by mx1-he-de.apache.org (ASF Mail Server at mx1-he-de.apache.org) with > ESMTPS id 86E307DDFA >for > ; > Sun, 20 Oct 2019 11:38:49 + (UTC) > IronPort-SDR: > 0i+SrmLgncBfCsgon
RE: WELCOME to solr-user@lucene.apache.org
Good day I would like to get some info or confirmation about configuring Solr 8+ to get content from WCM (Websphere Content Management) Essentially, we have manually index data from WCM into Solr and this all works fine. We want to now automate this process, so checking is there is any well established integration method between WCM and Solr. This integration should allow content being indexed automatically, or periodically without human intervention. Regards Wasim Kazi -Original Message- From: solr-user-h...@lucene.apache.org Sent: Sunday, October 20, 2019 2:39 PM To: Wasim S Kazi Subject: WELCOME to solr-user@lucene.apache.org Hi! This is the ezmlm program. I'm managing the solr-user@lucene.apache.org mailing list. I'm working for my owner, who can be reached at solr-user-ow...@lucene.apache.org. Acknowledgment: I have added the address wasim.s.k...@za.ey.com to the solr-user mailing list. Welcome to solr-user@lucene.apache.org! Please save this message so that you know the address you are subscribed under, in case you later want to unsubscribe or change your subscription address. --- Administrative commands for the solr-user list --- I can handle administrative requests automatically. Please do not send them to the list address! Instead, send your message to the correct command address: To subscribe to the list, send a message to: To remove your address from the list, send a message to: Send mail to the following for info and FAQ for this list: Similar addresses exist for the digest list: To get messages 123 through 145 (a maximum of 100 per request), mail: To get an index with subject and author for messages 123-456 , mail: They are always returned as sets of 100, max 2000 per request, so you'll actually get 100-499. To receive all messages with the same subject as message 12345, send a short message to: The messages should contain one line or word of text to avoid being treated as sp@m, but I will ignore their content. Only the ADDRESS you send to is important. You can start a subscription for an alternate address, for example "john@host.domain", just add a hyphen and your address (with '=' instead of '@') after the command word: To stop subscription for this address, mail: In both cases, I'll send a confirmation message to that address. When you receive it, simply reply to it to complete your subscription. If despite following these instructions, you do not get the desired results, please contact my owner at solr-user-ow...@lucene.apache.org. Please be patient, my owner is a lot slower than I am ;-) --- Enclosed is a copy of the request I received. Return-Path: Received: (qmail 96582 invoked by uid 99); 20 Oct 2019 11:38:52 - Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 20 Oct 2019 11:38:52 + Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 81232C0C8E for ; Sun, 20 Oct 2019 11:38:51 + (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -4.8 X-Spam-Level: X-Spam-Status: No, score=-4.8 tagged_above=-999 required=6.31 tests=[HTML_FONT_LOW_CONTRAST=0.001, HTML_MESSAGE=0.2, KAM_SHORT=0.001, RCVD_IN_DNSWL_HI=-5, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001] autolearn=disabled Received: from mx1-he-de.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id Kbk25gxC2elm for ; Sun, 20 Oct 2019 11:38:50 + (UTC) Received-SPF: Pass (mailfrom) identity=mailfrom; client-ip=199.49.1.52; helo=em01.ey.com; envelope-from=wasim.s.k...@za.ey.com; receiver= Received: from em01.ey.com (em01.ey.com [199.49.1.52]) by mx1-he-de.apache.org (ASF Mail Server at mx1-he-de.apache.org) with ESMTPS id 86E307DDFA for ; Sun, 20 Oct 2019 11:38:49 + (UTC) IronPort-SDR: 0i+SrmLgncBfCsgonKDgt+Ll+5TCuN/hbDHsUS1V98D3LWk4dgqQE9qJPrbcZyYjLWRYXieztn Fjky8vaAREXw== X-IronPort-AV: E=Sophos;i="5.67,319,1566864000"; d="gif'147?scan'147,208,217,147";a="240843155" Received: from unknown (HELO DERUSRMPEXTP02.ey.net) ([10.151.33.58]) by defrakaeyip01.eurw.ey.net with ESMTP; 20 Oct 2019 11:38:42 + ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=Em+4qSC0AqZ4Ei+nYLvNi3BwVnwrjtXdFD2W5lnj3CNDBO0x9JJBOn5yWMUj4JNnCnhg4R524D5O+lX6dYrYut/tTe09g0pnRemmla9J7icpboVqK6i5gXJLHLFA9dERNQwRDieNKqKEkei0eIbCzLMJeVld1lvj7CJiXIZPZIySU5hHZI7N5+Q9i1eb4GRYxATio7ibfxNknvf3/2298wyUhY9EuQEEuTWNrylkhMtQORgdlgv+mEdpzGJO+FaiG0fv1MQ0TO8JcgybSjJ14hG7xYlhkGEO39qzV7Q9EDbsPwJuupwZg/r4XAIIZ0Bjc0f7YX11S2BhnV8mdm+T+A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d
Re: WELCOME to solr-user@lucene.apache.org
First, understand that this list is maintained by volunteers, so answers aren't guaranteed. If you require dedicated support there are various organizations that provide same, but you'll have to contact them. That said, the community is quite responsive, just post questions to solr-user like this one. Best, Erick On Sun, Jun 24, 2018 at 11:35 PM, Srinivas Muppu (US) wrote: > Hi Solr Team, > > We are facing Solr System Configuration issues which needs help. Please let > us know whom to post our Questions/Queries. > > Thanks, > Srinivas > > On Mon, Jun 25, 2018 at 2:22 AM, wrote: > >> Hi! This is the ezmlm program. I'm managing the >> solr-user@lucene.apache.org mailing list. >> >> I'm working for my owner, who can be reached >> at solr-user-ow...@lucene.apache.org. >> >> Acknowledgment: I have added the address >> >> srinivas.mu...@pwc.com >> >> to the solr-user mailing list. >> >> Welcome to solr-user@lucene.apache.org! >> >> Please save this message so that you know the address you are >> subscribed under, in case you later want to unsubscribe or change your >> subscription address. >> >> >> --- Administrative commands for the solr-user list --- >> >> I can handle administrative requests automatically. Please >> do not send them to the list address! Instead, send >> your message to the correct command address: >> >> To subscribe to the list, send a message to: >> >> >> To remove your address from the list, send a message to: >> >> >> Send mail to the following for info and FAQ for this list: >> >> >> >> Similar addresses exist for the digest list: >> >> >> >> To get messages 123 through 145 (a maximum of 100 per request), mail: >> >> >> To get an index with subject and author for messages 123-456 , mail: >> >> >> They are always returned as sets of 100, max 2000 per request, >> so you'll actually get 100-499. >> >> To receive all messages with the same subject as message 12345, >> send a short message to: >> >> >> The messages should contain one line or word of text to avoid being >> treated as sp@m, but I will ignore their content. >> Only the ADDRESS you send to is important. >> >> You can start a subscription for an alternate address, >> for example "john@host.domain", just add a hyphen and your >> address (with '=' instead of '@') after the command word: >> >> >> To stop subscription for this address, mail: >> >> >> In both cases, I'll send a confirmation message to that address. When >> you receive it, simply reply to it to complete your subscription. >> >> If despite following these instructions, you do not get the >> desired results, please contact my owner at >> solr-user-ow...@lucene.apache.org. Please be patient, my owner is a >> lot slower than I am ;-) >> >> --- Enclosed is a copy of the request I received. >> >> Return-Path: >> Received: (qmail 84164 invoked by uid 99); 25 Jun 2018 06:22:12 - >> Received: from pnap-us-west-generic-nat.apache.org (HELO >> spamd1-us-west.apache.org) (209.188.14.142) >> by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 25 Jun 2018 06:22:12 >> + >> Received: from localhost (localhost [127.0.0.1]) >> by spamd1-us-west.apache.org (ASF Mail Server at >> spamd1-us-west.apache.org) with ESMTP id 63CB9CA4A5 >> for > pwc@lucene.apache.org>; Mon, 25 Jun 2018 06:22:12 + (UTC) >> X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org >> X-Spam-Flag: NO >> X-Spam-Score: -1 >> X-Spam-Level: >> X-Spam-Status: No, score=-1 tagged_above=-999 required=6.31 >> tests=[HTML_MESSAGE=2, KAM_BADIPHTTP=2, KAM_SHORT=0.001, >> NORMAL_HTTP_TO_IP=0.001, RCVD_IN_DNSWL_HI=-5, SPF_HELO_PASS=-0.001, >> SPF_PASS=-0.001] autolearn=disabled >> Received: from mx1-lw-us.apache.org ([10.40.0.8]) >> by localhost (spamd1-us-west.apache.org [10.40.0.7]) >> (amavisd-new, port 10024) >> with ESMTP id NuBVNjDIIyqW >> for > pwc@lucene.apache.org>; >> Mon, 25 Jun 2018 06:22:10 + (UTC) >> Received: from lxsmpr20.pwc.com (lxsmpr20.pwc.com [155.201.248.112]) >> by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) >> with ESMTPS id 500895F1B4 >> for > pwc@lucene.apache.org>; Mon, 25 Jun 2018 06:22:10 + (UTC) >> Received: from mail-vk0-f71.google.com (m
Re: WELCOME to solr-user@lucene.apache.org
Hi Solr Team, We are facing Solr System Configuration issues which needs help. Please let us know whom to post our Questions/Queries. Thanks, Srinivas On Mon, Jun 25, 2018 at 2:22 AM, wrote: > Hi! This is the ezmlm program. I'm managing the > solr-user@lucene.apache.org mailing list. > > I'm working for my owner, who can be reached > at solr-user-ow...@lucene.apache.org. > > Acknowledgment: I have added the address > >srinivas.mu...@pwc.com > > to the solr-user mailing list. > > Welcome to solr-user@lucene.apache.org! > > Please save this message so that you know the address you are > subscribed under, in case you later want to unsubscribe or change your > subscription address. > > > --- Administrative commands for the solr-user list --- > > I can handle administrative requests automatically. Please > do not send them to the list address! Instead, send > your message to the correct command address: > > To subscribe to the list, send a message to: > > > To remove your address from the list, send a message to: > > > Send mail to the following for info and FAQ for this list: > > > > Similar addresses exist for the digest list: > > > > To get messages 123 through 145 (a maximum of 100 per request), mail: > > > To get an index with subject and author for messages 123-456 , mail: > > > They are always returned as sets of 100, max 2000 per request, > so you'll actually get 100-499. > > To receive all messages with the same subject as message 12345, > send a short message to: > > > The messages should contain one line or word of text to avoid being > treated as sp@m, but I will ignore their content. > Only the ADDRESS you send to is important. > > You can start a subscription for an alternate address, > for example "john@host.domain", just add a hyphen and your > address (with '=' instead of '@') after the command word: > > > To stop subscription for this address, mail: > > > In both cases, I'll send a confirmation message to that address. When > you receive it, simply reply to it to complete your subscription. > > If despite following these instructions, you do not get the > desired results, please contact my owner at > solr-user-ow...@lucene.apache.org. Please be patient, my owner is a > lot slower than I am ;-) > > --- Enclosed is a copy of the request I received. > > Return-Path: > Received: (qmail 84164 invoked by uid 99); 25 Jun 2018 06:22:12 - > Received: from pnap-us-west-generic-nat.apache.org (HELO > spamd1-us-west.apache.org) (209.188.14.142) > by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 25 Jun 2018 06:22:12 > + > Received: from localhost (localhost [127.0.0.1]) > by spamd1-us-west.apache.org (ASF Mail Server at > spamd1-us-west.apache.org) with ESMTP id 63CB9CA4A5 > for pwc@lucene.apache.org>; Mon, 25 Jun 2018 06:22:12 + (UTC) > X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org > X-Spam-Flag: NO > X-Spam-Score: -1 > X-Spam-Level: > X-Spam-Status: No, score=-1 tagged_above=-999 required=6.31 > tests=[HTML_MESSAGE=2, KAM_BADIPHTTP=2, KAM_SHORT=0.001, > NORMAL_HTTP_TO_IP=0.001, RCVD_IN_DNSWL_HI=-5, SPF_HELO_PASS=-0.001, > SPF_PASS=-0.001] autolearn=disabled > Received: from mx1-lw-us.apache.org ([10.40.0.8]) > by localhost (spamd1-us-west.apache.org [10.40.0.7]) > (amavisd-new, port 10024) > with ESMTP id NuBVNjDIIyqW > for pwc@lucene.apache.org>; > Mon, 25 Jun 2018 06:22:10 + (UTC) > Received: from lxsmpr20.pwc.com (lxsmpr20.pwc.com [155.201.248.112]) > by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) > with ESMTPS id 500895F1B4 > for pwc@lucene.apache.org>; Mon, 25 Jun 2018 06:22:10 + (UTC) > Received: from mail-vk0-f71.google.com (mail-vk0-f71.google.com > [209.85.213.71]) > by lxsmpr20.nam.pwcinternal.com (8.16.0.21/8.16.0.21) with ESMTPS > id w5P6M3MF054491 > (version=TLSv1.2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128 > verify=OK) > for pwc@lucene.apache.org>; Mon, 25 Jun 2018 02:22:03 -0400 > Received: by mail-vk0-f71.google.com with SMTP id j123-v6so5886670vkc.4 > for pwc@lucene.apache.org>; Sun, 24 Jun 2018 23:22:03 -0700 (PDT) > X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; > d=1e100.net; s=20161025; > h=x-gm-message-state:mime-version:in-reply-to:references:from:date > :message-id:subject:to; > bh=+MKXiCktrcuycddIpUqd9ljQ2oLqYBsgU3qPgb6oZ2M=; > b=q4Vku4HdqSxx2NyQ1G2GtPG7ahk5icEeT8jaTkyyVNW+ > yq9o1oxQoQnsDV
Re: WELCOME to solr-user@lucene.apache.org
Hi , Can I limit the terms that the HighlightComponent uses. My query is generally long and I want specific ones to be highlighted and the rest is not highlighted. Is there an option like the SpellCheckComponent. it uses q unless spellcheck.q if specified. Is a hl.q parameter possible? Or any other tricky way to workaround .. PS: I need this tomorrow (hopefully) to show my boss insisting some other stupid well known commercial search engines.. Regards
Re: WELCOME to solr-user@lucene.apache.org
Ahmet, Thanks for the reply. select/?q=built+to+lastdefType=dismaxqf=searchFields^0.2+title^20debugQuery=on For some reason if I use title field in my query I don't get any results. I am copying all searchable fields into searchFields field. So I am able to search only in the searchFields field not in any other fields. I request you all to clarify if anything wrong with my schema.xml. The schema.xml is at the bottom of this email. I am not able to get the boosting working on the title field. Please help me here too. Thanks, Solr User On Thu, Nov 11, 2010 at 5:11 PM, Ahmet Arslan iori...@yahoo.com wrote: There are several mistakes in your approach: copyField just copies data. Index time boost is not copied. There is no such boosting syntax. /select?q=Eachtitle^9fl=score You are searching on your default field. This is not your cause of your problem but omitNorms=true disables index time boosts. http://wiki.apache.org/solr/DisMaxQParserPlugin can satisfy your need. --- On Thu, 11/11/10, Solr User solr...@gmail.com wrote: From: Solr User solr...@gmail.com Subject: Re: WELCOME to solr-user@lucene.apache.org To: solr-user@lucene.apache.org Date: Thursday, November 11, 2010, 11:54 PM Eric, Thank you so much for the reply and apologize for not providing all the details. The following are the field definitons in my schema.xml: field name=title type=string indexed=true stored=true omitNorms=false / field name=author type=string indexed=true stored=true multiValued=true omitNorms=true / field name=authortype type=string indexed=true stored=true multiValued=true omitNorms=true / field name=isbn13 type=string indexed=true stored=true / field name=isbn10 type=string indexed=true stored=true / field name=material type=string indexed=true stored=true / field name=pubdate type=string indexed=true stored=true / field name=pubyear type=string indexed=true stored=true / field name=reldate type=string indexed=false stored=true / field name=format type=string indexed=true stored=true / field name=pages type=string indexed=false stored=true / field name=desc type=string indexed=true stored=true / field name=series type=string indexed=true stored=true / field name=season type=string indexed=true stored=true / field name=imprint type=string indexed=true stored=true / field name=bisacsub type=string indexed=true stored=true multiValued=true omitNorms=true / field name=bisacstatus type=string indexed=false stored=true / field name=category type=string indexed=true stored=true multiValued=true omitNorms=true / field name=award type=string indexed=true stored=true multiValued=true omitNorms=true / field name=age type=string indexed=true stored=true / field name=reading type=string indexed=true stored=true / field name=grade type=string indexed=true stored=true / field name=path type=string indexed=false stored=true / field name=shortdesc type=string indexed=true stored=true / field name=subtitle type=string indexed=true stored=true omitNorms=true/ field name=price type=float indexed=true stored=true/ field name=searchFields type=textSpell indexed=true stored=true multiValued=true omitNorms=true/ Copy Fields: copyField source=title dest=searchFields/ copyField source=author dest=searchFields/ copyField source=isbn13 dest=searchFields/ copyField source=isbn10 dest=searchFields/ copyField source=format dest=searchFields/ copyField source=series dest=searchFields/ copyField source=season dest=searchFields/ copyField source=imprint dest=searchFields/ copyField source=bisacsub dest=searchFields/ copyField source=category dest=searchFields/ copyField source=award dest=searchFields/ copyField source=shortdesc dest=searchFields/ copyField source=desc dest=searchFields/ copyField source=subtitle dest=searchFields/ defaultSearchFieldsearchFields/defaultSearchField Before creating the indexes I feed XML file to the Solr job to create index files. I added Boost attribute to the title field before creating indexes and an example is below: ?xml version=1.0 encoding=UTF-8 standalone=no?adddocfield name=material1785440/fieldfield boost=10.0 name=titleEach Little Bird That Sings/fieldfield name=price16.0/fieldfield name=isbn100152051139/fieldfield name=isbn139780152051136/fieldfield name=formatHardcover/fieldfield name=pubdate2005-03-01/fieldfield name=pubyear2005/fieldfield name=reldate2005-02-22/fieldfield name=pages272/fieldfield name=bisacstatusActive/fieldfield name=seasonSpring 2005/fieldfield name=imprintChildren's/fieldfield name=age8.0-12.0/fieldfield name=grade3-6/fieldfield name=authorMarla Frazee/fieldfield name=authortypeJacket Illustrator/fieldfield name=authorDeborah Wiles/fieldfield
Re: WELCOME to solr-user@lucene.apache.org
select/?q=built+to+lastdefType=dismaxqf=searchFields^0.2+title^20debugQuery=on For some reason if I use title field in my query I don't get any results. I am copying all searchable fields into searchFields field. So I am able to search only in the searchFields field not in any other fields. I request you all to clarify if anything wrong with my schema.xml. The schema.xml is at the bottom of this email. I am not able to get the boosting working on the title field. Please help me here too. Change type of your title field. It is string now. Make it solr.TextField. Actually you dont need cath-all copy field with dismax. Just change their types string to text and append them qf= parameter.
Re: WELCOME to solr-user@lucene.apache.org
Ahmet, In production system we are using /spell/?q=built+to+last so that we can check the spelling. We are not using /select?q=built+to+last Can I use dismax with /spell? I understood from your reply that I need to change my schema.xml and modify the field types. Do I need to still use the searchFields field and what do I need to specify in the defaultSearchField tag? searchFields is one of the field names that we provided. Thanks, Solr User On Fri, Nov 12, 2010 at 10:26 AM, Ahmet Arslan iori...@yahoo.com wrote: select/?q=built+to+lastdefType=dismaxqf=searchFields^0.2+title^20debugQuery=on For some reason if I use title field in my query I don't get any results. I am copying all searchable fields into searchFields field. So I am able to search only in the searchFields field not in any other fields. I request you all to clarify if anything wrong with my schema.xml. The schema.xml is at the bottom of this email. I am not able to get the boosting working on the title field. Please help me here too. Change type of your title field. It is string now. Make it solr.TextField. Actually you dont need cath-all copy field with dismax. Just change their types string to text and append them qf= parameter.
Re: WELCOME to solr-user@lucene.apache.org
/spell/?q=built+to+last so that we can check the spelling. We are not using /select?q=built+to+last Can I use dismax with /spell? Yes you can. I understood from your reply that I need to change my schema.xml and modify the field types. Correct. Make them full-text searchable. string type is not tokenized. Do I need to still use the searchFields field and what do I need to specify in the defaultSearchField tag? Delete searchFields, you don't need it. Regarding defaultSearchField, it does not matter with dismax. Write any of your fields. For example title. And play with other dismax parameters. In short dismax is the way to go if you are searching multiple fields.
Re: WELCOME to solr-user@lucene.apache.org
Hi, I have a question about boosting. I have the following fields in my schema.xml: 1. title 2. description 3. ISBN etc I want to boost the field title. I tried index time boosting but it did not work. I also tried Query time boosting but with no luck. Can someone help me on how to implement boosting on a specific field like title? Thanks, Solr User On Thu, Nov 11, 2010 at 10:26 AM, solr-user-h...@lucene.apache.org wrote: Hi! This is the ezmlm program. I'm managing the solr-user@lucene.apache.org mailing list. I'm working for my owner, who can be reached at solr-user-ow...@lucene.apache.org. Acknowledgment: I have added the address solr...@gmail.com to the solr-user mailing list. Welcome to solr-u...@lucene.apache.org! Please save this message so that you know the address you are subscribed under, in case you later want to unsubscribe or change your subscription address. --- Administrative commands for the solr-user list --- I can handle administrative requests automatically. Please do not send them to the list address! Instead, send your message to the correct command address: To subscribe to the list, send a message to: solr-user-subscr...@lucene.apache.org To remove your address from the list, send a message to: solr-user-unsubscr...@lucene.apache.org Send mail to the following for info and FAQ for this list: solr-user-i...@lucene.apache.org solr-user-...@lucene.apache.org Similar addresses exist for the digest list: solr-user-digest-subscr...@lucene.apache.org solr-user-digest-unsubscr...@lucene.apache.org To get messages 123 through 145 (a maximum of 100 per request), mail: solr-user-get.123_...@lucene.apache.org To get an index with subject and author for messages 123-456 , mail: solr-user-index.123_...@lucene.apache.org They are always returned as sets of 100, max 2000 per request, so you'll actually get 100-499. To receive all messages with the same subject as message 12345, send a short message to: solr-user-thread.12...@lucene.apache.org The messages should contain one line or word of text to avoid being treated as s...@m, but I will ignore their content. Only the ADDRESS you send to is important. You can start a subscription for an alternate address, for example j...@host.domain, just add a hyphen and your address (with '=' instead of '@') after the command word: solr-user-subscribe-john=host.dom...@lucene.apache.org To stop subscription for this address, mail: solr-user-unsubscribe-john=host.dom...@lucene.apache.org In both cases, I'll send a confirmation message to that address. When you receive it, simply reply to it to complete your subscription. If despite following these instructions, you do not get the desired results, please contact my owner at solr-user-ow...@lucene.apache.org. Please be patient, my owner is a lot slower than I am ;-) --- Enclosed is a copy of the request I received. Return-Path: solr...@gmail.com Received: (qmail 48883 invoked by uid 99); 11 Nov 2010 15:26:44 - Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 Nov 2010 15:26:44 + X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of solr...@gmail.comdesignates 209.85.213.48 as permitted sender) Received: from [209.85.213.48] (HELO mail-yw0-f48.google.com) (209.85.213.48) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 Nov 2010 15:26:35 + Received: by ywp4 with SMTP id 4so1394872ywp.35 for solr-user-sc.1289489103.apfngfdapdhadiahjfln-solrnew=gmail.com @lucene.apache.org; Thu, 11 Nov 2010 07:26:14 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:received:in-reply-to :references:date:message-id:subject:from:to:content-type; bh=4KuKRrRVLjzTO4oB9/DNxMdQPfNQH2GnYznzPE6YqOo=; b=l5lBfUYcyvipJn9SE+5j+t1XUmBjTtbyPYlRVj7jDb6G+W3NzQ21EHOowiD9rNH2L9 gc2+6mGEZmRJOZQwpKD7SUQ2bXL9fVm7mVfS21TMAgC+ZsWQ3vvFOHXalWZa8dbtcOY7 C23KauLY7YH1UfducfXL77J7u0/snEZl5jQ7A= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=nb9+3a9bOHnjGO5T5BhMlW15adcafr+MPzvpgc5X5NXEUGCI05ViLho0SSoQP2Wp2i xp1Mfjrjw05umeKmHX23oeD5Idc2G6xgz8I3ZcJ1bUM+cD7c52cMKG2suE2VvhUHlfah z52rEtlqd0Q9fk/ZDWwR2DS7GoiVMRmgaWgD0= MIME-Version: 1.0 Received: by 10.229.216.201 with SMTP id hj9mr877669qcb.58.1289489174123; Thu, 11 Nov 2010 07:26:14 -0800 (PST) Received: by 10.229.66.165 with HTTP; Thu, 11 Nov 2010 07:26:14 -0800 (PST) In-Reply-To: 1289489103.46214.ez...@lucene.apache.org References: 1289489103.46214.ez...@lucene.apache.org
Re: WELCOME to solr-user@lucene.apache.org
There's not much to go on here. Boosting works, and index time as opposed to query time boosting addresses two different needs. Could you add some detail? All you've really said is it didn't work, which doesn't allow a very constructive response. Perhaps you could review: http://wiki.apache.org/solr/HowToContribute Best Erick On Thu, Nov 11, 2010 at 10:32 AM, Solr User solr...@gmail.com wrote: Hi, I have a question about boosting. I have the following fields in my schema.xml: 1. title 2. description 3. ISBN etc I want to boost the field title. I tried index time boosting but it did not work. I also tried Query time boosting but with no luck. Can someone help me on how to implement boosting on a specific field like title? Thanks, Solr User
Re: WELCOME to solr-user@lucene.apache.org
Eric, Thank you so much for the reply and apologize for not providing all the details. The following are the field definitons in my schema.xml: field name=title type=string indexed=true stored=true omitNorms=false / field name=author type=string indexed=true stored=true multiValued=true omitNorms=true / field name=authortype type=string indexed=true stored=true multiValued=true omitNorms=true / field name=isbn13 type=string indexed=true stored=true / field name=isbn10 type=string indexed=true stored=true / field name=material type=string indexed=true stored=true / field name=pubdate type=string indexed=true stored=true / field name=pubyear type=string indexed=true stored=true / field name=reldate type=string indexed=false stored=true / field name=format type=string indexed=true stored=true / field name=pages type=string indexed=false stored=true / field name=desc type=string indexed=true stored=true / field name=series type=string indexed=true stored=true / field name=season type=string indexed=true stored=true / field name=imprint type=string indexed=true stored=true / field name=bisacsub type=string indexed=true stored=true multiValued=true omitNorms=true / field name=bisacstatus type=string indexed=false stored=true / field name=category type=string indexed=true stored=true multiValued=true omitNorms=true / field name=award type=string indexed=true stored=true multiValued=true omitNorms=true / field name=age type=string indexed=true stored=true / field name=reading type=string indexed=true stored=true / field name=grade type=string indexed=true stored=true / field name=path type=string indexed=false stored=true / field name=shortdesc type=string indexed=true stored=true / field name=subtitle type=string indexed=true stored=true omitNorms=true/ field name=price type=float indexed=true stored=true/ field name=searchFields type=textSpell indexed=true stored=true multiValued=true omitNorms=true/ Copy Fields: copyField source=title dest=searchFields/ copyField source=author dest=searchFields/ copyField source=isbn13 dest=searchFields/ copyField source=isbn10 dest=searchFields/ copyField source=format dest=searchFields/ copyField source=series dest=searchFields/ copyField source=season dest=searchFields/ copyField source=imprint dest=searchFields/ copyField source=bisacsub dest=searchFields/ copyField source=category dest=searchFields/ copyField source=award dest=searchFields/ copyField source=shortdesc dest=searchFields/ copyField source=desc dest=searchFields/ copyField source=subtitle dest=searchFields/ defaultSearchFieldsearchFields/defaultSearchField Before creating the indexes I feed XML file to the Solr job to create index files. I added Boost attribute to the title field before creating indexes and an example is below: ?xml version=1.0 encoding=UTF-8 standalone=no?adddocfield name=material1785440/fieldfield boost=10.0 name=titleEach Little Bird That Sings/fieldfield name=price16.0/fieldfield name=isbn100152051139/fieldfield name=isbn139780152051136/fieldfield name=formatHardcover/fieldfield name=pubdate2005-03-01/fieldfield name=pubyear2005/fieldfield name=reldate2005-02-22/fieldfield name=pages272/fieldfield name=bisacstatusActive/fieldfield name=seasonSpring 2005/fieldfield name=imprintChildren's/fieldfield name=age8.0-12.0/fieldfield name=grade3-6/fieldfield name=authorMarla Frazee/fieldfield name=authortypeJacket Illustrator/fieldfield name=authorDeborah Wiles/fieldfield name=authortypeAuthor/fieldfield name=bisacsubSocial Issues/Friendship/fieldfield name=bisacsubSocial Issues/General (see also headings under Family)/fieldfield name=bisacsubGeneral/fieldfield name=bisacsubGirls amp; Women/fieldfield name=categoryFiction/Middle Grade/fieldfield name=categoryFiction/Award Winners/fieldfield name=categoryComing of Age/fieldfield name=categorySocial Situations/Death amp; Dying/fieldfield name=categorySocial Situations/Friendship/fieldfield name=path/assets/product/0152051139.gif/fieldfield name=desclt;divgt;Ten-year-old Comfort Snowberger has attended 247 funerals. But that's not surprising, considering that her family runs the town funeral home. And even though Great-uncle Edisto keeled over with a heart attack and Great-great-aunt Florentine dropped dead--just like that--six months later, Comfort knows how to deal with loss, or so she thinks. She's more concerned with avoiding her crazy cousin Peach and trying to figure out why her best friend, Declaration, suddenly won't talk to her. Life is full of surprises. And the biggest one of all is learning what it takes to handle them.lt;brgt; lt;brgt;Deborah Wiles has created a unique, funny, and utterly real cast of characters in this heartfelt, and quintessentially Southern coming-of-age novel. Comfort will charm young readers with her wit, her warmth, and her struggles as she learns about life, loss, and ultimately, triumph.lt;brgt;lt;/divgt;/fieldfield name=shortdescTen-year-old Comfort Snowberger
Re: WELCOME to solr-user@lucene.apache.org
There are several mistakes in your approach: copyField just copies data. Index time boost is not copied. There is no such boosting syntax. /select?q=Eachtitle^9fl=score You are searching on your default field. This is not your cause of your problem but omitNorms=true disables index time boosts. http://wiki.apache.org/solr/DisMaxQParserPlugin can satisfy your need. --- On Thu, 11/11/10, Solr User solr...@gmail.com wrote: From: Solr User solr...@gmail.com Subject: Re: WELCOME to solr-user@lucene.apache.org To: solr-user@lucene.apache.org Date: Thursday, November 11, 2010, 11:54 PM Eric, Thank you so much for the reply and apologize for not providing all the details. The following are the field definitons in my schema.xml: field name=title type=string indexed=true stored=true omitNorms=false / field name=author type=string indexed=true stored=true multiValued=true omitNorms=true / field name=authortype type=string indexed=true stored=true multiValued=true omitNorms=true / field name=isbn13 type=string indexed=true stored=true / field name=isbn10 type=string indexed=true stored=true / field name=material type=string indexed=true stored=true / field name=pubdate type=string indexed=true stored=true / field name=pubyear type=string indexed=true stored=true / field name=reldate type=string indexed=false stored=true / field name=format type=string indexed=true stored=true / field name=pages type=string indexed=false stored=true / field name=desc type=string indexed=true stored=true / field name=series type=string indexed=true stored=true / field name=season type=string indexed=true stored=true / field name=imprint type=string indexed=true stored=true / field name=bisacsub type=string indexed=true stored=true multiValued=true omitNorms=true / field name=bisacstatus type=string indexed=false stored=true / field name=category type=string indexed=true stored=true multiValued=true omitNorms=true / field name=award type=string indexed=true stored=true multiValued=true omitNorms=true / field name=age type=string indexed=true stored=true / field name=reading type=string indexed=true stored=true / field name=grade type=string indexed=true stored=true / field name=path type=string indexed=false stored=true / field name=shortdesc type=string indexed=true stored=true / field name=subtitle type=string indexed=true stored=true omitNorms=true/ field name=price type=float indexed=true stored=true/ field name=searchFields type=textSpell indexed=true stored=true multiValued=true omitNorms=true/ Copy Fields: copyField source=title dest=searchFields/ copyField source=author dest=searchFields/ copyField source=isbn13 dest=searchFields/ copyField source=isbn10 dest=searchFields/ copyField source=format dest=searchFields/ copyField source=series dest=searchFields/ copyField source=season dest=searchFields/ copyField source=imprint dest=searchFields/ copyField source=bisacsub dest=searchFields/ copyField source=category dest=searchFields/ copyField source=award dest=searchFields/ copyField source=shortdesc dest=searchFields/ copyField source=desc dest=searchFields/ copyField source=subtitle dest=searchFields/ defaultSearchFieldsearchFields/defaultSearchField Before creating the indexes I feed XML file to the Solr job to create index files. I added Boost attribute to the title field before creating indexes and an example is below: ?xml version=1.0 encoding=UTF-8 standalone=no?adddocfield name=material1785440/fieldfield boost=10.0 name=titleEach Little Bird That Sings/fieldfield name=price16.0/fieldfield name=isbn100152051139/fieldfield name=isbn139780152051136/fieldfield name=formatHardcover/fieldfield name=pubdate2005-03-01/fieldfield name=pubyear2005/fieldfield name=reldate2005-02-22/fieldfield name=pages272/fieldfield name=bisacstatusActive/fieldfield name=seasonSpring 2005/fieldfield name=imprintChildren's/fieldfield name=age8.0-12.0/fieldfield name=grade3-6/fieldfield name=authorMarla Frazee/fieldfield name=authortypeJacket Illustrator/fieldfield name=authorDeborah Wiles/fieldfield name=authortypeAuthor/fieldfield name=bisacsubSocial Issues/Friendship/fieldfield name=bisacsubSocial Issues/General (see also headings under Family)/fieldfield name=bisacsubGeneral/fieldfield name=bisacsubGirls amp; Women/fieldfield name=categoryFiction/Middle Grade/fieldfield name=categoryFiction/Award Winners/fieldfield name=categoryComing of Age/fieldfield name=categorySocial Situations/Death amp; Dying/fieldfield name=categorySocial Situations/Friendship/fieldfield name=path/assets/product/0152051139.gif/fieldfield name=desclt;divgt;Ten-year-old Comfort Snowberger has attended 247 funerals. But that's not surprising, considering that her family runs the town funeral home. And even though Great-uncle Edisto
Re: WELCOME to solr-user@lucene.apache.org
Hi, If you are looking for query time boosting on title field you can do the following: /select?q=title:android^10 Also unless you have a very good reason to use string for date data (in your case pubdate and reldate), you should be using solr.DateField. regards, Ram On Fri, Nov 12, 2010 at 3:41 AM, Ahmet Arslan iori...@yahoo.com wrote: There are several mistakes in your approach: copyField just copies data. Index time boost is not copied. There is no such boosting syntax. /select?q=Eachtitle^9fl=score You are searching on your default field. This is not your cause of your problem but omitNorms=true disables index time boosts. http://wiki.apache.org/solr/DisMaxQParserPlugin can satisfy your need. --- On Thu, 11/11/10, Solr User solr...@gmail.com wrote: From: Solr User solr...@gmail.com Subject: Re: WELCOME to solr-user@lucene.apache.org To: solr-user@lucene.apache.org Date: Thursday, November 11, 2010, 11:54 PM Eric, Thank you so much for the reply and apologize for not providing all the details. The following are the field definitons in my schema.xml: field name=title type=string indexed=true stored=true omitNorms=false / field name=author type=string indexed=true stored=true multiValued=true omitNorms=true / field name=authortype type=string indexed=true stored=true multiValued=true omitNorms=true / field name=isbn13 type=string indexed=true stored=true / field name=isbn10 type=string indexed=true stored=true / field name=material type=string indexed=true stored=true / field name=pubdate type=string indexed=true stored=true / field name=pubyear type=string indexed=true stored=true / field name=reldate type=string indexed=false stored=true / field name=format type=string indexed=true stored=true / field name=pages type=string indexed=false stored=true / field name=desc type=string indexed=true stored=true / field name=series type=string indexed=true stored=true / field name=season type=string indexed=true stored=true / field name=imprint type=string indexed=true stored=true / field name=bisacsub type=string indexed=true stored=true multiValued=true omitNorms=true / field name=bisacstatus type=string indexed=false stored=true / field name=category type=string indexed=true stored=true multiValued=true omitNorms=true / field name=award type=string indexed=true stored=true multiValued=true omitNorms=true / field name=age type=string indexed=true stored=true / field name=reading type=string indexed=true stored=true / field name=grade type=string indexed=true stored=true / field name=path type=string indexed=false stored=true / field name=shortdesc type=string indexed=true stored=true / field name=subtitle type=string indexed=true stored=true omitNorms=true/ field name=price type=float indexed=true stored=true/ field name=searchFields type=textSpell indexed=true stored=true multiValued=true omitNorms=true/ Copy Fields: copyField source=title dest=searchFields/ copyField source=author dest=searchFields/ copyField source=isbn13 dest=searchFields/ copyField source=isbn10 dest=searchFields/ copyField source=format dest=searchFields/ copyField source=series dest=searchFields/ copyField source=season dest=searchFields/ copyField source=imprint dest=searchFields/ copyField source=bisacsub dest=searchFields/ copyField source=category dest=searchFields/ copyField source=award dest=searchFields/ copyField source=shortdesc dest=searchFields/ copyField source=desc dest=searchFields/ copyField source=subtitle dest=searchFields/ defaultSearchFieldsearchFields/defaultSearchField Before creating the indexes I feed XML file to the Solr job to create index files. I added Boost attribute to the title field before creating indexes and an example is below: ?xml version=1.0 encoding=UTF-8 standalone=no?adddocfield name=material1785440/fieldfield boost=10.0 name=titleEach Little Bird That Sings/fieldfield name=price16.0/fieldfield name=isbn100152051139/fieldfield name=isbn139780152051136/fieldfield name=formatHardcover/fieldfield name=pubdate2005-03-01/fieldfield name=pubyear2005/fieldfield name=reldate2005-02-22/fieldfield name=pages272/fieldfield name=bisacstatusActive/fieldfield name=seasonSpring 2005/fieldfield name=imprintChildren's/fieldfield name=age8.0-12.0/fieldfield name=grade3-6/fieldfield name=authorMarla Frazee/fieldfield name=authortypeJacket Illustrator/fieldfield name=authorDeborah Wiles/fieldfield name=authortypeAuthor/fieldfield name=bisacsubSocial Issues/Friendship/fieldfield name=bisacsubSocial Issues/General (see also headings under Family)/fieldfield name=bisacsubGeneral/fieldfield name=bisacsubGirls Women/fieldfield name=categoryFiction/Middle Grade/fieldfield name=categoryFiction/Award Winners/fieldfield name=categoryComing of Age/fieldfield name=categorySocial Situations/Death Dying/fieldfield name
Re: WELCOME to solr-user@lucene.apache.org
(FYI: in the future please start a new thread with an approriate subject line when you ask questions -- you probably would have gotten a lot more responses fro people interested in Tika and SolrCell if they could tell that this email was about SolrCell) : I found that Tika read the html and extract metadata like meta name=id : content=12 from my htmls but my documents has an already an id setted by : literal.id=10. : : I tried to map the id from Tika by fmap.id=ignored_ but it ignore also my : literal.id H, yeah: that seems like an odd order of operations, but it's documented on the wiki so evidently it's intentional... http://wiki.apache.org/solr/ExtractingRequestHandler#Order_of_field_operations my best sugguestions: * use the capture param to restrict what gets extracted (it's probably possible to write an XPath query that selects everything *except* metadata[id]) * change the name of your uniqueKey field to be something other then id so it's less likely to collide with a value from the document. I also opened two Jira issues that you may want to post comments in... https://issues.apache.org/jira/browse/SOLR-1633 https://issues.apache.org/jira/browse/SOLR-1634 -Hoss
Re: WELCOME to solr-user@lucene.apache.org
2 ways I can think of ... - ExtractingRequestHandler (this is what I am guessing you are using now) Set extractOnly=true while making a request to the extractingRequestHandler and get the parsed content back. Now make a post request on update request handler with what ever fields and field values you want. - Use HTMLStripWhiteSpaceTokenizer factory. This article may be helpful to explain what I mean. http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.HTMLStripWhitespaceTokenizerFactory. - Raghu On Sat, Dec 5, 2009 at 3:44 AM, khalid y kern...@gmail.com wrote: Hi, I have a problem with solr. I'm indexing some html content and solr crash because my id field is multivalued. I found that Tika read the html and extract metadata like meta name=id content=12 from my htmls but my documents has an already an id setted by literal.id=10. I tried to map the id from Tika by fmap.id=ignored_ but it ignore also my literal.id I'm using solr 1.4 and tika 0.5 Someone can explain to me how I can ignore this the Tika id metadata ?? Thanks
Re: WELCOME to solr-user@lucene.apache.org
Thanks a lot for you response !! For the first solution : I need to index all the content of my websites and I want just tika ignore meta name=id because I have already an id I'll try monday and tell you if it works The second solution : Are your sure Tika use the HTML Tokenizer ? I'll check 2009/12/5 Raghuveer Kancherla raghuveer.kanche...@aplopio.com 2 ways I can think of ... - ExtractingRequestHandler (this is what I am guessing you are using now) Set extractOnly=true while making a request to the extractingRequestHandler and get the parsed content back. Now make a post request on update request handler with what ever fields and field values you want. - Use HTMLStripWhiteSpaceTokenizer factory. This article may be helpful to explain what I mean. http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.HTMLStripWhitespaceTokenizerFactory . - Raghu On Sat, Dec 5, 2009 at 3:44 AM, khalid y kern...@gmail.com wrote: Hi, I have a problem with solr. I'm indexing some html content and solr crash because my id field is multivalued. I found that Tika read the html and extract metadata like meta name=id content=12 from my htmls but my documents has an already an id setted by literal.id=10. I tried to map the id from Tika by fmap.id=ignored_ but it ignore also my literal.id I'm using solr 1.4 and tika 0.5 Someone can explain to me how I can ignore this the Tika id metadata ?? Thanks
Re: WELCOME to solr-user@lucene.apache.org
Hi, We are currently using Solr as search engine. To add an existing website to our search engine, we are investigating Nutch. Does anyone have more information / experience about an integration between Solr and Nutch? Thanks in advance ! Best regards, Jan
Re: WELCOME to solr-user@lucene.apache.org
currently two approaches: http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html and: https://issues.apache.org/jira/browse/NUTCH-442 I have had experience with the former... you may have more luck on the nutch-user list for help ryan Jan Buelens wrote: Hi, We are currently using Solr as search engine. To add an existing website to our search engine, we are investigating Nutch. Does anyone have more information / experience about an integration between Solr and Nutch? Thanks in advance ! Best regards, Jan