Re: indexing to Solr

2016-12-17 Thread Michael Coffey
mcgibbney <lewi...@apache.org> To: "user@nutch.apache.org" <user@nutch.apache.org> Sent: Monday, November 21, 2016 10:34 AM Subject: Re: indexing to Solr Hi Michael, On Sat, Nov 19, 2016 at 8:09 AM, <user-digest-h...@nutch.apache.org> wrote: > From: Michael Coffe

Re: indexing to Solr

2016-12-17 Thread Michael Coffey
In this case, I am using solr 5.4.1 Also, as mentioned previously, the tutorial says nothing about which version of solr to use. From: lewis john mcgibbney <lewi...@apache.org> To: "user@nutch.apache.org" <user@nutch.apache.org> Sent: Monday, November 21, 2016 10:3

Re: indexing to Solr

2016-11-21 Thread Michael Coffey
nt: Monday, November 21, 2016 10:34 AM Subject: Re: indexing to Solr Hi Michael, On Sat, Nov 19, 2016 at 8:09 AM, <user-digest-h...@nutch.apache.org> wrote: > From: Michael Coffey <mcof...@yahoo.com.invalid> > To: "user@nutch.apache.org" <user@nutch.apache.org

Re: indexing to Solr

2016-11-21 Thread lewis john mcgibbney
Hi Michael, On Sat, Nov 19, 2016 at 8:09 AM, <user-digest-h...@nutch.apache.org> wrote: > From: Michael Coffey <mcof...@yahoo.com.invalid> > To: "user@nutch.apache.org" <user@nutch.apache.org> > Cc: > Date: Fri, 18 Nov 2016 21:15:14 + (UTC) > Subj

indexing to Solr

2016-11-18 Thread Michael Coffey
Where can I find up-to-date information on indexing to Solr? When I search the web, I find tutorials that use the deprecated solrindex command. I also find questions where people want to know why it doesn't work. I have a good nutch 1.12 installation on a working hadoop cluster and a Solr 6.3.0

Re: [MASSMAIL]Crawl Command - Getting Exception While Indexing With Solr

2015-11-18 Thread Manish Verma
2.jar library. > > Regards > > - Mensaje original - >> De: "Roannel Fernández Hernández" <roan...@uci.cu> >> Para: user@nutch.apache.org >> Enviados: Miércoles, 18 de Noviembre 2015 9:03:38 >> Asunto: Re: [MASSMAIL]Crawl Command - Getting Ex

Re: [MASSMAIL]Crawl Command - Getting Exception While Indexing With Solr

2015-11-18 Thread Manish Verma
ensaje original - >> De: "Manish Verma" ve...@apple.com> >> Para: user@nutch.apache.org >> Enviados: Lunes, 16 de Noviembre 2015 12:36:46 >> Asunto: [MASSMAIL]Crawl Command - Getting Exception While Indexing With Solr >> >> Hi , >> >>

Re: [MASSMAIL]Crawl Command - Getting Exception While Indexing With Solr

2015-11-18 Thread Roannel Fernández Hernández
Hi What version of Nutch you downloaded exactly? Regards - Mensaje original - > De: "Manish Verma" ve...@apple.com> > Para: user@nutch.apache.org > Enviados: Lunes, 16 de Noviembre 2015 12:36:46 > Asunto: [MASSMAIL]Crawl Command - Getting Exception While I

Re: [MASSMAIL]Crawl Command - Getting Exception While Indexing With Solr

2015-11-18 Thread Roannel Fernández Hernández
2015 9:03:38 > Asunto: Re: [MASSMAIL]Crawl Command - Getting Exception While Indexing With > Solr > > Hi > > What version of Nutch you downloaded exactly? > > Regards > > - Mensaje original - > > De: "Manish Verma" ve...@apple.com> > >

Crawl Command - Getting Exception While Indexing With Solr

2015-11-16 Thread Manish Verma
Hi , I was using bin version of Nutch 1.X and everything was working fine , I downloaded the source of Nutch 1.x and build it, In indexing phase it throws below exception. I am using below crawl command and it run well till parsing and fails at indexing with below exception. ./crawl -i -D

Re: Problems indexing to solr 3.5 from nutch 1.8

2015-09-06 Thread Guy McD
here > > https://github.com/apache/nutch/blob/trunk/conf/schema.xml > > Lewis > > On Thu, Sep 3, 2015 at 11:13 AM, <user-digest-h...@nutch.apache.org> > wrote: > > > > > Subject: Re: Problems indexing to solr 3.5 from nutch 1.8 > > Having a similar problem i

Re: Problems indexing to solr 3.5 from nutch 1.8

2015-09-05 Thread Lewis John Mcgibbney
Hi Guy, The schema is present in the conf directory as shown here https://github.com/apache/nutch/blob/trunk/conf/schema.xml Lewis On Thu, Sep 3, 2015 at 11:13 AM, <user-digest-h...@nutch.apache.org> wrote: > > Subject: Re: Problems indexing to solr 3.5 from nutch 1.8 > H

Re: Problems indexing to solr 3.5 from nutch 1.8

2015-09-03 Thread Lewis John Mcgibbney
Hi Paddy, Some comments in addition to my response. You should try upgrading to Nutch 1.10 when we release very shortly. There has been so much work done since 1.8 that you can benefit from. Keep your ears peeled here for a release candidate and then eventual release. Please see response below.

Re: Problems indexing to solr 3.5 from nutch 1.8

2015-09-03 Thread Guy McD
Having a similar problem in getting Nutch and Solr integrated. Newest version of both. Downloaded and installed a few days ago. Following the tut tells me to copy over the schema.xml, but it doesn't appear to be in the directory that the tut says. Or anywhere for that matter. This is probably a

Problems indexing to solr 3.5 from nutch 1.8

2015-09-01 Thread Patrick Wilmes
Hey there, I'm running into Problems with indexing documents crawled by nutch 1.8 into solr 3.5. Nutch does not report any kind of error or warning and seems to run just fine. but the solr index remains empty. (The logs do not show any kind of error or warning eather). Is there any way to solve

Re: NullPointerException occured during indexing to solr from nutch 1.7 source build.

2014-09-04 Thread atawfik
, you will see that the indexing job has failed. Regards Ameer -- View this message in context: http://lucene.472066.n3.nabble.com/NullPointerException-occured-during-indexing-to-solr-from-nutch-1-7-source-build-tp4156343p4157058.html Sent from the Nutch - User mailing list archive at Nabble.com.

Re: NullPointerException occured during indexing to solr from nutch 1.7 source build.

2014-09-03 Thread vinay . kashyap
...@uyarer.com Sent:user@nutch.apache.org Date:Tue, September 2, 2014 8:35 pm Subject:Re: NullPointerException occured during indexing to solr from nutch 1.7 source build. Hi, This is an issue. Below is the code of SolrDeleteDuplicate class from nutch 1.7 trunk where the solr record is deleted

NullPointerException occured during indexing to solr from nutch 1.7 source build.

2014-09-02 Thread vinay . kashyap
indexing to solr, i'm getting below exceptions. I have copied the scheme-solr4.xml to my solr and added exceptions in regex-urlfilter.txt for a particular website which i give for crawling in the directory urls/seed.txt. Error: java.lang.NullPointerException

RE: Errors when indexing to Solr

2012-09-07 Thread Fournier, Danny G
Not Found request: http://127.0.0.1:8080/solr/core2/update -Original Message- From: Fournier, Danny G [mailto:danny.fourn...@dfo-mpo.gc.ca] Sent: September 6, 2012 4:15 PM To: user@nutch.apache.org Subject: Errors when indexing to Solr I'm getting two different errors while trying to index

RE: Errors when indexing to Solr

2012-09-07 Thread Markus Jelsma
-Original message- From:Fournier, Danny G danny.fourn...@dfo-mpo.gc.ca Sent: Fri 07-Sep-2012 14:46 To: user@nutch.apache.org Subject: RE: Errors when indexing to Solr I've tried crawling with nutch-1.6-SNAPSHOT.jar and got the following error: [root@w7sp1-x64 nutch]# bin/nutch

RE: Errors when indexing to Solr

2012-09-07 Thread Fournier, Danny G
...@openindex.io] Sent: September 7, 2012 9:49 AM To: user@nutch.apache.org Subject: RE: Errors when indexing to Solr -Original message- From:Fournier, Danny G danny.fourn...@dfo-mpo.gc.ca Sent: Fri 07-Sep-2012 14:46 To: user@nutch.apache.org Subject: RE: Errors when indexing to Solr I've

Errors when indexing to Solr

2012-09-06 Thread Fournier, Danny G
1.6 to fix this? Error #1 - When indexing directly to Solr Command: bin/nutch crawl urls -solr http://localhost:8080/solr/core2 -depth 3 -topN 5 Error: Exception in thread main java.io.IOException: org.apache.solr.client.solrj.SolrServerException

Re: OutOfMemoryError when indexing into Solr

2011-10-31 Thread Markus Jelsma
-Original Message- From: arkadi.kosmy...@csiro.au [mailto:arkadi.kosmy...@csiro.au] Sent: Friday, 28 October 2011 12:11 PM To: user@nutch.apache.org; markus.jel...@openindex.io Subject: [ExternalEmail] RE: OutOfMemoryError when indexing into Solr Hi Markus, -Original Message

RE: OutOfMemoryError when indexing into Solr

2011-10-30 Thread Arkadi.Kosmynin
...@csiro.au] Sent: Friday, 28 October 2011 12:11 PM To: user@nutch.apache.org; markus.jel...@openindex.io Subject: [ExternalEmail] RE: OutOfMemoryError when indexing into Solr Hi Markus, -Original Message- From: Markus Jelsma [mailto:markus.jel...@openindex.io] Sent: Thursday

Re: OutOfMemoryError when indexing into Solr

2011-10-27 Thread Fred Zimmerman
of memory when indexing into Solr. This does not look like a trivial lack of memory problem that can be solved by giving more memory to the JVM. I've increased the max memory size from 2Gb to 3Gb, then to 6Gb, but this did not make any difference. A log extract is included below. Would anyone have

Re: OutOfMemoryError when indexing into Solr

2011-10-27 Thread Markus Jelsma
26, 2011 at 11:54 PM, arkadi.kosmy...@csiro.au wrote: Hi, I am working with a Nutch 1.4 snapshot and having a very strange problem that makes the system run out of memory when indexing into Solr. This does not look like a trivial lack of memory problem that can be solved by giving more

Re: OutOfMemoryError when indexing into Solr

2011-10-27 Thread Markus Jelsma
of IndexerMapReduce is a notorious RAM consumer. On Thursday 27 October 2011 05:54:54 arkadi.kosmy...@csiro.au wrote: Hi, I am working with a Nutch 1.4 snapshot and having a very strange problem that makes the system run out of memory when indexing into Solr. This does not look like a trivial lack

RE: OutOfMemoryError when indexing into Solr

2011-10-27 Thread Arkadi.Kosmynin
Hi Markus, -Original Message- From: Markus Jelsma [mailto:markus.jel...@openindex.io] Sent: Thursday, 27 October 2011 11:33 PM To: user@nutch.apache.org Subject: Re: OutOfMemoryError when indexing into Solr Interesting, how many records and how large are your records? There a bit

Re: How to avoid splitting strings when indexing to solr

2011-08-08 Thread Marek Bachmann
On 07.08.2011 15:35, Markus Jelsma wrote: 700 property 701 namemoreIndexingFilter.indexMimeTypeParts/name 702 valuetrue/value 703 descriptionDetermines whether the index-more plugin will split the mime- type 704 in sub parts, this requires the type field to be multi valued.

Re: How to avoid splitting strings when indexing to solr

2011-08-08 Thread Markus Jelsma
it is in nutch-default of 1.3 only. If you upgraded and copied over the 1.2 conf you'll miss it indeed. On 07.08.2011 15:35, Markus Jelsma wrote: 700 property 701 namemoreIndexingFilter.indexMimeTypeParts/name 702 valuetrue/value 703

Re: How to avoid splitting strings when indexing to solr

2011-08-07 Thread Markus Jelsma
700 property 701 namemoreIndexingFilter.indexMimeTypeParts/name 702 valuetrue/value 703 descriptionDetermines whether the index-more plugin will split the mime- type 704 in sub parts, this requires the type field to be multi valued. Set to true for backward 705

Re: How to avoid splitting strings when indexing to solr

2011-08-05 Thread Gora Mohanty
Hi, Not too familiar these days with Nutch, but my guess is that a Solr analyser is getting applied. To have a field exactly as is, use the String fieldtype on Solr's schema.xml rather than tje text fieldtype. Regards, Gora On 05-Aug-2011 6:35 PM, Marek Bachmann m.bachm...@uni-kassel.de wrote: