mcgibbney <lewi...@apache.org>
To: "user@nutch.apache.org" <user@nutch.apache.org>
Sent: Monday, November 21, 2016 10:34 AM
Subject: Re: indexing to Solr
Hi Michael,
On Sat, Nov 19, 2016 at 8:09 AM, <user-digest-h...@nutch.apache.org> wrote:
> From: Michael Coffe
In this case, I am using solr 5.4.1
Also, as mentioned previously, the tutorial says nothing about which version of
solr to use.
From: lewis john mcgibbney <lewi...@apache.org>
To: "user@nutch.apache.org" <user@nutch.apache.org>
Sent: Monday, November 21, 2016 10:3
nt: Monday, November 21, 2016 10:34 AM
Subject: Re: indexing to Solr
Hi Michael,
On Sat, Nov 19, 2016 at 8:09 AM, <user-digest-h...@nutch.apache.org> wrote:
> From: Michael Coffey <mcof...@yahoo.com.invalid>
> To: "user@nutch.apache.org" <user@nutch.apache.org
Hi Michael,
On Sat, Nov 19, 2016 at 8:09 AM, <user-digest-h...@nutch.apache.org> wrote:
> From: Michael Coffey <mcof...@yahoo.com.invalid>
> To: "user@nutch.apache.org" <user@nutch.apache.org>
> Cc:
> Date: Fri, 18 Nov 2016 21:15:14 + (UTC)
> Subj
Where can I find up-to-date information on indexing to Solr? When I search the
web, I find tutorials that use the deprecated solrindex command. I also find
questions where people want to know why it doesn't work.
I have a good nutch 1.12 installation on a working hadoop cluster and a Solr
6.3.0
2.jar library.
>
> Regards
>
> - Mensaje original -
>> De: "Roannel Fernández Hernández" <roan...@uci.cu>
>> Para: user@nutch.apache.org
>> Enviados: Miércoles, 18 de Noviembre 2015 9:03:38
>> Asunto: Re: [MASSMAIL]Crawl Command - Getting Ex
ensaje original -
>> De: "Manish Verma" ve...@apple.com>
>> Para: user@nutch.apache.org
>> Enviados: Lunes, 16 de Noviembre 2015 12:36:46
>> Asunto: [MASSMAIL]Crawl Command - Getting Exception While Indexing With Solr
>>
>> Hi ,
>>
>>
Hi
What version of Nutch you downloaded exactly?
Regards
- Mensaje original -
> De: "Manish Verma" ve...@apple.com>
> Para: user@nutch.apache.org
> Enviados: Lunes, 16 de Noviembre 2015 12:36:46
> Asunto: [MASSMAIL]Crawl Command - Getting Exception While I
2015 9:03:38
> Asunto: Re: [MASSMAIL]Crawl Command - Getting Exception While Indexing With
> Solr
>
> Hi
>
> What version of Nutch you downloaded exactly?
>
> Regards
>
> - Mensaje original -
> > De: "Manish Verma" ve...@apple.com>
> >
Hi ,
I was using bin version of Nutch 1.X and everything was working fine , I
downloaded the source of Nutch 1.x and build it, In indexing phase it throws
below exception. I am using below crawl command and it run well till parsing
and fails at indexing with below exception.
./crawl -i -D
here
>
> https://github.com/apache/nutch/blob/trunk/conf/schema.xml
>
> Lewis
>
> On Thu, Sep 3, 2015 at 11:13 AM, <user-digest-h...@nutch.apache.org>
> wrote:
>
> >
> > Subject: Re: Problems indexing to solr 3.5 from nutch 1.8
> > Having a similar problem i
Hi Guy,
The schema is present in the conf directory as shown here
https://github.com/apache/nutch/blob/trunk/conf/schema.xml
Lewis
On Thu, Sep 3, 2015 at 11:13 AM, <user-digest-h...@nutch.apache.org> wrote:
>
> Subject: Re: Problems indexing to solr 3.5 from nutch 1.8
> H
Hi Paddy,
Some comments in addition to my response. You should try upgrading to Nutch
1.10 when we release very shortly. There has been so much work done since
1.8 that you can benefit from. Keep your ears peeled here for a release
candidate and then eventual release.
Please see response below.
Having a similar problem in getting Nutch and Solr integrated. Newest
version of both. Downloaded and installed a few days ago.
Following the tut tells me to copy over the schema.xml, but it doesn't
appear to be in the directory that the tut says. Or anywhere for that
matter.
This is probably a
Hey there,
I'm running into Problems with indexing documents crawled by nutch 1.8 into
solr 3.5. Nutch does not report any kind of
error or warning and seems to run just fine. but the solr index remains
empty. (The logs do not show any kind of error or warning eather).
Is there any way to solve
, you will see that the indexing job has failed.
Regards
Ameer
--
View this message in context:
http://lucene.472066.n3.nabble.com/NullPointerException-occured-during-indexing-to-solr-from-nutch-1-7-source-build-tp4156343p4157058.html
Sent from the Nutch - User mailing list archive at Nabble.com.
...@uyarer.com
Sent:user@nutch.apache.org
Date:Tue, September 2, 2014 8:35 pm
Subject:Re: NullPointerException occured during indexing to solr from
nutch 1.7 source build.
Hi,
This is an issue. Below is the code of SolrDeleteDuplicate class
from
nutch
1.7 trunk where the solr record is deleted
indexing to solr, i'm getting below
exceptions.
I have copied the scheme-solr4.xml to my solr and added
exceptions in regex-urlfilter.txt for a particular website which i give
for crawling in the directory urls/seed.txt.
Error:
java.lang.NullPointerException
Not Found
request: http://127.0.0.1:8080/solr/core2/update
-Original Message-
From: Fournier, Danny G [mailto:danny.fourn...@dfo-mpo.gc.ca]
Sent: September 6, 2012 4:15 PM
To: user@nutch.apache.org
Subject: Errors when indexing to Solr
I'm getting two different errors while trying to index
-Original message-
From:Fournier, Danny G danny.fourn...@dfo-mpo.gc.ca
Sent: Fri 07-Sep-2012 14:46
To: user@nutch.apache.org
Subject: RE: Errors when indexing to Solr
I've tried crawling with nutch-1.6-SNAPSHOT.jar and got the following
error:
[root@w7sp1-x64 nutch]# bin/nutch
...@openindex.io]
Sent: September 7, 2012 9:49 AM
To: user@nutch.apache.org
Subject: RE: Errors when indexing to Solr
-Original message-
From:Fournier, Danny G danny.fourn...@dfo-mpo.gc.ca
Sent: Fri 07-Sep-2012 14:46
To: user@nutch.apache.org
Subject: RE: Errors when indexing to Solr
I've
1.6 to fix this?
Error #1 - When indexing directly to Solr
Command: bin/nutch crawl urls -solr http://localhost:8080/solr/core2
-depth 3 -topN 5
Error: Exception in thread main java.io.IOException:
org.apache.solr.client.solrj.SolrServerException
-Original Message-
From: arkadi.kosmy...@csiro.au [mailto:arkadi.kosmy...@csiro.au]
Sent: Friday, 28 October 2011 12:11 PM
To: user@nutch.apache.org; markus.jel...@openindex.io
Subject: [ExternalEmail] RE: OutOfMemoryError when indexing into Solr
Hi Markus,
-Original Message
...@csiro.au]
Sent: Friday, 28 October 2011 12:11 PM
To: user@nutch.apache.org; markus.jel...@openindex.io
Subject: [ExternalEmail] RE: OutOfMemoryError when indexing into Solr
Hi Markus,
-Original Message-
From: Markus Jelsma [mailto:markus.jel...@openindex.io]
Sent: Thursday
of memory when indexing into Solr. This does
not look like a trivial lack of memory problem that can be solved by giving
more memory to the JVM. I've increased the max memory size from 2Gb to 3Gb,
then to 6Gb, but this did not make any difference.
A log extract is included below.
Would anyone have
26, 2011 at 11:54 PM, arkadi.kosmy...@csiro.au wrote:
Hi,
I am working with a Nutch 1.4 snapshot and having a very strange problem
that makes the system run out of memory when indexing into Solr. This
does not look like a trivial lack of memory problem that can be solved
by giving more
of IndexerMapReduce is a notorious RAM consumer.
On Thursday 27 October 2011 05:54:54 arkadi.kosmy...@csiro.au wrote:
Hi,
I am working with a Nutch 1.4 snapshot and having a very strange problem
that makes the system run out of memory when indexing into Solr. This does
not look like a trivial lack
Hi Markus,
-Original Message-
From: Markus Jelsma [mailto:markus.jel...@openindex.io]
Sent: Thursday, 27 October 2011 11:33 PM
To: user@nutch.apache.org
Subject: Re: OutOfMemoryError when indexing into Solr
Interesting, how many records and how large are your records?
There a bit
On 07.08.2011 15:35, Markus Jelsma wrote:
700 property
701 namemoreIndexingFilter.indexMimeTypeParts/name
702 valuetrue/value
703 descriptionDetermines whether the index-more plugin will split the
mime-
type
704 in sub parts, this requires the type field to be multi valued.
it is in nutch-default of 1.3 only. If you upgraded and copied over the 1.2
conf you'll miss it indeed.
On 07.08.2011 15:35, Markus Jelsma wrote:
700 property
701 namemoreIndexingFilter.indexMimeTypeParts/name
702 valuetrue/value
703
700 property
701 namemoreIndexingFilter.indexMimeTypeParts/name
702 valuetrue/value
703 descriptionDetermines whether the index-more plugin will split the
mime-
type
704 in sub parts, this requires the type field to be multi valued. Set to
true
for backward
705
Hi,
Not too familiar these days
with Nutch, but my guess is
that a Solr analyser is getting applied. To have a field exactly as is, use
the String fieldtype on Solr's schema.xml rather than tje text fieldtype.
Regards,
Gora
On 05-Aug-2011 6:35 PM, Marek Bachmann m.bachm...@uni-kassel.de wrote:
32 matches
Mail list logo