Subject: Re: Parsed segment has outlinks filtered
Hi,
Setting the property parse.filter.urls=false does not filter out the outlinks.
I get all the outlinks for my parsed URL, so this is working as expected.
However, it has caused something unwarranted in the FetcherThread, as it now
seems to be fetching
alse, the Parser
>> will not filter outlinks at all.
>>
>> Yossi.
>>
>> -Original Message-
>> From: Sachin Mittal
>> Sent: Thursday, 17 October 2019 19:15
>> To: user@nutch.apache.org
>> Subject: Parsed segment has outlinks filtered
>>
>> Hi,
From: Sachin Mittal
Sent: Thursday, 17 October 2019 19:15
To: user@nutch.apache.org
Subject: Parsed segment has outlinks filtered
Hi,
I was a bit confused about the outlinks generated from a parsed URL.
If I use the utility:
bin/nutch parsechecker url
the generated output has all the outlinks.
However, if I check the dump of the parsed segment generated by the Nutch crawl
script, using the command:
bin/nutch readseg -dump