Hi,

I have nothing said. This works fine ! It's morning and I'm still not woke up 
:-D

I just want to know if it was possible to re index modified documents ? Or re 
index documents which are already in database ?

Thank's in advance !

Jisay


> 
> Hi Susam Pal and thank's for your help !
> 
> The solution you give to me doesn't work... I have still an error with 
> Hadoop... And if I download an older version of the API, will this patch work 
> ? I have Nutch-0.9 and I don't know if I compile with an oder Hadoop API, 
> this patch will work. But if it will work where can I find an older version 
> of Hadoop API ?
> 
> Thank's in advance for your help,
> 
> Jisay
> 
> 
>>
>> I am not sure but it seems that this is because of an older version of
>> Hadoop. I don't have older versions of Nutch or Hadoop with me to
>> confirm this. Just try omitting the second argument in:
>> fs.listPaths(indexes, HadoopFSUtil.getPassAllFilter()) and see if it
>> compiles?
>>
>> I guess, fs.listPaths(indexes) should work since I can find such a
>> method (though it is deprecated now) in the latest Hadoop API.
>>
>> Regards,
>> Susam pal
>>
>> On Tue, Mar 18, 2008 at 9:09 PM, Jean-Christophe Alleman
>>  wrote:
>>>
>>>
>>>
>>> Thank's for your reply Susam Pal !
>>>
>>> I have run ant and I have an error I can't resolve... Look at this :
>>>
>>> debian:~/nutch-0.9# ant
>>> Buildfile: build.xml
>>>
>>> init:
>>> [unjar] Expanding: /root/nutch-0.9/lib/hadoop-0.12.2-core.jar into 
>>> /root/nutch-0.9/build/hadoop
>>> [untar] Expanding: /root/nutch-0.9/build/hadoop/bin.tgz into 
>>> /root/nutch-0.9/bin
>>> [unjar] Expanding: /root/nutch-0.9/lib/hadoop-0.12.2-core.jar into 
>>> /root/nutch-0.9/build
>>>
>>> compile-core:
>>> [javac] Compiling 133 source files to /root/nutch-0.9/build/classes
>>> [javac] /root/nutch-0.9/src/java/org/apache/nutch/crawl/Crawl.java:150: 
>>> cannot find symbol
>>> [javac] symbol : variable HadoopFSUtil
>>> [javac] location: class org.apache.nutch.crawl.Crawl
>>> [javac] merger.merge(fs.listPaths(indexes, HadoopFSUtil.getPassAllFilter()),
>>> [javac] ^
>>> [javac] Note: Some input files use or override a deprecated API.
>>> [javac] Note: Recompile with -Xlint:deprecation for details.
>>> [javac] Note: Some input files use unchecked or unsafe operations.
>>> [javac] Note: Recompile with -Xlint:unchecked for details.
>>> [javac] 1 error
>>>
>>> BUILD FAILED
>>> /root/nutch-0.9/build.xml:106: Compile failed; see the compiler error 
>>> output for details.
>>>
>>> Total time: 8 seconds
>>>
>>> I have already corrected 3errors but I can't correct this one... I don't 
>>> know what's HadoopFSUtil and so I can't correct the error... Help me please,
>>>
>>> Thank's for your help !
>>>
>>> Jisay
>>>
>>>
>>>
>>>>
>>>> The patch was generated for Nutch 1.0 development version which is
>>>> currently in trunk. So, it is unable to patch your older version
>>>> cleanly.
>>>>
>>>> I also see that you are using NUTCH-601v0.3.patch. However,
>>>> NUTCH-601v1.0.patch is the recommended patch. If this patch fails, you
>>>> can make the modifications manually. This patch is extremely simple
>>>> and if you just open the patch using a text editor, you would find
>>>> that 3 lines have been removed from the original source code
>>>> (indicated by leading minus signs) and 11 new lines have been added
>>>> (indicated by plus signs). You have to make these changes manually to
>>>> your Nutch 0.9 source code directory.
>>>>
>>>> Once you make the changes, just build your project again with ant and
>>>> you would be ready for recrawl.
>>>>
>>>> Regards,
>>>> Susam Pal
>>>>
>>>> On Tue, Mar 18, 2008 at 7:12 PM, Jean-Christophe Alleman
>>>
>>>> wrote:
>>>>>
>>>>>
>>>>> Hi, I'm interested by this patch but I can't patch it. I have some 
>>>>> problems when I try to patch...
>>>>>
>>>>> Here is what I do :
>>>>>
>>>>> debian:~/patch# patch -p0> can't find file to patch at input line 5
>>>
>>>>> Perhaps you used the wrong -p or --strip option?
>>>>> The text leading up to this was:
>>>>> --------------------------
>>>>> |Index: src/java/org/apache/nutch/crawl/Crawl.java
>>>>> |===================================================================
>>>>> |--- src/java/org/apache/nutch/crawl/Crawl.java (revision 628119)
>>>>> |+++ src/java/org/apache/nutch/crawl/Crawl.java (working copy)
>>>>> --------------------------
>>>>> File to patch: /root/nutch-0.9/src/java/org/apache/nutch/crawl/Crawl.java
>>>>> patching file /root/nutch-0.9/src/java/org/apache/nutch/crawl/Crawl.java
>>>>> Reversed (or previously applied) patch detected! Assume -R? [n] y
>>>>> Hunk #2 FAILED at 100.
>>>>> Hunk #3 FAILED at 131.
>>>>> 2 out of 3 hunks FAILED -- saving rejects to file 
>>>>> /root/nutch-0.9/src/java/org/apache/nutch/crawl/Crawl.java.rej
>>>>>
>>>>> Can you please help me ! It's first time I patch. Please help me !
>>>>>
>>>>> Thank's in advance,
>>>>>
>>>>> Jisay
>>>>>
>>>>>
>>>>>
>>>>>>
>>>>>> The recrawl patch in https://issues.apache.org/jira/browse/NUTCH-601
>>>>>> got committed today. So if you check out the latest trunk, you can
>>>>>> recrawl without deleting the crawl directory.
>>>>>>
>>>>>> However, if you are using an older version, you may use the script at:
>>>>>> http://wiki.apache.org/nutch/Crawl
>>>>>>
>>>>>> Regards,
>>>>>> Susam Pal
>>>>>>
>>>>>> On Fri, Mar 14, 2008 at 3:48 AM, Bradford Stephens
>>>
>>>>>> wrote:
>>>>>>> Greetings,
>>>>>>>
>>>>>>> A coworker and I are experimenting with Nutch in anticipation of a
>>>>>>> pretty large rollout at our company. However, we seem to be stuck on
>>>>>>> something -- after the crawler is finished, we can't manually re-crawl
>>>>>>> into the same directory/index! It says "Directory already exists" when
>>>>>>> we try to initiate a new crawl. Any ideas?
>>>>>>>
>>>>>>> Cheers,
>>>>>>> Bradford
>>>>>>>
>>>>>
>>>>> _________________________________________________________________
>>>>> Changez votre Live en un clic !
>>>>> http://get.live.com
>>>
>>> _________________________________________________________________
>>> Windows Live: une foule de solutions orginales pour partager vos souvenirs !
>>> http://get.live.com
> 
> _________________________________________________________________
> Emmenez vos amis avec vous, grâce à Messenger sur votre GSM.
> http://get.live.com

_________________________________________________________________
Vous partez ? Emmenez vos amis avec vous !
http://www.windowslivemobile.msn.com/nl-be

Reply via email to