Solved!

For my purposes, I did a Java string replace of "_" with " " and it works
fine now.
I think this is related to the tokenizers.
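
For anyone hitting the same thing, the workaround can be sketched roughly as
below. This is only a minimal illustration, not Nutch code: the class name
UnderscoreNormalizer and the method normalize are made up for the example,
and where exactly to apply it (for instance on the text or file name before
it is indexed) is an assumption.

```java
class UnderscoreNormalizer {
    // Hypothetical helper: swap every underscore for a space so that
    // "chocolate_cake" is seen as two separate words instead of one.
    static String normalize(String text) {
        return text.replace('_', ' ');
    }

    public static void main(String[] args) {
        // prints "chocolate cake.html"
        System.out.println(normalize("chocolate_cake.html"));
    }
}
```

With the underscore gone, "chocolate" and "cake" become separate words, so a
search for either one can match the page.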

Thanks anyway,
Marco

On 8/16/06, Marco Vanossi <[EMAIL PROTECTED]> wrote:

Hi,

 If you have a page named chocolate_cake.html and you search for
"chocolate", the page will not be found.
 Do you know a quick solution to make Nutch retrieve "chocolate_cake.html"
for a chocolate or cake search?

 I'm not very sure, but I think it is related to the word-analysis
section of the program. In Lucene there are several analyzers from which
you can choose before indexing pages.
 I hope that makes sense.

Thanks,
Marco

_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
