work.
regards,
Heiko Dietze
Nico Sabbi wrote:
Il giorno lun, 02/11/2009 alle 09.48 +0100, Zanzico Gioele ha scritto:
Zanzico Gioele
Senior Web Analyst
VitecGroup - Division / Unit
Tel +39 0424 07
Fax +39 0424 808999
www.vitecgroup.it http://www.vitecgroup.it/
P Respect
if this is the best place for such a change,
but it worked for me.
with best regards,
Heiko Dietze
Eugen Kochuev wrote:
Any information on this? I really need to limit nutch in indexing
(only textual formats, excluding css, javascript and other non human
oriented data)
Nutch is trying to crawl everything
meant that you should leave it out, yes.
Unfortunaly for the fetching of the pages this is not the solution, but
the index will be based only on the proper content. I think the index is
created with the parsed content.
with best regards,
Heiko Dietze