Re: could you unsubscribe me from this mailing list pls. tks

2009-11-02 Thread Heiko Dietze
work. regards, Heiko Dietze Nico Sabbi wrote: Il giorno lun, 02/11/2009 alle 09.48 +0100, Zanzico Gioele ha scritto: Zanzico Gioele Senior Web Analyst VitecGroup - Division / Unit Tel +39 0424 07 Fax +39 0424 808999 www.vitecgroup.it http://www.vitecgroup.it/ P Respect

Re: content-type crawling problem

2006-05-29 Thread Heiko Dietze
if this is the best place for such a change, but it worked for me. with best regards, Heiko Dietze Eugen Kochuev wrote: Any information on this? I really need to limit nutch in indexing (only textual formats, excluding css, javascript and other non human oriented data) Nutch is trying to crawl everything

Re: content-type crawling problem

2006-05-29 Thread Heiko Dietze
meant that you should leave it out, yes. Unfortunaly for the fetching of the pages this is not the solution, but the index will be based only on the proper content. I think the index is created with the parsed content. with best regards, Heiko Dietze