in the urlfilter there is a filter which exclude caracters like ? @... you hv to disable this line:
#-[...@=] thnx > From: e...@lakemeadonline.com > Subject: Dynamic Html Parsing > Date: Thu, 15 Oct 2009 13:00:37 -0700 > To: nutch-user@lucene.apache.org > > Is there a way to enable Dynamic Html parsing in Nutch using a plugin > or setting? > > Eric Osgood > --------------------------------------------- > Cal Poly - Computer Engineering, Moon Valley Software > --------------------------------------------- > eosg...@calpoly.edu, e...@lakemeadonline.com > --------------------------------------------- > www.calpoly.edu/~eosgood, www.lakemeadonline.com > _________________________________________________________________ New! Get to Messenger faster: Sign-in here now! http://go.microsoft.com/?linkid=9677407