Hi Michael,

Have you indexed the crawl segment? It's an easy step to forget : ) Also check crawl-tool.xml (if I remember the name right) and the URL filter rules that go with it, so that ASP pages aren't being filtered out. By default the Nutch crawler skips URLs that carry query parameters (committees.asp?viewPerson=Ji), so that could be an issue as well. Are there any errors or anything odd in the logs?
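
To illustrate the parameter point, here is a rough sketch in Python (not Nutch code) of how a regex URL filter of this kind decides. It assumes rules prefixed with + (accept) or - (reject), where the first pattern found in the URL wins and a URL matching no rule is dropped. The three rules are written from memory for illustration, and `accepts` is a made-up helper name, so compare against your own filter file rather than trusting these.

```python
import re

# Illustrative rules in the style of a regex URL filter file.
# Assumption: '+' accepts, '-' rejects, first matching rule decides.
# These are NOT copied from a real Nutch configuration.
RULES = [
    "-[?*!@=]",                             # skip URLs that look like queries
    "+^http://www\\.committemuse\\.com/",   # accept the site being crawled
    "-.",                                   # reject everything else
]


def accepts(url, rules=RULES):
    """Return True if the first rule whose pattern occurs in url is a '+' rule."""
    for rule in rules:
        sign, pattern = rule[0], rule[1:]
        if re.search(pattern, url):
            return sign == "+"
    return False  # no rule matched: treat the URL as rejected


if __name__ == "__main__":
    for url in (
        "http://www.committemuse.com/content/committees.asp",
        "http://www.committemuse.com/content/committees.asp?viewPerson=Ji",
    ):
        print(url, "->", "fetch" if accepts(url) else "skip")
```

With rules like these the plain committees.asp URL gets through, while the ?viewPerson=Ji variant is dropped by the first rule before the site rule is ever consulted. If your filter has a similar line, removing or narrowing it would be the thing to try.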

Fredrik

On 7/23/05, Feng (Michael) Ji <[EMAIL PROTECTED]> wrote:
> Hi there:
>
> I have a question about the crawling depth VS search
> result. I attached part of my log information;
>
> "
> 050722 181508 fetching http://www.committemuse.com/content/committees.asp
> :
> :
> 050722 181508 fetching
> :
> 050722 181508 status: segment 20050722181440, 100 pages, 4 errors, 1952888 bytes, 26204 ms
> "
>
> And I see segment in my tomcat box.
>
> But when I do search the specific word in that page,
> it return 0.
>
> Is that because the page is written in "asp"?
>
> thanks,
>
> Michael,

_______________________________________________
Nutch-developers mailing list
https://lists.sourceforge.net/lists/listinfo/nutch-developers
