Hi Michael.

Have you indexed the crawl/segment? Easy to forget sometimes :) Also,
check the crawler-tools.xml, or whatever it's called, to make sure ASP
pages aren't blocked. By default the Nutch crawler doesn't handle URLs
with query parameters (committees.asp?viewPerson=Ji), so that could
be an issue as well. Any errors or other odd entries in the logs?
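
To show what I mean by parameters being filtered out, here is a rough
sketch in Python, not Nutch's actual code. It assumes the URL filter works
as an ordered list of +/- regex rules where the first match decides, and
that one of the rules rejects URLs containing characters such as ? and =.
The RULES list and the accepts() function are made up for illustration;
check your own filter file for the real rules.

```python
import re

# Hypothetical filter rules in the style of a +/- regex URL filter file:
# each rule is a sign ('+' accept, '-' reject) plus a regex, and the
# first rule whose regex is found in the URL decides the outcome.
RULES = [
    ("-", re.compile(r"[?*!@=]")),  # assumed rule: reject probable query URLs
    ("+", re.compile(r".")),        # accept everything else
]

def accepts(url, rules=RULES):
    """Return True if the first matching rule is an accept rule."""
    for sign, pattern in rules:
        if pattern.search(url):
            return sign == "+"
    return False  # no rule matched, so the URL is rejected

base = "http://www.committemuse.com/content/committees.asp"
print(accepts(base))                     # plain page passes the filter
print(accepts(base + "?viewPerson=Ji"))  # query URL is rejected
```

If your filter has a rule like the first one, the plain committees.asp
page would be fetched but the ?viewPerson=... variants never would be.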

Fredrik

On 7/23/05, Feng (Michael) Ji <[EMAIL PROTECTED]> wrote:
> Hi there:
> 
> I have a question about crawling depth vs. search
> results. I have attached part of my log output:
> 
> "
> 050722 181508 fetching
> http://www.committemuse.com/content/committees.asp
> :
> :
> 050722 181508 fetching
> :
> 050722 181508 status: segment 20050722181440, 100
> pages, 4 errors, 1952888 bytes, 26204 ms
> "
> 
> And I see the segment on my Tomcat box.
> 
> But when I search for a specific word from that page,
> it returns 0 results.
> 
> Is that because the page is written in ASP?
> 
> thanks,
> 
> Michael,


_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers