I'm surprised that tomcat would lock the directory somehow... has anybody reproduced this problem? Is there a fix possible for it?
--- Cherian Thomas <[EMAIL PROTECTED]> wrote: > Hi, > On linux OS and using tomcat 5.0 we could get new > pages without server > restart. > On windows this problem persists because tomcat puts > a lock on the > directory where indexes are stored. > > -Cherian Thomas > > -----Original Message----- > From: bob knob [mailto:[EMAIL PROTECTED] > Sent: Tuesday, April 11, 2006 7:35 PM > To: [email protected] > Subject: Auto-crawling & re-crawling the web site > > Hi, > > I am currently evaluating Nutch for use on an > intranet > site search engine. I am by no means an expert in > this > field although I am trying to learn more about it. > > 1 I was reading one of the articles referenced on > the > nutch site: > > http://today.java.net/pub/a/today/2006/02/16/introduction-to-nutch-2.htm > l > > -and I was a little bit concerned about its warning > concerning "re-crawling" the site. I understand that > there are several steps of crawling, building the > index, etc., but it sounded to me like new pages on > my > web site would be ignored until I restarted the > Nutch > server even after I've re-crawled. Am I correct > about > this? How do most people deal with it? > > 2 It seems like I would want to re-crawl or re-index > the site on a nightly basis. All of this seems to be > done with shell scripts, and I wonder what options > are > available to someone working on a Windows platform. > I > could run cygrunsrv/cron on Windows I guess. Is > there > some reason more of this scripting couldn't be > redone > as a Java program? Also, has anybody considered > creating a Windows service to manage > indexing/crawling > like the one that manages the Tomcat web server? > > Thanks, > Bob > > __________________________________________________ > Do You Yahoo!? > Tired of spam? Yahoo! Mail has the best spam > protection around > http://mail.yahoo.com > __________________________________________________ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com ------------------------------------------------------- This SF.Net email is sponsored by xPML, a groundbreaking scripting language that extends applications into web and mobile media. Attend the live webcast and join the prime developer group breaking into this new coding territory! http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642 _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
