Thanks Martina, that works with cygwin, now I am getting this error when crawling an intranet, and the virtual memory of java gets up to 1gb and the crawl stops with no signs an no more errors than the following
Error parsing: http://intranet/pdf/ronda_pupo.pdf: failed(2,0): Can't be handled as pdf document. java.io.IOException: Error: expected the end of a dict ionary. The crawl stop there and no matter how long I wait, do nothing from there. Any ideas? Best regards, Rolando -----Original Message----- From: Koch Martina [mailto:[email protected]] Sent: Monday, January 26, 2009 3:21 AM To: [email protected] Subject: AW: Error in eclipse when crawl Hi Rolando, you have to install cygwin to be able to run Nutch in Windows. Have a look at this tutorial for more information: http://wiki.apache.org/nutch/GettingNutchRunningWithWindows Besides there are several posts on the mailing list regarding this error message. Kind regards, Martina -----Ursprüngliche Nachricht----- Von: Rolando Bermudez Peña [mailto:[email protected]] Gesendet: Montag, 26. Januar 2009 09:04 An: [email protected] Betreff: RE: Error in eclipse when crawl Yes, I am using windows, nutch can’t run in windows? -----Original Message----- From: Doğacan Güney [mailto:[email protected]] Sent: Monday, January 26, 2009 3:04 AM To: [email protected] Subject: Re: Error in eclipse when crawl Are you using windows? "whoami" is a unix program that returns the name of current user, nutch (actually hadoop) needs it for security purposes. On Mon, Jan 26, 2009 at 6:51 AM, Rolando Bermudez Peña <[email protected]> wrote: > Hello all, > I am new in Nutch, already create a new project in Eclipse 3.4 width the > latest nutch in trunk and got the following error in the java console when > trying to crawl some intranet site. > > Exception in thread "main" java.io.IOException: Failed to get the current > user's information. > at org.apache.hadoop.mapred.JobClient.getUGI(JobClient.java:717) > at > org.apache.hadoop.mapred.JobClient.configureCommandLineOptions(JobClient.java:592) > at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:774) > at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1127) > at org.apache.nutch.crawl.Injector.inject(Injector.java:160) > at org.apache.nutch.crawl.Crawl.main(Crawl.java:112) > Caused by: javax.security.auth.login.LoginException: Login failed: Cannot run > program "whoami": CreateProcess error=2, The system cannot find the file > specified > at > org.apache.hadoop.security.UnixUserGroupInformation.login(UnixUserGroupInformation.java:250) > at > org.apache.hadoop.security.UnixUserGroupInformation.login(UnixUserGroupInformation.java:275) > at org.apache.hadoop.mapred.JobClient.getUGI(JobClient.java:715) > ... 5 more > > Someone can help with any idea of what means 'whoami' or whats is the problem? > > Thanks in advance. > Rolando. > > -- Doğacan Güney
