Hi Alex, It appears to be linked to icu4j...
Is this in your /lib Lewis ________________________________________ From: McGibbney, Lewis John [[email protected]] Sent: 30 April 2011 19:25 To: [email protected] Subject: RE: Fetch fails due to CharsetDectector - help! Hi Alex, What version of Nutch are you using? By the looks of it, you are missing some jar file(s) which needs to be in your /lib directory. Although I have never seen this error so can't advise on exactly what it is that you are missing! ________________________________________ From: Alex [[email protected]] Sent: 30 April 2011 19:06 To: [email protected] Subject: Fetch fails due to CharsetDectector - help! When Nutch gets ready to fetch my site it fails and gives me this error. Does anyone know how to fix this? fetch of http://example.com/ failed with: java.lang.NoClassDefFoundError: com/ibm/icu/text/CharsetDetector Here is the log: 28/04/11 21:30:47:441 PDT] INFO fetcher.Fetcher: fetching http://example.com/ [28/04/11 21:30:47:461 PDT] INFO fetcher.Fetcher: -finishing thread FetcherThread, activeThreads=9 [28/04/11 21:30:47:462 PDT] INFO fetcher.Fetcher: -finishing thread FetcherThread, activeThreads=5 [28/04/11 21:30:47:465 PDT] INFO fetcher.Fetcher: -finishing thread FetcherThread, activeThreads=4 [28/04/11 21:30:47:467 PDT] INFO fetcher.Fetcher: -finishing thread FetcherThread, activeThreads=3 [28/04/11 21:30:47:461 PDT] INFO fetcher.Fetcher: -finishing thread FetcherThread, activeThreads=6 [28/04/11 21:30:47:461 PDT] INFO fetcher.Fetcher: -finishing thread FetcherThread, activeThreads=7 [28/04/11 21:30:47:461 PDT] INFO fetcher.Fetcher: -finishing thread FetcherThread, activeThreads=8 [28/04/11 21:30:47:469 PDT] INFO fetcher.Fetcher: -finishing thread FetcherThread, activeThreads=1 [28/04/11 21:30:47:468 PDT] INFO fetcher.Fetcher: -finishing thread FetcherThread, activeThreads=2 [28/04/11 21:30:48:694 PDT] INFO fetcher.Fetcher: -activeThreads=1, spinWaiting=0, fetchQueues.totalSize=0 [28/04/11 21:30:49:745 PDT] INFO fetcher.Fetcher: -activeThreads=1, spinWaiting=0, fetchQueues.totalSize=0 [28/04/11 21:30:50:137 PDT] INFO fetcher.Fetcher: fetch of http://example.com/ failed with: java.lang.NoClassDefFoundError: com/ibm/icu/text/ CharsetDetector [28/04/11 21:30:50:138 PDT] INFO fetcher.Fetcher: -finishing thread FetcherThread, activeThreads=0 [28/04/11 21:30:50:783 PDT] INFO fetcher.Fetcher: -activeThreads=0, spinWaiting=0, fetchQueues.totalSize=0 [28/04/11 21:30:50:783 PDT] INFO fetcher.Fetcher: -activeThreads=0 [28/04/11 21:30:51:080 PDT] INFO fetcher.Fetcher: Fetcher: done Email has been scanned for viruses by Altman Technologies' email management service - www.altman.co.uk/emailsystems Glasgow Caledonian University is a registered Scottish charity, number SC021474 Winner: Times Higher Education’s Widening Participation Initiative of the Year 2009 and Herald Society’s Education Initiative of the Year 2009. http://www.gcu.ac.uk/newsevents/news/bycategory/theuniversity/1/name,6219,en.html Winner: Times Higher Education’s Outstanding Support for Early Career Researchers of the Year 2010, GCU as a lead with Universities Scotland partners. http://www.gcu.ac.uk/newsevents/news/bycategory/theuniversity/1/name,15691,en.html Email has been scanned for viruses by Altman Technologies' email management service - www.altman.co.uk/emailsystems Glasgow Caledonian University is a registered Scottish charity, number SC021474 Winner: Times Higher Education’s Widening Participation Initiative of the Year 2009 and Herald Society’s Education Initiative of the Year 2009. http://www.gcu.ac.uk/newsevents/news/bycategory/theuniversity/1/name,6219,en.html Winner: Times Higher Education’s Outstanding Support for Early Career Researchers of the Year 2010, GCU as a lead with Universities Scotland partners. http://www.gcu.ac.uk/newsevents/news/bycategory/theuniversity/1/name,15691,en.html

