I'm using 0.7 from a few weeks ago.
I was fetching 204 456 pages,

Nutch -segread tells that there are: 
"segments\20050723140812 is corrupt, using only 207126 entries."

Here's what I have with ctrl-break:

Full thread dump Java HotSpot(TM) Client VM (1.4.2_08-b03 mixed mode):

"MultiThreadedHttpConnectionManager cleanup" daemon prio=5 tid=0x032605e8
nid=0x1034 in Object.wait() [3b0f000..3b0fd8c]
        at java.lang.Object.wait(Native Method)
        at java.lang.ref.ReferenceQueue.remove(ReferenceQueue.java:111)
        - locked <0x150622f0> (a java.lang.ref.ReferenceQueue$Lock)
        at
org.apache.commons.httpclient.MultiThreadedHttpConnectionManager$ReferenceQu
eueThread.run(MultiThreadedHttpConnectionManager.java:1100)

"fetcher8" prio=5 tid=0x03230888 nid=0x1d04 in Object.wait()
[354f000..354fd8c]
        at java.lang.Object.wait(Native Method)
        at
org.apache.commons.httpclient.MultiThreadedHttpConnectionManager.doGetConnec
tion(MultiThreadedHttpConnectionManager.java:509)
        - locked <0x15063090> (a
org.apache.commons.httpclient.MultiThreadedHttpConnectionManager$ConnectionP
ool)
        at
org.apache.commons.httpclient.MultiThreadedHttpConnectionManager.getConnecti
onWithTimeout(MultiThreadedHttpConnectionManager.java:394)
        at
org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMethodDir
ector.java:152)
        at
org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:393)
        at
org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:324)
        at
org.apache.nutch.protocol.httpclient.HttpResponse.<init>(HttpResponse.java:7
6)
        at
org.apache.nutch.protocol.httpclient.Http.getProtocolOutput(Http.java:213)
        at
org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:135)

"Signal Dispatcher" daemon prio=10 tid=0x009ff8a8 nid=0xfdc waiting on
condition [0..0]

"Finalizer" daemon prio=9 tid=0x009fcd20 nid=0x900 in Object.wait()
[2f7f000..2f7fd8c]
        at java.lang.Object.wait(Native Method)
        at java.lang.ref.ReferenceQueue.remove(ReferenceQueue.java:111)
        - locked <0x14ee5b08> (a java.lang.ref.ReferenceQueue$Lock)
        at java.lang.ref.ReferenceQueue.remove(ReferenceQueue.java:127)
        at java.lang.ref.Finalizer$FinalizerThread.run(Finalizer.java:159)

"Reference Handler" daemon prio=10 tid=0x009fb9a0 nid=0xeb4 in Object.wait()
[2f3f000..2f3fd8c]
        at java.lang.Object.wait(Native Method)
        at java.lang.Object.wait(Object.java:429)
        at java.lang.ref.Reference$ReferenceHandler.run(Reference.java:115)
        - locked <0x14ee5b70> (a java.lang.ref.Reference$Lock)

"main" prio=5 tid=0x000366e0 nid=0x10d0 waiting on condition [7f000..7fc38]
        at java.lang.Thread.sleep(Native Method)
        at org.apache.nutch.fetcher.Fetcher.run(Fetcher.java:342)
        at org.apache.nutch.fetcher.Fetcher.main(Fetcher.java:479)

"VM Thread" prio=5 tid=0x00a3b720 nid=0x1230 runnable

"VM Periodic Task Thread" prio=10 tid=0x00a3d360 nid=0xae0 waiting on
condition
"Suspend Checker Thread" prio=10 tid=0x009feeb0 nid=0x27c runnable





-------------------------------------------------------
SF.Net email is sponsored by: Discover Easy Linux Migration Strategies
from IBM. Find simple to follow Roadmaps, straightforward articles,
informative Webcasts and more! Get everything you need to get up to
speed, fast. http://ads.osdn.com/?ad_id=7477&alloc_id=16492&op=click
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to