I'm using 0.7 from a few weeks ago.
I was fetching 204 456 pages,
Nutch -segread tells that there are:
"segments\20050723140812 is corrupt, using only 207126 entries."
Here's what I have with ctrl-break:
Full thread dump Java HotSpot(TM) Client VM (1.4.2_08-b03 mixed mode):
"MultiThreadedHttpConnectionManager cleanup" daemon prio=5 tid=0x032605e8
nid=0x1034 in Object.wait() [3b0f000..3b0fd8c]
at java.lang.Object.wait(Native Method)
at java.lang.ref.ReferenceQueue.remove(ReferenceQueue.java:111)
- locked <0x150622f0> (a java.lang.ref.ReferenceQueue$Lock)
at
org.apache.commons.httpclient.MultiThreadedHttpConnectionManager$ReferenceQu
eueThread.run(MultiThreadedHttpConnectionManager.java:1100)
"fetcher8" prio=5 tid=0x03230888 nid=0x1d04 in Object.wait()
[354f000..354fd8c]
at java.lang.Object.wait(Native Method)
at
org.apache.commons.httpclient.MultiThreadedHttpConnectionManager.doGetConnec
tion(MultiThreadedHttpConnectionManager.java:509)
- locked <0x15063090> (a
org.apache.commons.httpclient.MultiThreadedHttpConnectionManager$ConnectionP
ool)
at
org.apache.commons.httpclient.MultiThreadedHttpConnectionManager.getConnecti
onWithTimeout(MultiThreadedHttpConnectionManager.java:394)
at
org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMethodDir
ector.java:152)
at
org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:393)
at
org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:324)
at
org.apache.nutch.protocol.httpclient.HttpResponse.<init>(HttpResponse.java:7
6)
at
org.apache.nutch.protocol.httpclient.Http.getProtocolOutput(Http.java:213)
at
org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:135)
"Signal Dispatcher" daemon prio=10 tid=0x009ff8a8 nid=0xfdc waiting on
condition [0..0]
"Finalizer" daemon prio=9 tid=0x009fcd20 nid=0x900 in Object.wait()
[2f7f000..2f7fd8c]
at java.lang.Object.wait(Native Method)
at java.lang.ref.ReferenceQueue.remove(ReferenceQueue.java:111)
- locked <0x14ee5b08> (a java.lang.ref.ReferenceQueue$Lock)
at java.lang.ref.ReferenceQueue.remove(ReferenceQueue.java:127)
at java.lang.ref.Finalizer$FinalizerThread.run(Finalizer.java:159)
"Reference Handler" daemon prio=10 tid=0x009fb9a0 nid=0xeb4 in Object.wait()
[2f3f000..2f3fd8c]
at java.lang.Object.wait(Native Method)
at java.lang.Object.wait(Object.java:429)
at java.lang.ref.Reference$ReferenceHandler.run(Reference.java:115)
- locked <0x14ee5b70> (a java.lang.ref.Reference$Lock)
"main" prio=5 tid=0x000366e0 nid=0x10d0 waiting on condition [7f000..7fc38]
at java.lang.Thread.sleep(Native Method)
at org.apache.nutch.fetcher.Fetcher.run(Fetcher.java:342)
at org.apache.nutch.fetcher.Fetcher.main(Fetcher.java:479)
"VM Thread" prio=5 tid=0x00a3b720 nid=0x1230 runnable
"VM Periodic Task Thread" prio=10 tid=0x00a3d360 nid=0xae0 waiting on
condition
"Suspend Checker Thread" prio=10 tid=0x009feeb0 nid=0x27c runnable
-------------------------------------------------------
SF.Net email is sponsored by: Discover Easy Linux Migration Strategies
from IBM. Find simple to follow Roadmaps, straightforward articles,
informative Webcasts and more! Get everything you need to get up to
speed, fast. http://ads.osdn.com/?ad_id=7477&alloc_id=16492&op=click
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers