[
https://issues.apache.org/jira/browse/NUTCH-1086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13192993#comment-13192993
]
Remi Tassing commented on NUTCH-1086:
-------------------------------------
With the quick-and-dirty code I wrote for NTLMv2 over HttpURLConnection, the
JVM intermittently crashes with the fatal error below. I believe it is caused
by the poor integration of my code with Nutch:
#
# A fatal error has been detected by the Java Runtime Environment:
#
# EXCEPTION_ACCESS_VIOLATION (0xc0000005) at pc=0x762135c8, pid=7320, tid=5720
#
# JRE version: 7.0-b147
# Java VM: Java HotSpot(TM) Client VM (21.0-b17 mixed mode windows-x86 )
# Problematic frame:
# C [Secur32.dll+0x35c8]
#
# Failed to write core dump. Minidumps are not enabled by default on client versions of Windows
#
# If you would like to submit a bug report, please visit:
# http://bugreport.sun.com/bugreport/crash.jsp
# The crash happened outside the Java Virtual Machine in native code.
# See problematic frame for where to report the bug.
#
--------------- T H R E A D ---------------
Current thread (0x4753a800): JavaThread "FetcherThread" daemon [_thread_in_native, id=5720, stack(0x48350000,0x483a0000)]
siginfo: ExceptionCode=0xc0000005, reading address 0x00000010
Registers:
EAX=0x00000000, EBX=0x00000000, ECX=0x4839f0cc, EDX=0x02bdafe8
ESP=0x4839f0c4, EBP=0x4839f0d4, ESI=0x002b0058, EDI=0x00000000
EIP=0x762135c8, EFLAGS=0x00010202
Top of Stack: (sp=0x4839f0c4)
0x4839f0c4: 4839f0cc 65a5014e 002b0058 02bdafe8
0x4839f0d4: 4839f0e4 6b62a15c 477c2d10 4753a928
0x4839f0e4: 4839f180 6b62a2b1 477c2d10 477c2d00
0x4839f0f4: 4753a800 437469b8 437469b8 052c98e8
0x4839f104: 4839f320 025aa595 4839f354 4839f1c4
0x4839f114: 4753a800 47657b90 47075798 470757d8
0x4839f124: 00000000 00000001 4839f130 00000200
0x4839f134: 00000002 4839f154 477c2d10 00000000
Instructions: (pc=0x762135c8)
0x762135a8: 00 e8 c2 f6 ff ff 8b f0 85 f6 74 1c 56 ff 35 54
0x762135b8: 10 22 76 ff 15 20 11 21 76 8b 46 60 8d 4d f8 51
0x762135c8: ff 50 10 5e c9 c2 04 00 b8 01 03 09 80 eb f4 90
0x762135d8: 90 90 90 90 8b ff 55 8b ec 51 51 8b 45 08 8b 08
Register to memory mapping:
EAX=0x00000000 is an unknown value
EBX=0x00000000 is an unknown value
ECX=0x4839f0cc is pointing into the stack for thread: 0x4753a800
EDX=0x02bdafe8 is an unknown value
ESP=0x4839f0c4 is pointing into the stack for thread: 0x4753a800
EBP=0x4839f0d4 is pointing into the stack for thread: 0x4753a800
ESI=0x002b0058 is an unknown value
EDI=0x00000000 is an unknown value
Stack: [0x48350000,0x483a0000], sp=0x4839f0c4, free space=316k
Native frames: (J=compiled Java code, j=interpreted, Vv=VM code, C=native code)
C [Secur32.dll+0x35c8] FreeCredentialsHandle+0x30
C [net.dll+0xa15c] Java_sun_net_www_protocol_http_ntlm_NTLMAuthSequence_getCredentialsHandle+0x180
C [net.dll+0xa2b1] Java_sun_net_www_protocol_http_ntlm_NTLMAuthSequence_getNextToken+0x137
Java frames: (J=compiled Java code, j=interpreted, Vv=VM code)
j sun.net.www.protocol.http.ntlm.NTLMAuthSequence.getNextToken(J[B)[B+0
j sun.net.www.protocol.http.ntlm.NTLMAuthSequence.getAuthHeader(Ljava/lang/String;)Ljava/lang/String;+24
j sun.net.www.protocol.http.ntlm.NTLMAuthentication.setHeaders(Lsun/net/www/protocol/http/HttpURLConnection;Lsun/net/www/HeaderParser;Ljava/lang/String;)Z+73
j sun.net.www.protocol.http.HttpURLConnection.getServerAuthentication(Lsun/net/www/protocol/http/AuthenticationHeader;)Lsun/net/www/protocol/http/AuthenticationInfo;+760
j sun.net.www.protocol.http.HttpURLConnection.getInputStream()Ljava/io/InputStream;+972
j sun.net.www.protocol.https.HttpsURLConnectionImpl.getInputStream()Ljava/io/InputStream;+4
j org.apache.nutch.protocol.httpclient.HttpResponse.<init>(Lorg/apache/nutch/protocol/httpclient/Http;Ljava/net/URL;Lorg/apache/nutch/crawl/CrawlDatum;Z)V+453
j org.apache.nutch.protocol.httpclient.Http.getResponse(Ljava/net/URL;Lorg/apache/nutch/crawl/CrawlDatum;Z)Lorg/apache/nutch/net/protocols/Response;+13
j org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(Lorg/apache/hadoop/io/Text;Lorg/apache/nutch/crawl/CrawlDatum;)Lorg/apache/nutch/protocol/ProtocolOutput;+283
j org.apache.nutch.fetcher.Fetcher$FetcherThread.run()V+646
v ~StubRoutines::call_stub
--------------- P R O C E S S ---------------
Java Threads: ( => current thread )
0x4708ac00 JavaThread "Thread-27" daemon [_thread_in_native, id=6032, stack(0x48cf0000,0x48d40000)]
0x4708a400 JavaThread "MultiThreadedHttpConnectionManager cleanup" daemon [_thread_blocked, id=4920, stack(0x48450000,0x484a0000)]
0x4708a000 JavaThread "FetcherThread" daemon [_thread_blocked, id=6244, stack(0x01210000,0x01260000)]
0x47089800 JavaThread "FetcherThread" daemon [_thread_blocked, id=7148, stack(0x483a0000,0x483f0000)]
=>0x4753a800 JavaThread "FetcherThread" daemon [_thread_in_native, id=5720, stack(0x48350000,0x483a0000)]
0x4753a000 JavaThread "FetcherThread" daemon [_thread_blocked, id=7808, stack(0x48200000,0x48250000)]
0x47539c00 JavaThread "FetcherThread" daemon [_thread_blocked, id=6348, stack(0x47300000,0x47350000)]
0x47539000 JavaThread "FetcherThread" daemon [_thread_blocked, id=4668, stack(0x47410000,0x47460000)]
0x470b7800 JavaThread "FetcherThread" daemon [_thread_blocked, id=4424, stack(0x480d0000,0x48120000)]
0x4764c000 JavaThread "FetcherThread" daemon [_thread_blocked, id=1600, stack(0x48140000,0x48190000)]
0x4764b800 JavaThread "FetcherThread" daemon [_thread_blocked, id=4476, stack(0x47b20000,0x47b70000)]
0x4764b400 JavaThread "FetcherThread" daemon [_thread_blocked, id=8000, stack(0x47350000,0x473a0000)]
0x4767ac00 JavaThread "SpillThread" daemon [_thread_blocked, id=5708, stack(0x47bd0000,0x47c20000)]
0x47689400 JavaThread "communication thread" daemon [_thread_blocked, id=4976, stack(0x47260000,0x472b0000)]
0x4711e800 JavaThread "Thread-11" [_thread_blocked, id=6608, stack(0x478d0000,0x47920000)]
0x47089000 JavaThread "Service Thread" daemon [_thread_blocked, id=3652, stack(0x00b30000,0x00b80000)]
0x4706c400 JavaThread "C1 CompilerThread0" daemon [_thread_blocked, id=5272, stack(0x473c0000,0x47410000)]
0x4706b000 JavaThread "Attach Listener" daemon [_thread_blocked, id=3568, stack(0x00b90000,0x00be0000)]
0x47069c00 JavaThread "Signal Dispatcher" daemon [_thread_blocked, id=6512, stack(0x472b0000,0x47300000)]
0x0240f000 JavaThread "Finalizer" daemon [_thread_blocked, id=4252, stack(0x01260000,0x012b0000)]
0x0240c800 JavaThread "Reference Handler" daemon [_thread_blocked, id=7492, stack(0x00aa0000,0x00af0000)]
0x00b1dc00 JavaThread "main" [_thread_blocked, id=2896, stack(0x00820000,0x00870000)]
Other Threads:
0x02407800 VMThread [stack: 0x46fc0000,0x47010000] [id=8048]
0x4709b000 WatcherThread [stack: 0x47460000,0x474b0000] [id=6908]
VM state:not at safepoint (normal execution)
VM Mutex/Monitor currently owned by a thread: None
Heap
 def new generation total 81664K, used 13803K [0x045a0000, 0x09e30000, 0x192f0000)
  eden space 72640K, 19% used [0x045a0000, 0x0531ada0, 0x08c90000)
  from space 9024K, 0% used [0x08c90000, 0x08c90000, 0x09560000)
  to space 9024K, 0% used [0x09560000, 0x09560000, 0x09e30000)
 tenured generation total 181236K, used 108739K [0x192f0000, 0x243ed000, 0x42da0000)
  the space 181236K, 59% used [0x192f0000, 0x1fd20ff8, 0x1fd21000, 0x243ed000)
 compacting perm gen total 12288K, used 10197K [0x42da0000, 0x439a0000, 0x46da0000)
  the space 12288K, 82% used [0x42da0000, 0x43795548, 0x43795600, 0x439a0000)
No shared spaces configured.
Code Cache [0x025a0000, 0x027c8000, 0x045a0000)
 total_blobs=1160 nmethods=977 adapters=115 free_code_cache=30587Kb largest_free_block=31319360
Dynamic libraries:
...
VM Arguments:
...
Launcher Type: SUN_STANDARD
Environment Variables:
...
--------------- S Y S T E M ---------------
...
elapsed time: 336 seconds
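
The native frames above show the crash inside FreeCredentialsHandle, reached from NTLMAuthSequence.getCredentialsHandle, while several FetcherThreads were alive in the same process. The log does not establish the root cause. As a rough sketch only, and assuming (unconfirmed) that concurrent entry into the JDK's native NTLM path from multiple fetcher threads plays a role, the connection call could be serialized behind one shared lock. NtlmCallGuard and runExclusively are hypothetical names for illustration; they are not part of Nutch or the JDK.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.locks.ReentrantLock;

/**
 * Hypothetical helper (not part of Nutch or the JDK): runs an action while
 * holding a single process-wide lock, so that only one thread at a time
 * executes the guarded code.
 */
final class NtlmCallGuard {
    // ASSUMPTION: the native NTLM code is unsafe to enter from several
    // threads at once. The crash log alone does not prove this.
    private static final ReentrantLock LOCK = new ReentrantLock();

    private NtlmCallGuard() {
    }

    /** Runs the action exclusively and returns its result. */
    static <T> T runExclusively(Callable<T> action) throws Exception {
        LOCK.lock();
        try {
            return action.call();
        } finally {
            // Always release, even if the action throws.
            LOCK.unlock();
        }
    }
}
```

In the fetch path this would wrap the call that triggers authentication, for example NtlmCallGuard.runExclusively(conn::getInputStream) on Java 8 or later. The trade-off is that authenticated fetches no longer run in parallel, so this is at best a diagnostic aid or stopgap until the plugin is rewritten.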
> Rewrite protocol-httpclient
> ---------------------------
>
> Key: NUTCH-1086
> URL: https://issues.apache.org/jira/browse/NUTCH-1086
> Project: Nutch
> Issue Type: Improvement
> Components: fetcher
> Reporter: Markus Jelsma
>
> There are several issues about protocol-httpclient and several comments about
> rewriting the plugin with the new http client libraries. There is, however,
> not yet an issue for rewriting/reimplementing protocol-httpclient.
> http://hc.apache.org/httpcomponents-client-ga/