[ https://issues.apache.org/jira/browse/NUTCH-1086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13192993#comment-13192993 ]
Remi Tassing commented on NUTCH-1086: ------------------------------------- With the dirty code I wrote on NTLMv2 and HttpUrlConnection, I'm having the following Java error from time to time. I believe it's due to the poor integration of my code with Nutch: # # A fatal error has been detected by the Java Runtime Environment: # # EXCEPTION_ACCESS_VIOLATION (0xc0000005) at pc=0x762135c8, pid=7320, tid=5720 # # JRE version: 7.0-b147 # Java VM: Java HotSpot(TM) Client VM (21.0-b17 mixed mode windows-x86 ) # Problematic frame: # C [Secur32.dll+0x35c8] # # Failed to write core dump. Minidumps are not enabled by default on client versions of Windows # # If you would like to submit a bug report, please visit: # http://bugreport.sun.com/bugreport/crash.jsp # The crash happened outside the Java Virtual Machine in native code. # See problematic frame for where to report the bug. # --------------- T H R E A D --------------- Current thread (0x4753a800): JavaThread "FetcherThread" daemon [_thread_in_native, id=5720, stack(0x48350000,0x483a0000)] siginfo: ExceptionCode=0xc0000005, reading address 0x00000010 Registers: EAX=0x00000000, EBX=0x00000000, ECX=0x4839f0cc, EDX=0x02bdafe8 ESP=0x4839f0c4, EBP=0x4839f0d4, ESI=0x002b0058, EDI=0x00000000 EIP=0x762135c8, EFLAGS=0x00010202 Top of Stack: (sp=0x4839f0c4) 0x4839f0c4: 4839f0cc 65a5014e 002b0058 02bdafe8 0x4839f0d4: 4839f0e4 6b62a15c 477c2d10 4753a928 0x4839f0e4: 4839f180 6b62a2b1 477c2d10 477c2d00 0x4839f0f4: 4753a800 437469b8 437469b8 052c98e8 0x4839f104: 4839f320 025aa595 4839f354 4839f1c4 0x4839f114: 4753a800 47657b90 47075798 470757d8 0x4839f124: 00000000 00000001 4839f130 00000200 0x4839f134: 00000002 4839f154 477c2d10 00000000 Instructions: (pc=0x762135c8) 0x762135a8: 00 e8 c2 f6 ff ff 8b f0 85 f6 74 1c 56 ff 35 54 0x762135b8: 10 22 76 ff 15 20 11 21 76 8b 46 60 8d 4d f8 51 0x762135c8: ff 50 10 5e c9 c2 04 00 b8 01 03 09 80 eb f4 90 0x762135d8: 90 90 90 90 8b ff 55 8b ec 51 51 8b 45 08 8b 08 Register to memory mapping: EAX=0x00000000 is an unknown value EBX=0x00000000 is an unknown value ECX=0x4839f0cc is pointing into the stack for thread: 0x4753a800 EDX=0x02bdafe8 is an unknown value ESP=0x4839f0c4 is pointing into the stack for thread: 0x4753a800 EBP=0x4839f0d4 is pointing into the stack for thread: 0x4753a800 ESI=0x002b0058 is an unknown value EDI=0x00000000 is an unknown value Stack: [0x48350000,0x483a0000], sp=0x4839f0c4, free space=316k Native frames: (J=compiled Java code, j=interpreted, Vv=VM code, C=native code) C [Secur32.dll+0x35c8] FreeCredentialsHandle+0x30 C [net.dll+0xa15c] Java_sun_net_www_protocol_http_ntlm_NTLMAuthSequence_getCredentialsHandle+0x180 C [net.dll+0xa2b1] Java_sun_net_www_protocol_http_ntlm_NTLMAuthSequence_getNextToken+0x137 Java frames: (J=compiled Java code, j=interpreted, Vv=VM code) j sun.net.www.protocol.http.ntlm.NTLMAuthSequence.getNextToken(J[B)[B+0 j sun.net.www.protocol.http.ntlm.NTLMAuthSequence.getAuthHeader(Ljava/lang/String;)Ljava/lang/String;+24 j sun.net.www.protocol.http.ntlm.NTLMAuthentication.setHeaders(Lsun/net/www/protocol/http/HttpURLConnection;Lsun/net/www/HeaderParser;Ljava/lang/String;)Z+73 j sun.net.www.protocol.http.HttpURLConnection.getServerAuthentication(Lsun/net/www/protocol/http/AuthenticationHeader;)Lsun/net/www/protocol/http/AuthenticationInfo;+760 j sun.net.www.protocol.http.HttpURLConnection.getInputStream()Ljava/io/InputStream;+972 j sun.net.www.protocol.https.HttpsURLConnectionImpl.getInputStream()Ljava/io/InputStream;+4 j org.apache.nutch.protocol.httpclient.HttpResponse.<init>(Lorg/apache/nutch/protocol/httpclient/Http;Ljava/net/URL;Lorg/apache/nutch/crawl/CrawlDatum;Z)V+453 j org.apache.nutch.protocol.httpclient.Http.getResponse(Ljava/net/URL;Lorg/apache/nutch/crawl/CrawlDatum;Z)Lorg/apache/nutch/net/protocols/Response;+13 j org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(Lorg/apache/hadoop/io/Text;Lorg/apache/nutch/crawl/CrawlDatum;)Lorg/apache/nutch/protocol/ProtocolOutput;+283 j org.apache.nutch.fetcher.Fetcher$FetcherThread.run()V+646 v ~StubRoutines::call_stub --------------- P R O C E S S --------------- Java Threads: ( => current thread ) 0x4708ac00 JavaThread "Thread-27" daemon [_thread_in_native, id=6032, stack(0x48cf0000,0x48d40000)] 0x4708a400 JavaThread "MultiThreadedHttpConnectionManager cleanup" daemon [_thread_blocked, id=4920, stack(0x48450000,0x484a0000)] 0x4708a000 JavaThread "FetcherThread" daemon [_thread_blocked, id=6244, stack(0x01210000,0x01260000)] 0x47089800 JavaThread "FetcherThread" daemon [_thread_blocked, id=7148, stack(0x483a0000,0x483f0000)] =>0x4753a800 JavaThread "FetcherThread" daemon [_thread_in_native, id=5720, stack(0x48350000,0x483a0000)] 0x4753a000 JavaThread "FetcherThread" daemon [_thread_blocked, id=7808, stack(0x48200000,0x48250000)] 0x47539c00 JavaThread "FetcherThread" daemon [_thread_blocked, id=6348, stack(0x47300000,0x47350000)] 0x47539000 JavaThread "FetcherThread" daemon [_thread_blocked, id=4668, stack(0x47410000,0x47460000)] 0x470b7800 JavaThread "FetcherThread" daemon [_thread_blocked, id=4424, stack(0x480d0000,0x48120000)] 0x4764c000 JavaThread "FetcherThread" daemon [_thread_blocked, id=1600, stack(0x48140000,0x48190000)] 0x4764b800 JavaThread "FetcherThread" daemon [_thread_blocked, id=4476, stack(0x47b20000,0x47b70000)] 0x4764b400 JavaThread "FetcherThread" daemon [_thread_blocked, id=8000, stack(0x47350000,0x473a0000)] 0x4767ac00 JavaThread "SpillThread" daemon [_thread_blocked, id=5708, stack(0x47bd0000,0x47c20000)] 0x47689400 JavaThread "communication thread" daemon [_thread_blocked, id=4976, stack(0x47260000,0x472b0000)] 0x4711e800 JavaThread "Thread-11" [_thread_blocked, id=6608, stack(0x478d0000,0x47920000)] 0x47089000 JavaThread "Service Thread" daemon [_thread_blocked, id=3652, stack(0x00b30000,0x00b80000)] 0x4706c400 JavaThread "C1 CompilerThread0" daemon [_thread_blocked, id=5272, stack(0x473c0000,0x47410000)] 0x4706b000 JavaThread "Attach Listener" daemon [_thread_blocked, id=3568, stack(0x00b90000,0x00be0000)] 0x47069c00 JavaThread "Signal Dispatcher" daemon [_thread_blocked, id=6512, stack(0x472b0000,0x47300000)] 0x0240f000 JavaThread "Finalizer" daemon [_thread_blocked, id=4252, stack(0x01260000,0x012b0000)] 0x0240c800 JavaThread "Reference Handler" daemon [_thread_blocked, id=7492, stack(0x00aa0000,0x00af0000)] 0x00b1dc00 JavaThread "main" [_thread_blocked, id=2896, stack(0x00820000,0x00870000)] Other Threads: 0x02407800 VMThread [stack: 0x46fc0000,0x47010000] [id=8048] 0x4709b000 WatcherThread [stack: 0x47460000,0x474b0000] [id=6908] VM state:not at safepoint (normal execution) VM Mutex/Monitor currently owned by a thread: None Heap def new generation total 81664K, used 13803K [0x045a0000, 0x09e30000, 0x192f0000) eden space 72640K, 19% used [0x045a0000, 0x0531ada0, 0x08c90000) from space 9024K, 0% used [0x08c90000, 0x08c90000, 0x09560000) to space 9024K, 0% used [0x09560000, 0x09560000, 0x09e30000) tenured generation total 181236K, used 108739K [0x192f0000, 0x243ed000, 0x42da0000) the space 181236K, 59% used [0x192f0000, 0x1fd20ff8, 0x1fd21000, 0x243ed000) compacting perm gen total 12288K, used 10197K [0x42da0000, 0x439a0000, 0x46da0000) the space 12288K, 82% used [0x42da0000, 0x43795548, 0x43795600, 0x439a0000) No shared spaces configured. Code Cache [0x025a0000, 0x027c8000, 0x045a0000) total_blobs=1160 nmethods=977 adapters=115 free_code_cache=30587Kb largest_free_block=31319360 Dynamic libraries: ... VM Arguments: ... Launcher Type: SUN_STANDARD Environment Variables: ... --------------- S Y S T E M --------------- ... elapsed time: 336 seconds > Rewrite protocol-httpclient > --------------------------- > > Key: NUTCH-1086 > URL: https://issues.apache.org/jira/browse/NUTCH-1086 > Project: Nutch > Issue Type: Improvement > Components: fetcher > Reporter: Markus Jelsma > > There are several issues about protocol-httpclient and several comments about > rewriting the plugin with the new http client libraries. There is, however, > not yet an issue for rewriting/reimplementing protocol-httpclient. > http://hc.apache.org/httpcomponents-client-ga/ -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira