[ 
https://issues.apache.org/jira/browse/HBASE-23684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17014582#comment-17014582
 ] 

Michael Stack commented on HBASE-23684:
---------------------------------------

Here is another instance:

{code}
 2020-01-13 18:43:12,603 INFO org.apache.hadoop.hbase.wal.WALSplitter: 
Splitting 
WAL=hdfs://nameservice1/hbase/genie/WALs/hbasedn006.example.org,16020,1578934813898-splitting/hbasedn006.example.org%2C16020%2C1578934813898.1578940951781,
 size=130.6 M    (136916443 bytes)
 2020-01-13 18:43:12,605 INFO org.apache.hadoop.hbase.util.FSHDFSUtils: Recover 
lease on dfs file 
hdfs://nameservice1/hbase/genie/WALs/hbasedn006.example.org,16020,1578934813898-splitting/hbasedn006.example.org%2C16020%2C1578934813898.1578940951781
 2020-01-13 18:43:12,608 INFO org.apache.hadoop.hbase.util.FSHDFSUtils: 
Recovered lease, attempt=0 on 
file=hdfs://nameservice1/hbase/genie/WALs/hbasedn006.example.org,16020,1578934813898-splitting/hbasedn006.example.org%2C16020%2C1578934813898.
         1578940951781 after 3ms
 2020-01-13 18:43:12,613 INFO org.apache.hadoop.hbase.wal.WALSplitter: Open 
WAL=hdfs://nameservice1/hbase/genie/WALs/hbasedn006.example.org,16020,1578934813898-splitting/hbasedn006.example.org%2C16020%2C1578934813898.1578940951781
 cost 10 ms
 2020-01-13 18:43:12,619 INFO org.apache.zookeeper.ZooKeeper: Initiating client 
connection, 
connectString=hbasezk002.example.org:2181,hbasezk001.example.org:2181,hbasezk004.example.org:2181,hbasezk003.example.org:2181,hbasezk005.example.org:2181
        sessionTimeout=60000 
watcher=org.apache.hadoop.hbase.zookeeper.ReadOnlyZKClient$$Lambda$180/0x00000008005df440@164265a5
 2020-01-13 18:43:12,624 INFO org.apache.zookeeper.ClientCnxn: Opening socket 
connection to server hbasezk003.example.org/17.121.236.6:2181. Will not attempt 
to authenticate using SASL (unknown error)
 2020-01-13 18:43:12,625 INFO org.apache.zookeeper.ClientCnxn: Socket 
connection established to hbasezk003.example.org/17.121.236.6:2181, initiating 
session
 2020-01-13 18:43:12,626 INFO org.apache.zookeeper.ClientCnxn: Session 
establishment complete on server hbasezk003.example.org/17.121.236.6:2181, 
sessionid = 0x46dd0ba61659ec7, negotiated timeout = 60000
 2020-01-13 18:43:12,856 INFO org.apache.hadoop.hbase.wal.OutputSink: 3 split 
writer threads finished
 2020-01-13 18:43:12,890 INFO org.apache.hadoop.hbase.wal.WALSplitter: 
Processed 391 edits across 0 regions cost 250 ms; edits skipped=409; 
WAL=hdfs://nameservice1/hbase/genie/WALs/hbasedn006.example.org,16020,1578934813898-splitting/hbasedn006.
        example.org%2C16020%2C1578934813898.1578940951781, size=130.6 M, 
length=136916443, corrupted=false, progress failed=true
 2020-01-13 18:43:12,890 WARN 
org.apache.hadoop.hbase.regionserver.SplitLogWorker: log splitting of 
WALs/hbasedn006.example.org,16020,1578934813898-splitting/hbasedn006.example.org%2C16020%2C1578934813898.1578940951781
 failed, returning error
 java.io.IOException: java.lang.NullPointerException
         at 
org.apache.hadoop.hbase.wal.BoundedRecoveredHFilesOutputSink.writeRemainingEntryBuffers(BoundedRecoveredHFilesOutputSink.java:173)
         at 
org.apache.hadoop.hbase.wal.BoundedRecoveredHFilesOutputSink.close(BoundedRecoveredHFilesOutputSink.java:140)
         at 
org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:339)
         at 
org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:181)
         at 
org.apache.hadoop.hbase.regionserver.SplitLogWorker.splitLog(SplitLogWorker.java:105)
         at 
org.apache.hadoop.hbase.regionserver.SplitLogWorker.lambda$new$0(SplitLogWorker.java:84)
         at 
org.apache.hadoop.hbase.regionserver.handler.WALSplitterHandler.process(WALSplitterHandler.java:70)
         at 
org.apache.hadoop.hbase.executor.EventHandler.run(EventHandler.java:104)
         at 
java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
         at 
java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
         at java.base/java.lang.Thread.run(Thread.java:834)
 Caused by: java.lang.NullPointerException
         at 
org.apache.hadoop.hbase.wal.BoundedRecoveredHFilesOutputSink.configContextForNonMetaWriter(BoundedRecoveredHFilesOutputSink.java:225)
         at 
org.apache.hadoop.hbase.wal.BoundedRecoveredHFilesOutputSink.createRecoveredHFileWriter(BoundedRecoveredHFilesOutputSink.java:213)
         at 
org.apache.hadoop.hbase.wal.BoundedRecoveredHFilesOutputSink.append(BoundedRecoveredHFilesOutputSink.java:117)
         at 
org.apache.hadoop.hbase.wal.BoundedRecoveredHFilesOutputSink.lambda$writeRemainingEntryBuffers$3(BoundedRecoveredHFilesOutputSink.java:155)
         at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
         at 
java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515)
         at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
         ... 3 more
{code}

> NPE HFilesOutputSink
> --------------------
>
>                 Key: HBASE-23684
>                 URL: https://issues.apache.org/jira/browse/HBASE-23684
>             Project: HBase
>          Issue Type: Bug
>          Components: wal
>    Affects Versions: 2.3.0
>            Reporter: Michael Stack
>            Priority: Major
>
> Ran into this after enabling hfile splitter:
> {code}
>  2020-01-13 17:37:08,204 INFO org.apache.hadoop.hbase.wal.OutputSink: 3 split 
> writer threads finished
>  2020-01-13 17:37:08,233 INFO org.apache.hadoop.hbase.wal.WALSplitter: 
> Processed 1007 edits across 0 regions cost 284 ms; edits skipped=76; 
> WAL=hdfs://nameservice1/hbase/genie/WALs/hbasedn101.example.org,16020,1578934806382-splitting/hbasedn101.example.org%2C16020%2C1578934806382.1578937008832,
>  size=128.5 M, length=134708720, corrupted=false, progress         failed=true
>  2020-01-13 17:37:08,234 WARN 
> org.apache.hadoop.hbase.regionserver.SplitLogWorker: log splitting of 
> WALs/hbasedn101.example.org,16020,1578934806382-splitting/hbasedn101.example.org%2C16020%2C1578934806382.1578937008832
>  failed, returning error
>  java.io.IOException: java.lang.NullPointerException
>          at 
> org.apache.hadoop.hbase.wal.BoundedRecoveredHFilesOutputSink.writeRemainingEntryBuffers(BoundedRecoveredHFilesOutputSink.java:173)
>          at 
> org.apache.hadoop.hbase.wal.BoundedRecoveredHFilesOutputSink.close(BoundedRecoveredHFilesOutputSink.java:140)
>          at 
> org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:339)
>          at 
> org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:181)
>          at 
> org.apache.hadoop.hbase.regionserver.SplitLogWorker.splitLog(SplitLogWorker.java:105)
>          at 
> org.apache.hadoop.hbase.regionserver.SplitLogWorker.lambda$new$0(SplitLogWorker.java:84)
>          at 
> org.apache.hadoop.hbase.regionserver.handler.WALSplitterHandler.process(WALSplitterHandler.java:70)
>          at 
> org.apache.hadoop.hbase.executor.EventHandler.run(EventHandler.java:104)
>          at 
> java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
>          at 
> java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
>          at java.base/java.lang.Thread.run(Thread.java:834)
>  Caused by: java.lang.NullPointerException
>          at 
> org.apache.hadoop.hbase.wal.BoundedRecoveredHFilesOutputSink.configContextForNonMetaWriter(BoundedRecoveredHFilesOutputSink.java:225)
>          at 
> org.apache.hadoop.hbase.wal.BoundedRecoveredHFilesOutputSink.createRecoveredHFileWriter(BoundedRecoveredHFilesOutputSink.java:213)
>          at 
> org.apache.hadoop.hbase.wal.BoundedRecoveredHFilesOutputSink.append(BoundedRecoveredHFilesOutputSink.java:117)
>          at 
> org.apache.hadoop.hbase.wal.BoundedRecoveredHFilesOutputSink.lambda$writeRemainingEntryBuffers$3(BoundedRecoveredHFilesOutputSink.java:155)
>          at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
>          at 
> java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515)
>          at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
> {code}
> It is a bit odd because log says there were zero regions. Not sure what that 
> was about.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to