Pratyush Bhatt created HDDS-8561:
------------------------------------
Summary: [Hbase-Ozone] RS Aborting after "Open file cannot be
renamed" Error
Key: HDDS-8561
URL: https://issues.apache.org/jira/browse/HDDS-8561
Project: Apache Ozone
Issue Type: Bug
Components: Ozone Manager
Affects Versions: 1.4.0
Reporter: Pratyush Bhatt
Region server Aborted after WAL renaming failed with:
RENAME_OPEN_FILE org.apache.hadoop.ozone.om.exceptions.OMException: Open file
cannot be renamed
hbase.regionserver.walroll.archive.retries was set to 10.
{noformat}
2023-05-05 22:16:42,080 ERROR
org.apache.hadoop.hbase.regionserver.wal.AbstractFSWAL: Failed log archiving
for the log
ofs://ozone1/vol1/bucket1/hbase4/WALs/ozone-new22-1.ozone-new22.root.hwx.site,22101,1683321374304/ozone-new22-1.ozone-new22.root.hwx.site%2C22101%2C1683321374304.ozone-new22-1.ozone-new22.root.hwx.site%2C22101%2C1683321374304.regiongroup-0.1683321395004,
RENAME_OPEN_FILE org.apache.hadoop.ozone.om.exceptions.OMException: Open file
cannot be renamed
at
org.apache.hadoop.ozone.om.protocolPB.OzoneManagerProtocolClientSideTranslatorPB.handleError(OzoneManagerProtocolClientSideTranslatorPB.java:711)
at
org.apache.hadoop.ozone.om.protocolPB.OzoneManagerProtocolClientSideTranslatorPB.renameKey(OzoneManagerProtocolClientSideTranslatorPB.java:882)
at
org.apache.hadoop.ozone.client.rpc.RpcClient.renameKey(RpcClient.java:1503)
at
org.apache.hadoop.ozone.client.OzoneBucket.renameKey(OzoneBucket.java:611)
at
org.apache.hadoop.fs.ozone.BasicRootedOzoneClientAdapterImpl.rename(BasicRootedOzoneClientAdapterImpl.java:481)
at
org.apache.hadoop.fs.ozone.BasicRootedOzoneFileSystem.renameFSO(BasicRootedOzoneFileSystem.java:446)
at
org.apache.hadoop.fs.ozone.BasicRootedOzoneFileSystem.rename(BasicRootedOzoneFileSystem.java:359)
at
org.apache.hadoop.hbase.util.CommonFSUtils.renameAndSetModifyTime(CommonFSUtils.java:711)
at
org.apache.hadoop.hbase.regionserver.wal.AbstractFSWAL.archiveLogFile(AbstractFSWAL.java:741)
at
org.apache.hadoop.hbase.regionserver.wal.AbstractFSWAL.archive(AbstractFSWAL.java:705)
at
org.apache.hadoop.hbase.regionserver.wal.AbstractFSWAL.lambda$cleanOldLogs$1(AbstractFSWAL.java:694)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
2023-05-05 22:16:42,085 ERROR
org.apache.hadoop.hbase.regionserver.HRegionServer: ***** ABORTING region
server ozone-new22-1.ozone-new22.root.hwx.site,22101,1683321374304: Failed log
archiving *****
RENAME_OPEN_FILE org.apache.hadoop.ozone.om.exceptions.OMException: Open file
cannot be renamed
at
org.apache.hadoop.ozone.om.protocolPB.OzoneManagerProtocolClientSideTranslatorPB.handleError(OzoneManagerProtocolClientSideTranslatorPB.java:711)
at
org.apache.hadoop.ozone.om.protocolPB.OzoneManagerProtocolClientSideTranslatorPB.renameKey(OzoneManagerProtocolClientSideTranslatorPB.java:882)
at
org.apache.hadoop.ozone.client.rpc.RpcClient.renameKey(RpcClient.java:1503)
at
org.apache.hadoop.ozone.client.OzoneBucket.renameKey(OzoneBucket.java:611)
at
org.apache.hadoop.fs.ozone.BasicRootedOzoneClientAdapterImpl.rename(BasicRootedOzoneClientAdapterImpl.java:481)
at
org.apache.hadoop.fs.ozone.BasicRootedOzoneFileSystem.renameFSO(BasicRootedOzoneFileSystem.java:446)
at
org.apache.hadoop.fs.ozone.BasicRootedOzoneFileSystem.rename(BasicRootedOzoneFileSystem.java:359)
at
org.apache.hadoop.hbase.util.CommonFSUtils.renameAndSetModifyTime(CommonFSUtils.java:711)
at
org.apache.hadoop.hbase.regionserver.wal.AbstractFSWAL.archiveLogFile(AbstractFSWAL.java:741)
at
org.apache.hadoop.hbase.regionserver.wal.AbstractFSWAL.archive(AbstractFSWAL.java:705)
at
org.apache.hadoop.hbase.regionserver.wal.AbstractFSWAL.lambda$cleanOldLogs$1(AbstractFSWAL.java:694)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
2023-05-05 22:16:42,085 ERROR
org.apache.hadoop.hbase.regionserver.HRegionServer: RegionServer abort: loaded
coprocessors are:
[org.apache.ranger.authorization.hbase.RangerAuthorizationCoprocessor,
org.apache.hadoop.hbase.security.token.TokenProvider,
org.apache.hadoop.hbase.security.access.SecureBulkLoadEndpoint]{noformat}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]