[ 
https://issues.apache.org/jira/browse/HDDS-14414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18067535#comment-18067535
 ] 

Arafat Khan commented on HDDS-14414:
------------------------------------

Hi [~adoroszlai], I've been investigating the intermittent "Rocks Database is 
closed" error. I initially tried to fix it by adding a lock at the 
manager level (ReconNamespaceSummaryManagerImpl), but that didn't work because 
the task controller closes the underlying database before that 
lock is even reached. The recent parallelization commits made the tasks run 
faster, which changed the timing and made this race condition between API reads 
and the database swap occur more often. I have a few ideas for a proper fix, 
but I don't have the bandwidth to try them out for the next few days. 
Could we please hold off on reverting those commits for a few days so that I 
have time to implement a fix?
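
To illustrate the race between API reads and the database swap, here is a 
minimal sketch of one common way to guard a swappable handle with a read-write 
lock. It is not the fix planned for this issue and does not use Recon's actual 
code: Store, FakeStore and SwappableStore are hypothetical names made up for 
this example. The point is that a lock only helps if the code path that closes 
the old handle takes the same lock as the readers.

```java
import java.util.concurrent.locks.ReentrantReadWriteLock;

/** Hypothetical stand-in for a closable database handle; not an Ozone class. */
interface Store {
  String get(String key);

  void close();
}

/** Toy store that throws once closed, mimicking the "database is closed" failure. */
final class FakeStore implements Store {
  private final String name;
  private volatile boolean closed;

  FakeStore(String name) {
    this.name = name;
  }

  @Override
  public String get(String key) {
    if (closed) {
      throw new IllegalStateException("store " + name + " is closed");
    }
    return name + ":" + key;
  }

  @Override
  public void close() {
    closed = true;
  }

  boolean isClosed() {
    return closed;
  }
}

/** Holder that routes both reads and the swap through one read-write lock. */
final class SwappableStore {
  private final ReentrantReadWriteLock lock = new ReentrantReadWriteLock();
  private Store current;

  SwappableStore(Store initial) {
    this.current = initial;
  }

  /** Readers share the read lock, so the handle cannot be swapped out mid-read. */
  String get(String key) {
    lock.readLock().lock();
    try {
      return current.get(key);
    } finally {
      lock.readLock().unlock();
    }
  }

  /** The swap waits for in-flight reads, installs the new handle, then closes the old one. */
  void swap(Store replacement) {
    Store old;
    lock.writeLock().lock();
    try {
      old = current;
      current = replacement;
    } finally {
      lock.writeLock().unlock();
    }
    // Safe to close here: every earlier reader has released the read lock,
    // and every later reader sees the replacement.
    old.close();
  }
}
```

With this shape, a read that starts before the swap completes against the old 
handle, and a read that starts after it sees the new one. Neither can observe 
a closed handle, provided nothing else closes the handle outside the holder.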

> Intermittent Rocks Database is closed in ReconNamespaceSummaryManagerImpl 
> with legacy bucket
> --------------------------------------------------------------------------------------------
>
>                 Key: HDDS-14414
>                 URL: https://issues.apache.org/jira/browse/HDDS-14414
>             Project: Apache Ozone
>          Issue Type: Bug
>          Components: Ozone Recon
>    Affects Versions: 2.2.0
>            Reporter: Attila Doroszlai
>            Assignee: Arafat Khan
>            Priority: Blocker
>
> {code}
> Check Disk Usage api access                                           | FAIL |
> '500' does not contain '200'
> --
> Recon-Nssummary :: Smoke test for Recon Namespace Summary Endpoint... | FAIL |
> 16 tests, 15 passed, 1 failed
> {code}
> Internal server error root cause:
> {code}
> Caused by: org.apache.hadoop.hdds.utils.db.RocksDatabaseException: Rocks Database is closed
>       at org.apache.hadoop.hdds.utils.db.RocksDatabase.acquire(RocksDatabase.java:456)
>       at org.apache.hadoop.hdds.utils.db.RocksDatabase.get(RocksDatabase.java:694)
>       at org.apache.hadoop.hdds.utils.db.RDBTable.get(RDBTable.java:122)
>       at org.apache.hadoop.hdds.utils.db.TypedTable.getFromTable(TypedTable.java:304)
>       at org.apache.hadoop.hdds.utils.db.TypedTable.get(TypedTable.java:215)
>       at org.apache.hadoop.ozone.recon.spi.impl.ReconNamespaceSummaryManagerImpl.getNSSummary(ReconNamespaceSummaryManagerImpl.java:98)
>       at org.apache.hadoop.ozone.recon.api.handlers.EntityHandler.getTotalDirCount(EntityHandler.java:224)
>       at org.apache.hadoop.ozone.recon.api.handlers.RootEntityHandler.getSummaryResponse(RootEntityHandler.java:67)
>       at org.apache.hadoop.ozone.recon.api.NSSummaryEndpoint.getBasicInfo(NSSummaryEndpoint.java:93)
> {code}
> - https://github.com/adoroszlai/ozone-build-results/tree/master/2025/12/10/45042/acceptance-unsecure
> - https://github.com/adoroszlai/ozone-build-results/tree/master/2025/12/27/45523/acceptance-unsecure
> - https://github.com/adoroszlai/ozone-build-results/tree/master/2026/01/12/45810/acceptance-unsecure
> - https://github.com/adoroszlai/ozone-build-results/tree/master/2026/01/12/45820/acceptance-unsecure
> - https://github.com/adoroszlai/ozone-build-results/tree/master/2026/01/13/45841/acceptance-unsecure



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
