[GitHub] milleruntime opened a new pull request #393: Add TableOfflineException to TableOps.addSplits
milleruntime opened a new pull request #393: Add TableOfflineException to TableOps.addSplits URL: https://github.com/apache/accumulo/pull/393 Also revert commit bcd2dc51 which was hiding this error This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org With regards, Apache Git Services
[jira] [Resolved] (ACCUMULO-4809) Session manager clean up can happen when lock held.
[ https://issues.apache.org/jira/browse/ACCUMULO-4809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Keith Turner resolved ACCUMULO-4809. Resolution: Fixed > Session manager clean up can happen when lock held. > --- > > Key: ACCUMULO-4809 > URL: https://issues.apache.org/jira/browse/ACCUMULO-4809 > Project: Accumulo > Issue Type: Bug >Affects Versions: 1.7.3, 1.8.1 >Reporter: Keith Turner >Priority: Critical > Labels: pull-request-available > Fix For: 1.7.4, 1.9.0, 2.0.0 > > Time Spent: 1h 10m > Remaining Estimate: 0h > > While working on [PR #382|https://github.com/apache/accumulo/pull/382] for > ACCUMULO-4782 I noticed a significant concurrency bug. Before #382 their was > a single lock for the session manager. The session manager will clean up idle > sessions. This clean up should happen outside the session manager lock, > because all tserver read/write operation use the session manger so it should > be responsive. > The bug is the following. > * Both getActiveScansPerTable() and getActiveScans() lock the session > manager and then lock idleSessions. See [SessionManager line > 233|https://github.com/apache/accumulo/blob/rel/1.7.3/server/tserver/src/main/java/org/apache/accumulo/tserver/session/SessionManager.java#L233] > > * The sweep() method locks idleSessions and does cleanup while this lock is > held. [See SessionManager > 200|https://github.com/apache/accumulo/blob/rel/1.7.3/server/tserver/src/main/java/org/apache/accumulo/tserver/session/SessionManager.java#L200] > > Therefore it is possible for getActiveScansPerTable() or getActiveScans() to > lock the session manager and then block trying to lock idleSessions while > cleanup is happening in sweep(). This will block all access to the session > manager while cleanup happens. > The changes in #382 will fix this for 1.9.0 and 2.0.0. However I Am not sure > about backporting #382 to 1.7. A more targeted fix could be made for 1.7 or > #382 could be backported. > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Assigned] (ACCUMULO-4809) Session manager clean up can happen when lock held.
[ https://issues.apache.org/jira/browse/ACCUMULO-4809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Keith Turner reassigned ACCUMULO-4809: -- Assignee: Keith Turner > Session manager clean up can happen when lock held. > --- > > Key: ACCUMULO-4809 > URL: https://issues.apache.org/jira/browse/ACCUMULO-4809 > Project: Accumulo > Issue Type: Bug >Affects Versions: 1.7.3, 1.8.1 >Reporter: Keith Turner >Assignee: Keith Turner >Priority: Critical > Labels: pull-request-available > Fix For: 1.7.4, 1.9.0, 2.0.0 > > Time Spent: 1h 10m > Remaining Estimate: 0h > > While working on [PR #382|https://github.com/apache/accumulo/pull/382] for > ACCUMULO-4782 I noticed a significant concurrency bug. Before #382 their was > a single lock for the session manager. The session manager will clean up idle > sessions. This clean up should happen outside the session manager lock, > because all tserver read/write operation use the session manger so it should > be responsive. > The bug is the following. > * Both getActiveScansPerTable() and getActiveScans() lock the session > manager and then lock idleSessions. See [SessionManager line > 233|https://github.com/apache/accumulo/blob/rel/1.7.3/server/tserver/src/main/java/org/apache/accumulo/tserver/session/SessionManager.java#L233] > > * The sweep() method locks idleSessions and does cleanup while this lock is > held. [See SessionManager > 200|https://github.com/apache/accumulo/blob/rel/1.7.3/server/tserver/src/main/java/org/apache/accumulo/tserver/session/SessionManager.java#L200] > > Therefore it is possible for getActiveScansPerTable() or getActiveScans() to > lock the session manager and then block trying to lock idleSessions while > cleanup is happening in sweep(). This will block all access to the session > manager while cleanup happens. > The changes in #382 will fix this for 1.9.0 and 2.0.0. However I Am not sure > about backporting #382 to 1.7. A more targeted fix could be made for 1.7 or > #382 could be backported. > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ACCUMULO-4835) Client should throw TableNotFoundException
Michael Miller created ACCUMULO-4835: Summary: Client should throw TableNotFoundException Key: ACCUMULO-4835 URL: https://issues.apache.org/jira/browse/ACCUMULO-4835 Project: Accumulo Issue Type: Bug Components: client Affects Versions: 1.7.4, 1.9.0 Reporter: Michael Miller I saw this misleading exception throw while running randomwalk during the concurrent OfflineTable node: Caused by: org.apache.accumulo.core.client.AccumuloException: Unexpected table state g0 DELETING != ONLINE at org.apache.accumulo.core.client.impl.TableOperationsImpl.waitForTableStateTransition(TableOperationsImpl.java:1072) at org.apache.accumulo.core.client.impl.TableOperationsImpl.online(TableOperationsImpl.java:1240) at org.apache.accumulo.test.randomwalk.concurrent.OfflineTable.visit(OfflineTable.java:47) The table was in the process of being deleted so a TableNotFoundException would be more helpful. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (ACCUMULO-4832) Seeing warnings when write ahead log changes.
[ https://issues.apache.org/jira/browse/ACCUMULO-4832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16380533#comment-16380533 ] Keith Turner commented on ACCUMULO-4832: I ran continuous ingest for 1 hour on 8 nodes with this change (commit 3df4acc7f3662d16eb752fdf46cdfd6285f54512) and saw no warning. I took the instance I had setup for 1.7.4-RC0 testing and replaced all of the jars. > Seeing warnings when write ahead log changes. > - > > Key: ACCUMULO-4832 > URL: https://issues.apache.org/jira/browse/ACCUMULO-4832 > Project: Accumulo > Issue Type: Bug >Reporter: Keith Turner >Assignee: Ivan Bella >Priority: Blocker > Labels: pull-request-available > Fix For: 1.7.4, 1.9.0, 2.0.0 > > Time Spent: 2h 50m > Remaining Estimate: 0h > > While running continuous ingest against 1.7.4-rc0 I saw a lot of warning like > the followng. > {noformat} > 2018-02-26 17:51:58,189 [log.TabletServerLogger] WARN : Logs closed while > writing, retrying attempt 1 (suppressing retry messages for 18ms) > 2018-02-26 17:51:58,724 [log.TabletServerLogger] WARN : Logs closed while > writing, retrying attempt 1 (suppressing retry messages for 18ms) > 2018-02-26 17:51:58,940 [log.TabletServerLogger] WARN : Logs closed while > writing, retrying attempt 1 (suppressing retry messages for 18ms) > 2018-02-26 17:51:59,226 [log.TabletServerLogger] WARN : Logs closed while > writing, retrying attempt 1 (suppressing retry messages for 18ms) > 2018-02-26 17:51:59,227 [log.TabletServerLogger] WARN : Logs closed while > writing, retrying attempt 1 (suppressing retry messages for 18ms) > 2018-02-26 17:51:59,227 [log.TabletServerLogger] WARN : Logs closed while > writing, retrying attempt 1 (suppressing retry messages for 18ms) > {noformat} > > The warning are generated by [TabletServerLogger.java line > 341|https://github.com/apache/accumulo/blob/4e91215f101362ef206e9f213b4d8d12b3f6e0e2/server/tserver/src/main/java/org/apache/accumulo/tserver/log/TabletServerLogger.java#L341] > when a write ahead log is closed. Write ahead logs are closed as part of > normal operations as seen on [TabletServerLogger.java line > 386|https://github.com/apache/accumulo/blob/4e91215f101362ef206e9f213b4d8d12b3f6e0e2/server/tserver/src/main/java/org/apache/accumulo/tserver/log/TabletServerLogger.java#L386]. > There should not be a warning when this happens. This is caused by changes > made for ACCUMULO-4777. Before these changes this event was logged at debug. > At this time, these changes have not been released. It would be nice to fix > this before releasing 1.7.4 -- This message was sent by Atlassian JIRA (v7.6.3#76005)