[jira] [Commented] (HBASE-21743) stateless assignment

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16749406#comment-16749406 ] Sergey Shelukhin commented on HBASE-21743: -- We've been running a master snapshot. Indeed, we

[jira] [Commented] (HBASE-21742) master can create bad procedures during abort, making entire cluster unusable

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16749377#comment-16749377 ] Sergey Shelukhin commented on HBASE-21742: -- Well, there are other shutdown activities... rather

[jira] [Commented] (HBASE-21575) memstore above high watermark message is logged too much

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16749343#comment-16749343 ] Sergey Shelukhin commented on HBASE-21575: -- Committed to branch-2 and branch-1 > memstore

[jira] [Updated] (HBASE-21575) memstore above high watermark message is logged too much

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21575: - Fix Version/s: 2.2.0 1.5.0 > memstore above high watermark message

[jira] [Resolved] (HBASE-21576) master should proactively reassign meta when killing a RS with it

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin resolved HBASE-21576. -- Resolution: Not A Problem > master should proactively reassign meta when killing a RS

[jira] [Commented] (HBASE-21576) master should proactively reassign meta when killing a RS with it

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16749312#comment-16749312 ] Sergey Shelukhin commented on HBASE-21576: -- I filed a separate bug somewhere, RS aborting due

[jira] [Commented] (HBASE-21744) timeout for server list refresh calls

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21744?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16749309#comment-16749309 ] Sergey Shelukhin commented on HBASE-21744: -- If the refresh is requested immediately (e.g. due

[jira] [Comment Edited] (HBASE-21626) log the regions blocking WAL from being archived

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16749276#comment-16749276 ] Sergey Shelukhin edited comment on HBASE-21626 at 1/22/19 11:23 PM:

[jira] [Resolved] (HBASE-21626) log the regions blocking WAL from being archived

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin resolved HBASE-21626. -- Resolution: Fixed Committed to master. Thanks for the review! > log the regions

[jira] [Commented] (HBASE-21720) metric to measure how actions are distributed to servers within a MultiAction

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16749271#comment-16749271 ] Sergey Shelukhin commented on HBASE-21720: -- +1 > metric to measure how actions are distributed

[jira] [Updated] (HBASE-21742) master can create bad procedures during abort, making entire cluster unusable

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21742: - Status: Patch Available (was: Open) > master can create bad procedures during abort,

[jira] [Comment Edited] (HBASE-21742) master can create bad procedures during abort, making entire cluster unusable

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16749246#comment-16749246 ] Sergey Shelukhin edited comment on HBASE-21742 at 1/22/19 11:00 PM:

[jira] [Commented] (HBASE-21742) master can create bad procedures during abort, making entire cluster unusable

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16749246#comment-16749246 ] Sergey Shelukhin commented on HBASE-21742: -- Attempt at a simple fix... shutting down procedure

[jira] [Updated] (HBASE-21742) master can create bad procedures during abort, making entire cluster unusable

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21742: - Attachment: HBASE-21742.patch > master can create bad procedures during abort, making

[jira] [Assigned] (HBASE-21742) master can create bad procedures during abort, making entire cluster unusable

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin reassigned HBASE-21742: Assignee: Sergey Shelukhin > master can create bad procedures during abort,

[jira] [Updated] (HBASE-21744) timeout for server list refresh calls

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21744: - Attachment: HBASE-21744.patch > timeout for server list refresh calls >

[jira] [Comment Edited] (HBASE-21742) master can create bad procedures during abort, making entire cluster unusable

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16749199#comment-16749199 ] Sergey Shelukhin edited comment on HBASE-21742 at 1/22/19 10:08 PM:

[jira] [Updated] (HBASE-21742) master can create bad procedures during abort, making entire cluster unusable

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21742: - Description: Some small HDFS hiccup causes master and meta RS to fail together. Master

[jira] [Updated] (HBASE-21742) master can create bad procedures during abort, making entire cluster unusable

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21742: - Description: Some small HDFS hiccup causes master and meta RS to fail together. Master

[jira] [Commented] (HBASE-21742) master can create bad procedures during abort, making entire cluster unusable

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16749199#comment-16749199 ] Sergey Shelukhin commented on HBASE-21742: -- This is on master. The problem is not that SCP gave

[jira] [Comment Edited] (HBASE-21742) master can create bad procedures during abort, making entire cluster unusable

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16749199#comment-16749199 ] Sergey Shelukhin edited comment on HBASE-21742 at 1/22/19 10:06 PM:

[jira] [Updated] (HBASE-21742) master can create bad procedures during abort, making entire cluster unusable

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21742: - Affects Version/s: 3.0.0 > master can create bad procedures during abort, making entire

[jira] [Updated] (HBASE-21744) timeout for server list refresh calls

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21744: - Status: Patch Available (was: Open) [~xucang] does this patch make sense to you? >

[jira] [Assigned] (HBASE-21744) timeout for server list refresh calls

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin reassigned HBASE-21744: Assignee: Sergey Shelukhin > timeout for server list refresh calls >

[jira] [Updated] (HBASE-21744) timeout for server list refresh calls

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21744: - Affects Version/s: 3.0.0 > timeout for server list refresh calls >

[jira] [Updated] (HBASE-21759) master can balance regions onto a known dead server

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21759: - Description: {noformat} 2019-01-18 09:42:45,664 INFO [PEWorker-1]

[jira] [Updated] (HBASE-21757) retrying to close a region incorrectly resets its RIT age metric

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21757: - Description: We have a region stuck in RIT forever -due to some other bug that I will

[jira] [Updated] (HBASE-21757) retrying to close a region incorrectly resets its RIT age metric

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21757: - Description: We have a region stuck in RIT forever due to some other bug -that I will

[jira] [Updated] (HBASE-21759) master can balance regions onto a known dead server

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21759: - Description: {noformat} 2019-01-18 09:42:45,664 INFO [PEWorker-1]

[jira] [Created] (HBASE-21759) master can balance regions onto a known dead server

2019-01-22 Thread Sergey Shelukhin (JIRA)
Sergey Shelukhin created HBASE-21759: Summary: master can balance regions onto a known dead server Key: HBASE-21759 URL: https://issues.apache.org/jira/browse/HBASE-21759 Project: HBase

[jira] [Updated] (HBASE-21759) master can balance regions onto a known dead server

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21759: - Description: {noformat} 2019-01-18 09:42:45,664 INFO [PEWorker-1]

[jira] [Created] (HBASE-21757) retrying to close a region incorrectly resets its RIT age metric

2019-01-22 Thread Sergey Shelukhin (JIRA)
Sergey Shelukhin created HBASE-21757: Summary: retrying to close a region incorrectly resets its RIT age metric Key: HBASE-21757 URL: https://issues.apache.org/jira/browse/HBASE-21757 Project:

[jira] [Updated] (HBASE-21757) retrying to close a region incorrectly resets its RIT age metric

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21757: - Description: We have a region stuck in RIT forever due to some other bug that I will

[jira] [Updated] (HBASE-21757) retrying to close a region incorrectly resets its RIT age metric

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21757: - Attachment: screenshot-1.png > retrying to close a region incorrectly resets its RIT

[jira] [Updated] (HBASE-21757) retrying to close a region incorrectly resets its RIT age metric

2019-01-22 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21757: - Attachment: (was: image-2019-01-22-11-00-37-578.png) > retrying to close a region

[jira] [Commented] (HBASE-21626) log the regions blocking WAL from being archived

2019-01-18 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16746874#comment-16746874 ] Sergey Shelukhin commented on HBASE-21626: -- Still need to test on the cluster here with the

[jira] [Commented] (HBASE-21575) memstore above high watermark message is logged too much

2019-01-18 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16746859#comment-16746859 ] Sergey Shelukhin commented on HBASE-21575: -- Sure I'll port it, probably next week > memstore

[jira] [Commented] (HBASE-21744) timeout for server list refresh calls

2019-01-18 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21744?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16746855#comment-16746855 ] Sergey Shelukhin commented on HBASE-21744: -- We are running a snapshot of master around December

[jira] [Updated] (HBASE-21744) timeout for server list refresh calls

2019-01-18 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21744: - Description: Not sure why yet, but we are seeing the case when cluster is in overall a

[jira] [Created] (HBASE-21744) timeout for server list refresh calls

2019-01-18 Thread Sergey Shelukhin (JIRA)
Sergey Shelukhin created HBASE-21744: Summary: timeout for server list refresh calls Key: HBASE-21744 URL: https://issues.apache.org/jira/browse/HBASE-21744 Project: HBase Issue Type:

[jira] [Updated] (HBASE-21743) stateless assignment

2019-01-18 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21743: - Description: Running HBase for only a few weeks we found dozen(s?) of bugs with

[jira] [Created] (HBASE-21743) stateless assignment

2019-01-18 Thread Sergey Shelukhin (JIRA)
Sergey Shelukhin created HBASE-21743: Summary: stateless assignment Key: HBASE-21743 URL: https://issues.apache.org/jira/browse/HBASE-21743 Project: HBase Issue Type: Bug

[jira] [Created] (HBASE-21742) master can create bad procedures during abort, making entire cluster unusable

2019-01-18 Thread Sergey Shelukhin (JIRA)
Sergey Shelukhin created HBASE-21742: Summary: master can create bad procedures during abort, making entire cluster unusable Key: HBASE-21742 URL: https://issues.apache.org/jira/browse/HBASE-21742

[jira] [Comment Edited] (HBASE-21034) Add new throttle type: read/write capacity unit

2019-01-17 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16745427#comment-16745427 ] Sergey Shelukhin edited comment on HBASE-21034 at 1/17/19 7:42 PM: ---

[jira] [Comment Edited] (HBASE-21034) Add new throttle type: read/write capacity unit

2019-01-17 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16745427#comment-16745427 ] Sergey Shelukhin edited comment on HBASE-21034 at 1/17/19 7:44 PM: ---

[jira] [Comment Edited] (HBASE-21034) Add new throttle type: read/write capacity unit

2019-01-17 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16745427#comment-16745427 ] Sergey Shelukhin edited comment on HBASE-21034 at 1/17/19 7:44 PM: ---

[jira] [Comment Edited] (HBASE-21034) Add new throttle type: read/write capacity unit

2019-01-17 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16745427#comment-16745427 ] Sergey Shelukhin edited comment on HBASE-21034 at 1/17/19 7:44 PM: ---

[jira] [Commented] (HBASE-21034) Add new throttle type: read/write capacity unit

2019-01-17 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16745427#comment-16745427 ] Sergey Shelukhin commented on HBASE-21034: -- branch-2.1 is tracking a dot release though.

[jira] [Updated] (HBASE-21626) log the regions blocking WAL from being archived

2019-01-16 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21626: - Attachment: HBASE-21626.ADDENDUM.patch > log the regions blocking WAL from being

[jira] [Commented] (HBASE-21626) log the regions blocking WAL from being archived

2019-01-16 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16744382#comment-16744382 ] Sergey Shelukhin commented on HBASE-21626: -- [~stack] looks like the initial condition was

[jira] [Reopened] (HBASE-21626) log the regions blocking WAL from being archived

2019-01-16 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin reopened HBASE-21626: -- There's an issue in this patch that causes the message to never be logged. > log the

[jira] [Commented] (HBASE-21564) race condition in WAL rolling resulting in size-based rolling getting stuck

2019-01-16 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16744359#comment-16744359 ] Sergey Shelukhin commented on HBASE-21564: -- [~Apache9] ping? I've addressed your CR feedback >

[jira] [Commented] (HBASE-21627) race condition between a recovered RIT for meta replica, and master startup

2019-01-15 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21627?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16743424#comment-16743424 ] Sergey Shelukhin commented on HBASE-21627: -- Update: we've seen another instance of this or

[jira] [Commented] (HBASE-21712) Make submit-patch.py python3 compatible

2019-01-15 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16743410#comment-16743410 ] Sergey Shelukhin commented on HBASE-21712: -- +1 on the addendum > Make submit-patch.py python3

[jira] [Updated] (HBASE-21626) log the regions blocking WAL from being archived

2019-01-14 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21626: - Resolution: Fixed Fix Version/s: 3.0.0 Status: Resolved (was: Patch

[jira] [Updated] (HBASE-21712) Make submit-patch.py python3 compatible

2019-01-14 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21712: - Resolution: Fixed Status: Resolved (was: Patch Available) Committed to master.

[jira] [Commented] (HBASE-21712) Make submit-patch.py python3 compatible

2019-01-14 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16742412#comment-16742412 ] Sergey Shelukhin commented on HBASE-21712: -- Looks like pylint warnings are not caused by this

[jira] [Commented] (HBASE-21625) a runnable procedure v2 does not run

2019-01-08 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16737695#comment-16737695 ] Sergey Shelukhin commented on HBASE-21625: -- Hmm, the remote call didn't come back for 19 hours

[jira] [Comment Edited] (HBASE-21614) RIT recovery with ServerCrashProcedure doesn't account for all regions

2019-01-08 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16737692#comment-16737692 ] Sergey Shelukhin edited comment on HBASE-21614 at 1/9/19 12:44 AM: ---

[jira] [Comment Edited] (HBASE-21614) RIT recovery with ServerCrashProcedure doesn't account for all regions

2019-01-08 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16737692#comment-16737692 ] Sergey Shelukhin edited comment on HBASE-21614 at 1/9/19 12:43 AM: ---

[jira] [Commented] (HBASE-21614) RIT recovery with ServerCrashProcedure doesn't account for all regions

2019-01-08 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16737692#comment-16737692 ] Sergey Shelukhin commented on HBASE-21614: -- [~Apache9] After master restart the region can be

[jira] [Commented] (HBASE-21601) corrupted WAL is not handled in all places (NegativeArraySizeException)

2019-01-08 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16737564#comment-16737564 ] Sergey Shelukhin commented on HBASE-21601: -- Looks like we might need to look closer at the

[jira] [Comment Edited] (HBASE-21601) corrupted WAL is not handled in all places (NegativeArraySizeException)

2019-01-08 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16737564#comment-16737564 ] Sergey Shelukhin edited comment on HBASE-21601 at 1/8/19 9:57 PM: --

[jira] [Commented] (HBASE-21601) corrupted WAL is not handled in all places (NegativeArraySizeException)

2019-01-08 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16737549#comment-16737549 ] Sergey Shelukhin commented on HBASE-21601: -- As far as I see, skipErrors is only applied to

[jira] [Updated] (HBASE-21626) log the regions blocking WAL from being archived

2019-01-08 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21626: - Attachment: (was: HBASE-21626.02.patch) > log the regions blocking WAL from being

[jira] [Updated] (HBASE-21626) log the regions blocking WAL from being archived

2019-01-08 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21626: - Attachment: HBASE-21626.02.patch > log the regions blocking WAL from being archived >

[jira] [Commented] (HBASE-21626) log the regions blocking WAL from being archived

2019-01-08 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16737544#comment-16737544 ] Sergey Shelukhin commented on HBASE-21626: -- [~stack] changed the config name. The sequence ID

[jira] [Updated] (HBASE-21626) log the regions blocking WAL from being archived

2019-01-08 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21626: - Attachment: HBASE-21626.02.patch > log the regions blocking WAL from being archived >

[jira] [Commented] (HBASE-21577) do not close regions when RS is dying due to a broken WAL

2019-01-08 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16737472#comment-16737472 ] Sergey Shelukhin commented on HBASE-21577: -- The issue happens mostly with other regions,

[jira] [Updated] (HBASE-21614) RIT recovery with ServerCrashProcedure doesn't account for all regions

2018-12-21 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21614: - Attachment: (was: HBASE-21614.master.001.patch) > RIT recovery with

[jira] [Updated] (HBASE-21614) RIT recovery with ServerCrashProcedure doesn't account for all regions

2018-12-21 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21614: - Attachment: HBASE-21614.01.patch > RIT recovery with ServerCrashProcedure doesn't

[jira] [Updated] (HBASE-21614) RIT recovery with ServerCrashProcedure doesn't account for all regions

2018-12-21 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21614: - Attachment: (was: HBASE-21614.master.002.patch) > RIT recovery with

[jira] [Commented] (HBASE-21614) RIT recovery with ServerCrashProcedure doesn't account for all regions

2018-12-21 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16727041#comment-16727041 ] Sergey Shelukhin commented on HBASE-21614: -- Hmm, a wrong patch was attached to this JIRA > RIT

[jira] [Updated] (HBASE-21626) log the regions blocking WAL from being archived

2018-12-21 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21626: - Attachment: HBASE-21626.01.patch > log the regions blocking WAL from being archived >

[jira] [Updated] (HBASE-21623) ServerCrashProcedure can stomp on a RIT for a wrong server

2018-12-20 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21623: - Summary: ServerCrashProcedure can stomp on a RIT for a wrong server (was:

[jira] [Created] (HBASE-21627) race condition between a recovered RIT for meta replica, and master startup

2018-12-20 Thread Sergey Shelukhin (JIRA)
Sergey Shelukhin created HBASE-21627: Summary: race condition between a recovered RIT for meta replica, and master startup Key: HBASE-21627 URL: https://issues.apache.org/jira/browse/HBASE-21627

[jira] [Updated] (HBASE-21624) master startup should not wait on assigning meta replicas

2018-12-20 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21624: - Description: Due to some other bug, a meta replica is stuck in transition forever.

[jira] [Updated] (HBASE-21624) master startup should not wait (or die) on assigning meta replicas

2018-12-20 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21624: - Summary: master startup should not wait (or die) on assigning meta replicas (was:

[jira] [Created] (HBASE-21626) log the regions blocking WAL from being archived

2018-12-20 Thread Sergey Shelukhin (JIRA)
Sergey Shelukhin created HBASE-21626: Summary: log the regions blocking WAL from being archived Key: HBASE-21626 URL: https://issues.apache.org/jira/browse/HBASE-21626 Project: HBase

[jira] [Updated] (HBASE-21626) log the regions blocking WAL from being archived

2018-12-20 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21626: - Status: Patch Available (was: Open) > log the regions blocking WAL from being archived

[jira] [Updated] (HBASE-21626) log the regions blocking WAL from being archived

2018-12-20 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21626: - Attachment: HBASE-21626.patch > log the regions blocking WAL from being archived >

[jira] [Comment Edited] (HBASE-21625) a runnable procedure v2 does not run

2018-12-20 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16726213#comment-16726213 ] Sergey Shelukhin edited comment on HBASE-21625 at 12/20/18 8:54 PM:

[jira] [Commented] (HBASE-21625) a runnable procedure v2 does not run

2018-12-20 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16726213#comment-16726213 ] Sergey Shelukhin commented on HBASE-21625: -- Looked a little bit at scheduling... that is much

[jira] [Updated] (HBASE-21625) a runnable procedure v2 does not run

2018-12-20 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21625: - Description: This is on master snapshot as of a few weeks ago. Haven't looked at the

[jira] [Updated] (HBASE-21625) a runnable procedure v2 does not run

2018-12-20 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21625: - Description: This is on master snapshot as of a few weeks ago. Haven't looked at the

[jira] [Updated] (HBASE-21625) a runnable procedure v2 does not run

2018-12-20 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21625: - Description: This is on master as of a few weeks ago. Haven't looked at the code much

[jira] [Updated] (HBASE-21625) a runnable procedure v2 does not run

2018-12-20 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21625: - Description: This is on master snapshot as of a few weeks ago. Haven't looked at the

[jira] [Commented] (HBASE-21625) a runnable procedure v2 does not run

2018-12-20 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16726193#comment-16726193 ] Sergey Shelukhin commented on HBASE-21625: -- cc [~mbertozzi] > a runnable procedure v2 does not

[jira] [Updated] (HBASE-21625) a runnable procedure v2 does not run

2018-12-20 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21625: - Description: This is on master as of a few weeks ago. Haven't looked at the code much

[jira] [Created] (HBASE-21625) a runnable procedure v2 does not run

2018-12-20 Thread Sergey Shelukhin (JIRA)
Sergey Shelukhin created HBASE-21625: Summary: a runnable procedure v2 does not run Key: HBASE-21625 URL: https://issues.apache.org/jira/browse/HBASE-21625 Project: HBase Issue Type: Bug

[jira] [Created] (HBASE-21624) master startup should not sleep on assigning meta replicas

2018-12-20 Thread Sergey Shelukhin (JIRA)
Sergey Shelukhin created HBASE-21624: Summary: master startup should not sleep on assigning meta replicas Key: HBASE-21624 URL: https://issues.apache.org/jira/browse/HBASE-21624 Project: HBase

[jira] [Updated] (HBASE-21624) master startup should not wait on assigning meta replicas

2018-12-20 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21624: - Summary: master startup should not wait on assigning meta replicas (was: master

[jira] [Commented] (HBASE-21614) RIT recovery with ServerCrashProcedure doesn't account for all regions

2018-12-20 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16726163#comment-16726163 ] Sergey Shelukhin commented on HBASE-21614: -- Cannot repro either test failure locally, and they

[jira] [Updated] (HBASE-21614) RIT recovery with ServerCrashProcedure doesn't account for all regions

2018-12-20 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21614: - Attachment: HBASE-21614.master.002.patch > RIT recovery with ServerCrashProcedure

[jira] [Updated] (HBASE-21623) ServerCrashProcedure can stomp on a RIT for the wrong server

2018-12-19 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21623: - Status: Patch Available (was: Open) [~Apache9] [~stack] can you take a look? a small

[jira] [Updated] (HBASE-21623) ServerCrashProcedure can stomp on a RIT for the wrong server

2018-12-19 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21623: - Attachment: HBASE-21623.patch > ServerCrashProcedure can stomp on a RIT for the wrong

[jira] [Assigned] (HBASE-21623) ServerCrashProcedure can stomp on a RIT for the wrong server

2018-12-19 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin reassigned HBASE-21623: Assignee: Sergey Shelukhin > ServerCrashProcedure can stomp on a RIT for the

[jira] [Updated] (HBASE-21623) ServerCrashProcedure can stomp on a RIT for the wrong server

2018-12-19 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21623: - Description: A server died while some region was being opened on it; eventually the

[jira] [Updated] (HBASE-21623) ServerCrashProcedure can stomp on a RIT for the wrong server

2018-12-19 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21623: - Description: A server died while some region was being opened on it; eventually the

[jira] [Updated] (HBASE-21623) ServerCrashProcedure can stomp on a RIT for the wrong server

2018-12-19 Thread Sergey Shelukhin (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-21623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-21623: - Description: A server died while some region was being opened on it; eventually the

<    1   2   3   4   5   6   7   8   9   10   >