[jira] [Assigned] (KUDU-3258) Expose some kind of transaction dashboard in ksck or the web UI

2021-04-19 Thread Andrew Wong (Jira)


 [ 
https://issues.apache.org/jira/browse/KUDU-3258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Wong reassigned KUDU-3258:
-

Assignee: Andrew Wong

> Expose some kind of transaction dashboard in ksck or the web UI
> ---
>
> Key: KUDU-3258
> URL: https://issues.apache.org/jira/browse/KUDU-3258
> Project: Kudu
> Issue Type: Improvement
> Components: ops-tooling, transactions
> Reporter: Andrew Wong
> Assignee: Andrew Wong
> Priority: Major
>
> It would be useful to expose the locations and tablet IDs of the
> TxnStatusManager replicas, and even to show their health from a unified
> front, whether that's the web UI, ksck, or both. Some useful things to know
> about:
>  - The tablet ID, range, and location of each TxnStatusManager partition
>  - The highest transaction ID per TxnStatusManager partition
>  - In-flight (not COMMITTED or ABORTED) transactions and their current state,
> though it would also be nice to filter by specific states
>  - Commit timestamp (and other relevant timestamps, if available, reported
> with physical and logical portions)
>  - We could also consider storing the transaction creation time in the same
> way that we have a "time created" for tables in the masters
>
> After some discussion with Alexey, we think it'd be more useful to focus on:
>  * having a separate section in ksck to display the health of the transaction
> status table
>  * having a separate tool to focus on displaying the business logic of the
> TxnStatusManager partitions (not the web UI, for now)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (KUDU-3258) Expose some kind of transaction dashboard in ksck or the web UI

2021-04-19 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/KUDU-3258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17325306#comment-17325306
 ] 

ASF subversion and git services commented on KUDU-3258:
---

Commit bdb6d0643ec7b68f11cefc038dc256824ce96f5e in kudu's branch 
refs/heads/master from Andrew Wong
[ https://gitbox.apache.org/repos/asf?p=kudu.git;h=bdb6d06 ]

[ksck] KUDU-3258: allow ksck and rebalancer to work on txn status table

This patch adds the transaction system table to the ksck output in its
own section for system tables. Here's a sample snippet of an output that
has the system table:

Summary by system table
             Name              | RF |      Status      | Total Tablets | Healthy | Recovering | Under-replicated | Unavailable
-------------------------------+----+------------------+---------------+---------+------------+------------------+-------------
 kudu_system.kudu_transactions | 3  | UNDER_REPLICATED | 1             | 0       | 0          | 1                | 0

Summary by table
                         Name                          | RF |   Status    | Total Tablets | Healthy | Recovering | Under-replicated | Unavailable
-------------------------------------------------------+----+-------------+---------------+---------+------------+------------------+-------------
 default.loadgen_auto_05cf5be513ea4a84a052e8044f641c1a | 1  | UNAVAILABLE | 8             | 6       | 0          | 0                | 2
 default.loadgen_auto_0c7ea48d5f6948408694b176f70e69ec | 1  | UNAVAILABLE | 8             | 5       | 0          | 0                | 3
 default.loadgen_auto_241be343981c46d081ab2b3d2e3b6e6a | 1  | UNAVAILABLE | 8             | 5       | 0          | 0                | 3
 default.loadgen_auto_385476d5d3b6493f8cbf659c8a4cf7cc | 1  | UNAVAILABLE | 8             | 6       | 0          | 0                | 2
 default.loadgen_auto_430e280e8aa7450591da67ae15ff0f37 | 1  | UNAVAILABLE | 8             | 6       | 0          | 0                | 2

The section can be included/excluded via the --sections flag of ksck.

Since ksck and the rebalancer use the same cluster-examining code, this
patch also updates the rebalancer cluster_status class to account for
system tables -- the tool would have crashed upon trying to find the
replication factor of the system table otherwise.

Change-Id: I8162f6eb046d98791c6bdeb5c15a0af72487300d
Reviewed-on: http://gerrit.cloudera.org:8080/17315
Tested-by: Andrew Wong 
Reviewed-by: Alexey Serbin 
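
The Status column in the sample output above is consistent with each table reporting the worst health found among its tablets. The following is a minimal Python sketch of that reading, not ksck's actual code: the Health enum, its ordering, and the table_status helper are assumptions made for illustration.

```python
from enum import IntEnum


class Health(IntEnum):
    # Assumed severity order, from best to worst (illustrative only).
    HEALTHY = 0
    RECOVERING = 1
    UNDER_REPLICATED = 2
    UNAVAILABLE = 3


def table_status(healthy, recovering, under_replicated, unavailable):
    """Return the worst health category that has at least one tablet."""
    counts = {
        Health.HEALTHY: healthy,
        Health.RECOVERING: recovering,
        Health.UNDER_REPLICATED: under_replicated,
        Health.UNAVAILABLE: unavailable,
    }
    present = [health for health, n in counts.items() if n > 0]
    # A table with no tablets is treated as healthy in this sketch.
    return max(present) if present else Health.HEALTHY


# Rows taken from the sample output: (healthy, recovering, under-replicated, unavailable).
print(table_status(0, 0, 1, 0).name)
print(table_status(6, 0, 0, 2).name)
```

Under this assumption the kudu_system.kudu_transactions row, with one under-replicated tablet, maps to UNDER_REPLICATED, and the loadgen rows with unavailable tablets map to UNAVAILABLE.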


> Expose some kind of transaction dashboard in ksck or the web UI
> ---
>
> Key: KUDU-3258
> URL: https://issues.apache.org/jira/browse/KUDU-3258
> Project: Kudu
> Issue Type: Improvement
> Components: ops-tooling, transactions
> Reporter: Andrew Wong
> Priority: Major
>
> It would be useful to expose the locations and tablet IDs of the
> TxnStatusManager replicas, and even to show their health from a unified
> front, whether that's the web UI, ksck, or both. Some useful things to know
> about:
>  - The tablet ID, range, and location of each TxnStatusManager partition
>  - The highest transaction ID per TxnStatusManager partition
>  - In-flight (not COMMITTED or ABORTED) transactions and their current state,
> though it would also be nice to filter by specific states
>  - Commit timestamp (and other relevant timestamps, if available, reported
> with physical and logical portions)
>  - We could also consider storing the transaction creation time in the same
> way that we have a "time created" for tables in the masters
>
> After some discussion with Alexey, we think it'd be more useful to focus on:
>  * having a separate section in ksck to display the health of the transaction
> status table
>  * having a separate tool to focus on displaying the business logic of the
> TxnStatusManager partitions (not the web UI, for now)





[jira] [Assigned] (KUDU-3269) Add UUID of the server into the metrics output

2021-04-19 Thread Abhishek (Jira)


 [ 
https://issues.apache.org/jira/browse/KUDU-3269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Abhishek reassigned KUDU-3269:
--

Assignee: Abhishek

> Add UUID of the server into the metrics output
> --
>
> Key: KUDU-3269
> URL: https://issues.apache.org/jira/browse/KUDU-3269
> Project: Kudu
> Issue Type: Improvement
> Reporter: Alexey Serbin
> Assignee: Abhishek
> Priority: Minor
> Labels: newbie++
>
> The server's section in the metrics output looks like the following:
> {noformat}
> {
>     "type": "server",
>     "id": "kudu.tabletserver",
>     "attributes": {},
>     "metrics": [ ... ]
> }
> {noformat}
> It would be great to add the UUID of the server to the output along with
> {{"type"}}; the node/host name might be helpful as well.
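
One way to picture the request is a server entity whose "attributes" object carries the identity fields. The Python sketch below is only an illustration: the attribute names "uuid" and "hostname", the placement under "attributes" rather than next to "type", and the placeholder values are all assumptions, not something the issue specifies.

```python
import json


def with_server_identity(entity, uuid, hostname=None):
    """Return a copy of a metrics entity with identity attributes added."""
    attributes = dict(entity.get("attributes", {}))
    attributes["uuid"] = uuid  # hypothetical attribute name
    if hostname is not None:
        attributes["hostname"] = hostname  # hypothetical attribute name
    # Build a new dict so the original entity is left untouched.
    return {**entity, "attributes": attributes}


# The entity shape follows the issue description; "metrics" is left empty here.
server_entity = {
    "type": "server",
    "id": "kudu.tabletserver",
    "attributes": {},
    "metrics": [],
}

# Placeholder UUID and host name, made up for this example.
annotated = with_server_identity(
    server_entity, "0123456789abcdef0123456789abcdef", "ts-1.example.com"
)
print(json.dumps(annotated, indent=2))
```

Keeping the identity under "attributes" would leave the existing top-level keys unchanged for consumers that already parse them, which is the only reason this sketch puts it there.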





[jira] [Assigned] (KUDU-3245) Provide Client API to set verbose logging filtered by vmodule

2021-04-19 Thread Grant Henke (Jira)


 [ 
https://issues.apache.org/jira/browse/KUDU-3245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Grant Henke reassigned KUDU-3245:
-

Assignee: Grant Henke

> Provide Client API to set verbose logging filtered by vmodule 
> --
>
> Key: KUDU-3245
> URL: https://issues.apache.org/jira/browse/KUDU-3245
> Project: Kudu
> Issue Type: Improvement
> Components: client
> Reporter: Hao Hao
> Assignee: Grant Henke
> Priority: Major
>
> Similar to the
> [{{client::SetVerboseLogLevel}}|https://github.com/apache/kudu/blob/master/src/kudu/client/client.h#L164]
> API, it would be nice to add another API that enables verbose logging
> filtered by module.
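
Per-module filtering of this kind is usually expressed as a comma-separated list of pattern=level pairs, in the style of glog's vmodule setting. The Python sketch below models that idea only; it is not the Kudu client API, and the function names, the first-match-wins rule, and the use of shell-style wildcards are assumptions made for illustration.

```python
from fnmatch import fnmatchcase


def parse_vmodule(spec):
    """Parse 'pattern=level,...' into an ordered list of (pattern, level)."""
    rules = []
    for item in spec.split(","):
        item = item.strip()
        if not item:
            continue
        pattern, _, level = item.partition("=")
        rules.append((pattern, int(level)))
    return rules


def verbosity_for(module, rules, default_level=0):
    """Return the level of the first rule matching the module name."""
    for pattern, level in rules:
        if fnmatchcase(module, pattern):
            return level
    # No rule matched: fall back to the global verbose level.
    return default_level


rules = parse_vmodule("client=2,rpc*=1")
for module in ("client", "rpc_proxy", "tablet"):
    print(module, verbosity_for(module, rules))
```

A client API along these lines would let an application raise verbosity for one module without flooding the log with output from every other module.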





[jira] [Updated] (KUDU-3269) Add UUID of the server into the metrics output

2021-04-19 Thread Grant Henke (Jira)


 [ 
https://issues.apache.org/jira/browse/KUDU-3269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Grant Henke updated KUDU-3269:
--
Labels: newbie++  (was: )

> Add UUID of the server into the metrics output
> --
>
> Key: KUDU-3269
> URL: https://issues.apache.org/jira/browse/KUDU-3269
> Project: Kudu
> Issue Type: Improvement
> Reporter: Alexey Serbin
> Priority: Minor
> Labels: newbie++
>
> The server's section in the metrics output looks like the following:
> {noformat}
> {
>     "type": "server",
>     "id": "kudu.tabletserver",
>     "attributes": {},
>     "metrics": [ ... ]
> }
> {noformat}
> It would be great to add the UUID of the server to the output along with
> {{"type"}}; the node/host name might be helpful as well.





[jira] [Updated] (KUDU-3270) Add metrics for DNS resolution times and DNS cache-related metrics

2021-04-19 Thread Grant Henke (Jira)


 [ 
https://issues.apache.org/jira/browse/KUDU-3270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Grant Henke updated KUDU-3270:
--
Labels: roadmap-candidate  (was: )

> Add metrics for DNS resolution times and DNS cache-related metrics
> --
>
> Key: KUDU-3270
> URL: https://issues.apache.org/jira/browse/KUDU-3270
> Project: Kudu
> Issue Type: Improvement
> Reporter: Alexey Serbin
> Priority: Major
> Labels: roadmap-candidate
>
> When constructing a proxy for a remote peer to perform an RPC, DNS resolution
> is performed. It would be great to add metrics to measure how much time the
> DNS resolution takes. That information could be provided by:
> * a histogram for DNS resolver times
> * 'standard' cache metrics for the DNS resolver cache
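
The two kinds of metrics listed above can be sketched as a caching resolver that counts hits and misses and records the latency of each uncached lookup in a histogram. This is a minimal Python illustration, not Kudu's resolver: the class name, the bucket boundaries, and the injectable resolve_fn are all assumptions.

```python
import time
from bisect import bisect_left


class DnsResolverMetrics:
    """Caching resolver wrapper with hit/miss counters and a latency histogram."""

    # Upper bucket bounds in microseconds (assumed); the last slot is overflow.
    BUCKETS_US = [100, 1_000, 10_000, 100_000, 1_000_000]

    def __init__(self, resolve_fn, clock=time.perf_counter):
        self._resolve = resolve_fn  # callable: host -> list of addresses
        self._clock = clock
        self._cache = {}
        self.hits = 0
        self.misses = 0
        self.histogram = [0] * (len(self.BUCKETS_US) + 1)

    def resolve(self, host):
        if host in self._cache:
            self.hits += 1
            return self._cache[host]
        self.misses += 1
        start = self._clock()
        addrs = self._resolve(host)
        elapsed_us = (self._clock() - start) * 1e6
        # Bucket i counts lookups that took at most BUCKETS_US[i] microseconds.
        self.histogram[bisect_left(self.BUCKETS_US, elapsed_us)] += 1
        self._cache[host] = addrs
        return addrs


# A stand-in resolver keeps the example independent of the network.
resolver = DnsResolverMetrics(lambda host: ["127.0.0.1"])
resolver.resolve("ts-1.example.com")
resolver.resolve("ts-1.example.com")
print(resolver.hits, resolver.misses, sum(resolver.histogram))
```

To time real lookups, the stand-in could be replaced with something like `lambda host: socket.getaddrinfo(host, None)`. Only cache misses are timed here, since a hit never reaches the resolver.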





[jira] [Updated] (KUDU-3276) Cannot cancel transactions that have already replicated: Invalid argument: Client provided column menu_module_name STRING NULLABLE not present in tablet transaction

2021-04-19 Thread matebase (Jira)


 [ 
https://issues.apache.org/jira/browse/KUDU-3276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

matebase updated KUDU-3276:
---
Fix Version/s: (was: 1.10.0)

> Cannot cancel transactions that have already replicated: Invalid argument: 
> Client provided column menu_module_name STRING NULLABLE not present in tablet 
> transaction
> 
>
> Key: KUDU-3276
> URL: https://issues.apache.org/jira/browse/KUDU-3276
> Project: Kudu
> Issue Type: Bug
> Affects Versions: 1.10.0
> Reporter: matebase
> Priority: Major
>
> I've seen an issue that looks as though an ADD_COLUMN is not fully applied
> before writes are performed. This results in a failure to bootstrap (Tablet
> Server) with an error like:
> T 268c5ebdc7034dffaa3a40861ce0d3ed P 004b6c262dbf49508b69c716dca5bafd S R-NP
> Ts 6630181836860420096: Cannot cancel transactions that have already
> replicated: Invalid argument: Client provided column menu_module_name STRING
> NULLABLE not present in tablet transaction:R-NP WriteTransaction
> [type=REPLICA, start_time=2021-04-19 18:02:02, state=WriteTransactionState
> 0x129d4d500 [op_id=(term: 4033 index: 76346), ts=6630181836860420096,
> rows=[]]]
> It seems that this was fixed in version 1.9, but it happened again when I
> restarted the Tablet Server; refer to KUDU-2690.
> How can I manually update the column, or what operation would solve this
> problem?
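
The failure mode described above can be pictured with a toy replay loop in which a write is validated against whatever columns the tablet schema holds at that point in the log. This Python sketch is purely illustrative and is not Kudu's bootstrap code: the op tuples, the replay function, and the "id" column are hypothetical, and only the column name menu_module_name comes from the report.

```python
def replay(ops):
    """Apply ops in log order to a toy tablet schema; return the applied writes."""
    columns = set()
    applied = []
    for kind, payload in ops:
        if kind == "ADD_COLUMN":
            columns.add(payload)
        elif kind == "WRITE":
            # A write may only reference columns the schema already has.
            missing = [c for c in payload if c not in columns]
            if missing:
                raise ValueError(
                    f"Client provided column {missing[0]} not present in tablet"
                )
            applied.append(payload)
        else:
            raise ValueError(f"unknown op kind: {kind}")
    return applied


# Schema change applied before the write: replay succeeds.
in_order = [
    ("ADD_COLUMN", "id"),
    ("ADD_COLUMN", "menu_module_name"),
    ("WRITE", ["id", "menu_module_name"]),
]
print(replay(in_order))

# Write replayed before the schema change: replay fails.
out_of_order = [
    ("ADD_COLUMN", "id"),
    ("WRITE", ["id", "menu_module_name"]),
    ("ADD_COLUMN", "menu_module_name"),
]
try:
    replay(out_of_order)
except ValueError as err:
    print(err)
```

If the real cause is an ordering problem of this kind, the write itself is valid and only the schema seen at replay time is stale, which matches the reporter's reading that the ADD_COLUMN was not fully applied.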





[jira] [Reopened] (KUDU-3276) Cannot cancel transactions that have already replicated: Invalid argument: Client provided column menu_module_name STRING NULLABLE not present in tablet transaction

2021-04-19 Thread matebase (Jira)


 [ 
https://issues.apache.org/jira/browse/KUDU-3276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

matebase reopened KUDU-3276:


> Cannot cancel transactions that have already replicated: Invalid argument: 
> Client provided column menu_module_name STRING NULLABLE not present in tablet 
> transaction
> 
>
> Key: KUDU-3276
> URL: https://issues.apache.org/jira/browse/KUDU-3276
> Project: Kudu
> Issue Type: Bug
> Affects Versions: 1.10.0
> Reporter: matebase
> Priority: Major
> Fix For: 1.10.0
>
> I've seen an issue that looks as though an ADD_COLUMN is not fully applied
> before writes are performed. This results in a failure to bootstrap (Tablet
> Server) with an error like:
> T 268c5ebdc7034dffaa3a40861ce0d3ed P 004b6c262dbf49508b69c716dca5bafd S R-NP
> Ts 6630181836860420096: Cannot cancel transactions that have already
> replicated: Invalid argument: Client provided column menu_module_name STRING
> NULLABLE not present in tablet transaction:R-NP WriteTransaction
> [type=REPLICA, start_time=2021-04-19 18:02:02, state=WriteTransactionState
> 0x129d4d500 [op_id=(term: 4033 index: 76346), ts=6630181836860420096,
> rows=[]]]
> It seems that this was fixed in version 1.9, but it happened again when I
> restarted the Tablet Server; refer to KUDU-2690.
> How can I manually update the column, or what operation would solve this
> problem?





[jira] [Updated] (KUDU-3276) Cannot cancel transactions that have already replicated: Invalid argument: Client provided column menu_module_name STRING NULLABLE not present in tablet transaction

2021-04-19 Thread matebase (Jira)


 [ 
https://issues.apache.org/jira/browse/KUDU-3276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

matebase updated KUDU-3276:
---
Fix Version/s: 1.10.0
   Resolution: Fixed
   Status: Resolved  (was: In Review)

> Cannot cancel transactions that have already replicated: Invalid argument: 
> Client provided column menu_module_name STRING NULLABLE not present in tablet 
> transaction
> 
>
> Key: KUDU-3276
> URL: https://issues.apache.org/jira/browse/KUDU-3276
> Project: Kudu
> Issue Type: Bug
> Affects Versions: 1.10.0
> Reporter: matebase
> Priority: Major
> Fix For: 1.10.0
>
> I've seen an issue that looks as though an ADD_COLUMN is not fully applied
> before writes are performed. This results in a failure to bootstrap (Tablet
> Server) with an error like:
> T 268c5ebdc7034dffaa3a40861ce0d3ed P 004b6c262dbf49508b69c716dca5bafd S R-NP
> Ts 6630181836860420096: Cannot cancel transactions that have already
> replicated: Invalid argument: Client provided column menu_module_name STRING
> NULLABLE not present in tablet transaction:R-NP WriteTransaction
> [type=REPLICA, start_time=2021-04-19 18:02:02, state=WriteTransactionState
> 0x129d4d500 [op_id=(term: 4033 index: 76346), ts=6630181836860420096,
> rows=[]]]
> It seems that this was fixed in version 1.9, but it happened again when I
> restarted the Tablet Server; refer to KUDU-2690.
> How can I manually update the column, or what operation would solve this
> problem?





[jira] [Updated] (KUDU-3276) Cannot cancel transactions that have already replicated: Invalid argument: Client provided column menu_module_name STRING NULLABLE not present in tablet transaction

2021-04-19 Thread matebase (Jira)


 [ 
https://issues.apache.org/jira/browse/KUDU-3276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

matebase updated KUDU-3276:
---
Status: In Review  (was: Open)

> Cannot cancel transactions that have already replicated: Invalid argument: 
> Client provided column menu_module_name STRING NULLABLE not present in tablet 
> transaction
> 
>
> Key: KUDU-3276
> URL: https://issues.apache.org/jira/browse/KUDU-3276
> Project: Kudu
> Issue Type: Bug
> Affects Versions: 1.10.0
> Reporter: matebase
> Priority: Major
>
> I've seen an issue that looks as though an ADD_COLUMN is not fully applied
> before writes are performed. This results in a failure to bootstrap (Tablet
> Server) with an error like:
> T 268c5ebdc7034dffaa3a40861ce0d3ed P 004b6c262dbf49508b69c716dca5bafd S R-NP
> Ts 6630181836860420096: Cannot cancel transactions that have already
> replicated: Invalid argument: Client provided column menu_module_name STRING
> NULLABLE not present in tablet transaction:R-NP WriteTransaction
> [type=REPLICA, start_time=2021-04-19 18:02:02, state=WriteTransactionState
> 0x129d4d500 [op_id=(term: 4033 index: 76346), ts=6630181836860420096,
> rows=[]]]
> It seems that this was fixed in version 1.9, but it happened again when I
> restarted the Tablet Server; refer to KUDU-2690.
> How can I manually update the column, or what operation would solve this
> problem?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (KUDU-3276) Cannot cancel transactions that have already replicated: Invalid argument: Client provided column menu_module_name STRING NULLABLE not present in tablet transaction

2021-04-19 Thread matebase (Jira)


 [ 
https://issues.apache.org/jira/browse/KUDU-3276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

matebase updated KUDU-3276:
---
Description: 
I've seen an issue that looks as though an ADD_COLUMN is not fully applied
before writes are performed. This results in a failure to bootstrap (Tablet
Server) with an error like:

T 268c5ebdc7034dffaa3a40861ce0d3ed P 004b6c262dbf49508b69c716dca5bafd S R-NP Ts 
6630181836860420096: Cannot cancel transactions that have already replicated: 
Invalid argument: Client provided column menu_module_name STRING NULLABLE not 
present in tablet transaction:R-NP WriteTransaction [type=REPLICA, 
start_time=2021-04-19 18:02:02, state=WriteTransactionState 0x129d4d500 
[op_id=(term: 4033 index: 76346), ts=6630181836860420096, rows=[]]]

It seems that this was fixed in version 1.9, but it happened again when I
restarted the Tablet Server; refer to KUDU-2690.

How can I manually update the column, or what operation would solve this
problem?

  was:
I've seen an issue that looks as though an ADD_COLUMN is not fully applied
before writes are performed. This results in a failure to bootstrap (Tablet
Server) with an error like:

T 268c5ebdc7034dffaa3a40861ce0d3ed P 004b6c262dbf49508b69c716dca5bafd S R-NP Ts 
6630181836860420096: Cannot cancel transactions that have already replicated: 
Invalid argument: Client provided column menu_module_name STRING NULLABLE not 
present in tablet transaction:R-NP WriteTransaction [type=REPLICA, 
start_time=2021-04-19 18:02:02, state=WriteTransactionState 0x129d4d500 
[op_id=(term: 4033 index: 76346), ts=6630181836860420096, rows=[]]]

How can I manually update the column, or what operation would solve this
problem?


> Cannot cancel transactions that have already replicated: Invalid argument: 
> Client provided column menu_module_name STRING NULLABLE not present in tablet 
> transaction
> 
>
> Key: KUDU-3276
> URL: https://issues.apache.org/jira/browse/KUDU-3276
> Project: Kudu
> Issue Type: Bug
> Affects Versions: 1.10.0
> Reporter: matebase
> Priority: Major
>
> I've seen an issue that looks as though an ADD_COLUMN is not fully applied
> before writes are performed. This results in a failure to bootstrap (Tablet
> Server) with an error like:
> T 268c5ebdc7034dffaa3a40861ce0d3ed P 004b6c262dbf49508b69c716dca5bafd S R-NP
> Ts 6630181836860420096: Cannot cancel transactions that have already
> replicated: Invalid argument: Client provided column menu_module_name STRING
> NULLABLE not present in tablet transaction:R-NP WriteTransaction
> [type=REPLICA, start_time=2021-04-19 18:02:02, state=WriteTransactionState
> 0x129d4d500 [op_id=(term: 4033 index: 76346), ts=6630181836860420096,
> rows=[]]]
> It seems that this was fixed in version 1.9, but it happened again when I
> restarted the Tablet Server; refer to KUDU-2690.
> How can I manually update the column, or what operation would solve this
> problem?





[jira] [Created] (KUDU-3276) Cannot cancel transactions that have already replicated: Invalid argument: Client provided column menu_module_name STRING NULLABLE not present in tablet transaction

2021-04-19 Thread matebase (Jira)
matebase created KUDU-3276:
--

Summary: Cannot cancel transactions that have already replicated: 
Invalid argument: Client provided column menu_module_name STRING NULLABLE not 
present in tablet transaction
Key: KUDU-3276
URL: https://issues.apache.org/jira/browse/KUDU-3276
Project: Kudu
Issue Type: Bug
Affects Versions: 1.10.0
Reporter: matebase


I've seen an issue that looks as though an ADD_COLUMN is not fully applied
before writes are performed. This results in a failure to bootstrap (Tablet
Server) with an error like:

T 268c5ebdc7034dffaa3a40861ce0d3ed P 004b6c262dbf49508b69c716dca5bafd S R-NP Ts 
6630181836860420096: Cannot cancel transactions that have already replicated: 
Invalid argument: Client provided column menu_module_name STRING NULLABLE not 
present in tablet transaction:R-NP WriteTransaction [type=REPLICA, 
start_time=2021-04-19 18:02:02, state=WriteTransactionState 0x129d4d500 
[op_id=(term: 4033 index: 76346), ts=6630181836860420096, rows=[]]]

How can I manually update the column, or what operation would solve this
problem?


