[
https://issues.apache.org/jira/browse/KUDU-2835?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16852801#comment-16852801
]
Xu Yao edited comment on KUDU-2835 at 5/31/19 8:37 AM:
-------------------------------------------------------
This is a good question. I think there are three main reasons:
1. Most of our distributed jobs are run by resources of online services. These
resources are constantly changing, so it is difficult to deploy monitor-agents.
2. We have a lot of distributed jobs running (there are tens of thousands of
jobs running at the same time), and using kudu is part of it. It is difficult
to track only the distributed jobs of kudu.
3. We are more concerned about the timeout caused by service-side than the
networking issue. Because the work environment of distributed jobs may be poor.
was (Author: oclarms):
This is a good question. I think there are three main reasons:
1. Most of our distributed jobs are run by resources of online services. These
resources are constantly changing, so it is difficult to deploy monitor-agents.
2. We have a lot of distributed jobs running (there are tens of thousands of
jobs running at the same time), and using kudu is part of it. It is difficult
to track only the distributed jobs of kudu.
3. We are more concerned about the timeout caused by service-side than the
networking issue.
> Add custom id in RpcHeader
> --------------------------
>
> Key: KUDU-2835
> URL: https://issues.apache.org/jira/browse/KUDU-2835
> Project: Kudu
> Issue Type: Improvement
> Reporter: Xu Yao
> Priority: Major
>
> In our production environment, there are many distributed jobs that send
> request to Kudu by KuduClient. However, if there are some RPC timeouts on the
> server, it is difficult to find the affected KuduClient based on the
> information of rpcz. Because there may be many KuduClients on each host.
> So we want to add extra information to RpcHeader to find out the problematic
> distributed tasks.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)