[jira] [Commented] (FLINK-27341) TaskManager running together with JobManager are bind to 127.0.0.1

2022-12-01 Thread Yun Gao (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-27341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17642246#comment-17642246
 ] 

Yun Gao commented on FLINK-27341:
-

Merged on master via fc5e8bacef34119579defca6256476482da523f9

Merged on 1.15 via 84e8806af16f81a4295ca1f9b0c711c210884b1d

Merged on 1.16 via 8a47420adfe9af7f1c303874c7a32dab3229ea66

> TaskManager running together with JobManager are bind to 127.0.0.1
> --
>
> Key: FLINK-27341
> URL: https://issues.apache.org/jira/browse/FLINK-27341
> Project: Flink
>  Issue Type: Bug
>  Components: Runtime / Coordination
>Affects Versions: 1.15.0, 1.16.0
>Reporter: Yun Gao
>Assignee: Weihua Hu
>Priority: Major
>  Labels: pull-request-available
>
> If some TaskManagers running with JobManager on the same machine while some 
> other TaskManager not, the TaskManagers running together with JobManager 
> would bind to localhost or 127.0.01, which makes the Netty connections across 
> the TaskManagers fail.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (FLINK-27341) TaskManager running together with JobManager are bind to 127.0.0.1

2022-11-15 Thread Yun Gao (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-27341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17634670#comment-17634670
 ] 

Yun Gao commented on FLINK-27341:
-

Very thanks [~huwh] for the checking! I'll have a look.

> TaskManager running together with JobManager are bind to 127.0.0.1
> --
>
> Key: FLINK-27341
> URL: https://issues.apache.org/jira/browse/FLINK-27341
> Project: Flink
>  Issue Type: Bug
>  Components: Runtime / Coordination
>Affects Versions: 1.15.0, 1.16.0
>Reporter: Yun Gao
>Assignee: Weihua Hu
>Priority: Major
>  Labels: pull-request-available
>
> If some TaskManagers running with JobManager on the same machine while some 
> other TaskManager not, the TaskManagers running together with JobManager 
> would bind to localhost or 127.0.01, which makes the Netty connections across 
> the TaskManagers fail.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (FLINK-27341) TaskManager running together with JobManager are bind to 127.0.0.1

2022-11-15 Thread Weihua Hu (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-27341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17634637#comment-17634637
 ] 

Weihua Hu commented on FLINK-27341:
---

[~gaoyunhaii] I tested it in Kubernetes/Docker/Standalone. Rremoving the 
loopback works fine. Could you help review this PR.

> TaskManager running together with JobManager are bind to 127.0.0.1
> --
>
> Key: FLINK-27341
> URL: https://issues.apache.org/jira/browse/FLINK-27341
> Project: Flink
>  Issue Type: Bug
>  Components: Runtime / Coordination
>Affects Versions: 1.15.0, 1.16.0
>Reporter: Yun Gao
>Assignee: Weihua Hu
>Priority: Major
>  Labels: pull-request-available
>
> If some TaskManagers running with JobManager on the same machine while some 
> other TaskManager not, the TaskManagers running together with JobManager 
> would bind to localhost or 127.0.01, which makes the Netty connections across 
> the TaskManagers fail.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (FLINK-27341) TaskManager running together with JobManager are bind to 127.0.0.1

2022-11-10 Thread Weihua Hu (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-27341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17631543#comment-17631543
 ] 

Weihua Hu commented on FLINK-27341:
---

[~gaoyunhaii] yes, we can directly remove the LOOPBACK, i will do more tests in 
different deployment targets.

> TaskManager running together with JobManager are bind to 127.0.0.1
> --
>
> Key: FLINK-27341
> URL: https://issues.apache.org/jira/browse/FLINK-27341
> Project: Flink
>  Issue Type: Bug
>  Components: Runtime / Coordination
>Affects Versions: 1.15.0, 1.16.0
>Reporter: Yun Gao
>Assignee: Weihua Hu
>Priority: Major
>
> If some TaskManagers running with JobManager on the same machine while some 
> other TaskManager not, the TaskManagers running together with JobManager 
> would bind to localhost or 127.0.01, which makes the Netty connections across 
> the TaskManagers fail.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (FLINK-27341) TaskManager running together with JobManager are bind to 127.0.0.1

2022-11-10 Thread Chesnay Schepler (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-27341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17631507#comment-17631507
 ] 

Chesnay Schepler commented on FLINK-27341:
--

It should be safe to just remove it imo.

> TaskManager running together with JobManager are bind to 127.0.0.1
> --
>
> Key: FLINK-27341
> URL: https://issues.apache.org/jira/browse/FLINK-27341
> Project: Flink
>  Issue Type: Bug
>  Components: Runtime / Coordination
>Affects Versions: 1.15.0, 1.16.0
>Reporter: Yun Gao
>Assignee: Weihua Hu
>Priority: Major
>
> If some TaskManagers running with JobManager on the same machine while some 
> other TaskManager not, the TaskManagers running together with JobManager 
> would bind to localhost or 127.0.01, which makes the Netty connections across 
> the TaskManagers fail.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (FLINK-27341) TaskManager running together with JobManager are bind to 127.0.0.1

2022-11-09 Thread Yun Gao (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-27341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17631342#comment-17631342
 ] 

Yun Gao commented on FLINK-27341:
-

Hi [~chesnay]  sorry for missing the comment, I also think we should drop the 
lookback address, but it may need some tests for different deployment targets. 

And very thanks [~huwh] for helping tracking this issue! I have assigned the 
issue to you. And for the method, do you think we might directly remove the 
LOOPBACK? I previously have some try with 
[https://github.com/gaoyunhaii/flink/pull/new/remove_loopback] and it should 
works in standalone session. 

> TaskManager running together with JobManager are bind to 127.0.0.1
> --
>
> Key: FLINK-27341
> URL: https://issues.apache.org/jira/browse/FLINK-27341
> Project: Flink
>  Issue Type: Bug
>  Components: Runtime / Coordination
>Affects Versions: 1.15.0, 1.16.0
>Reporter: Yun Gao
>Assignee: Weihua Hu
>Priority: Major
>
> If some TaskManagers running with JobManager on the same machine while some 
> other TaskManager not, the TaskManagers running together with JobManager 
> would bind to localhost or 127.0.01, which makes the Netty connections across 
> the TaskManagers fail.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (FLINK-27341) TaskManager running together with JobManager are bind to 127.0.0.1

2022-11-06 Thread Weihua Hu (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-27341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17629614#comment-17629614
 ] 

Weihua Hu commented on FLINK-27341:
---

Hi, [~chesnay] , [~gaoyunhaii]  I think this was introduced by FLINK-24474. It 
uses the loopback address as the default address. In these configs, it only 
supports flink clusters running on a single host, and taskmanager must use the 
loopback interface to connect with the jobmanager, since the jobmanager only 
binds the loopback interface. But if we don't set the bind-address to 
localhost, taskmanager should not use the loopback interface to find its 
external address. otherwise, this will cause other TaskManagers to not connect 
with it.

IMO, we can determine whether to use the loopback interface by whether the 
taskmanager.bind-host is loopback address. 

If this change is acceptable, you can assign this to me.

> TaskManager running together with JobManager are bind to 127.0.0.1
> --
>
> Key: FLINK-27341
> URL: https://issues.apache.org/jira/browse/FLINK-27341
> Project: Flink
>  Issue Type: Bug
>  Components: Runtime / Coordination
>Affects Versions: 1.15.0, 1.16.0
>Reporter: Yun Gao
>Priority: Major
>
> If some TaskManagers running with JobManager on the same machine while some 
> other TaskManager not, the TaskManagers running together with JobManager 
> would bind to localhost or 127.0.01, which makes the Netty connections across 
> the TaskManagers fail.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (FLINK-27341) TaskManager running together with JobManager are bind to 127.0.0.1

2022-11-01 Thread Chesnay Schepler (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-27341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17627128#comment-17627128
 ] 

Chesnay Schepler commented on FLINK-27341:
--

[~gaoyunhaii] What makes this so complicated? I thought we just have to drop 
the loopback address resolution strategy again.

> TaskManager running together with JobManager are bind to 127.0.0.1
> --
>
> Key: FLINK-27341
> URL: https://issues.apache.org/jira/browse/FLINK-27341
> Project: Flink
>  Issue Type: Bug
>  Components: Runtime / Coordination
>Affects Versions: 1.15.0, 1.16.0
>Reporter: Yun Gao
>Assignee: Chesnay Schepler
>Priority: Major
>
> If some TaskManagers running with JobManager on the same machine while some 
> other TaskManager not, the TaskManagers running together with JobManager 
> would bind to localhost or 127.0.01, which makes the Netty connections across 
> the TaskManagers fail.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (FLINK-27341) TaskManager running together with JobManager are bind to 127.0.0.1

2022-10-31 Thread Yun Gao (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-27341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17626508#comment-17626508
 ] 

Yun Gao commented on FLINK-27341:
-

Hi [~nobleyd] sorry after I checked the code, it looks more complicated than I 
initially thought, thus I'm not very confident in changing this part of code 
right before the deadline without causing other issues and we'll move the fix 
after the release.

I also still think the issue is critical and cause a lot of confusions, we are 
still working on this issue and will fix it soon. 

> TaskManager running together with JobManager are bind to 127.0.0.1
> --
>
> Key: FLINK-27341
> URL: https://issues.apache.org/jira/browse/FLINK-27341
> Project: Flink
>  Issue Type: Bug
>  Components: Runtime / Coordination
>Affects Versions: 1.15.0, 1.16.0
>Reporter: Yun Gao
>Assignee: Chesnay Schepler
>Priority: Major
>
> If some TaskManagers running with JobManager on the same machine while some 
> other TaskManager not, the TaskManagers running together with JobManager 
> would bind to localhost or 127.0.01, which makes the Netty connections across 
> the TaskManagers fail.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (FLINK-27341) TaskManager running together with JobManager are bind to 127.0.0.1

2022-10-29 Thread nobleyd (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-27341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17626171#comment-17626171
 ] 

nobleyd commented on FLINK-27341:
-

Hi, this issue is not resolve in flink1.16, whilte I think it is important. 
[~gaoyunhaii] 

> TaskManager running together with JobManager are bind to 127.0.0.1
> --
>
> Key: FLINK-27341
> URL: https://issues.apache.org/jira/browse/FLINK-27341
> Project: Flink
>  Issue Type: Bug
>  Components: Runtime / Coordination
>Affects Versions: 1.15.0, 1.16.0
>Reporter: Yun Gao
>Assignee: Chesnay Schepler
>Priority: Major
>
> If some TaskManagers running with JobManager on the same machine while some 
> other TaskManager not, the TaskManagers running together with JobManager 
> would bind to localhost or 127.0.01, which makes the Netty connections across 
> the TaskManagers fail.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (FLINK-27341) TaskManager running together with JobManager are bind to 127.0.0.1

2022-09-06 Thread Yun Gao (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-27341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17600640#comment-17600640
 ] 

Yun Gao commented on FLINK-27341:
-

Hi [~nobleyd] If currently no other progress, I'll open a PR soon and try to 
make it included in 1.16. 

> TaskManager running together with JobManager are bind to 127.0.0.1
> --
>
> Key: FLINK-27341
> URL: https://issues.apache.org/jira/browse/FLINK-27341
> Project: Flink
>  Issue Type: Bug
>  Components: Runtime / Coordination
>Affects Versions: 1.15.0, 1.16.0
>Reporter: Yun Gao
>Assignee: Chesnay Schepler
>Priority: Major
>
> If some TaskManagers running with JobManager on the same machine while some 
> other TaskManager not, the TaskManagers running together with JobManager 
> would bind to localhost or 127.0.01, which makes the Netty connections across 
> the TaskManagers fail.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (FLINK-27341) TaskManager running together with JobManager are bind to 127.0.0.1

2022-08-26 Thread nobleyd (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-27341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17585171#comment-17585171
 ] 

nobleyd commented on FLINK-27341:
-

Hi, I want to known which release will fix this issue, is there any plan?

> TaskManager running together with JobManager are bind to 127.0.0.1
> --
>
> Key: FLINK-27341
> URL: https://issues.apache.org/jira/browse/FLINK-27341
> Project: Flink
>  Issue Type: Bug
>  Components: Runtime / Coordination
>Affects Versions: 1.15.0, 1.16.0
>Reporter: Yun Gao
>Assignee: Chesnay Schepler
>Priority: Major
>
> If some TaskManagers running with JobManager on the same machine while some 
> other TaskManager not, the TaskManagers running together with JobManager 
> would bind to localhost or 127.0.01, which makes the Netty connections across 
> the TaskManagers fail.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (FLINK-27341) TaskManager running together with JobManager are bind to 127.0.0.1

2022-04-21 Thread Chesnay Schepler (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-27341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17525730#comment-17525730
 ] 

Chesnay Schepler commented on FLINK-27341:
--

We're currently checking if the loopback address resolution strategy is still 
necessary; if not we may be able to remove it or hide it behind a feature flag. 
The recent fixes around yarn related to FLINK-24474 may have removed the "need" 
for it. (Quotation marks because I think we were only fighting symptoms).

> TaskManager running together with JobManager are bind to 127.0.0.1
> --
>
> Key: FLINK-27341
> URL: https://issues.apache.org/jira/browse/FLINK-27341
> Project: Flink
>  Issue Type: Bug
>  Components: Runtime / Coordination
>Affects Versions: 1.15.0, 1.16.0
>Reporter: Yun Gao
>Assignee: Chesnay Schepler
>Priority: Major
>
> If some TaskManagers running with JobManager on the same machine while some 
> other TaskManager not, the TaskManagers running together with JobManager 
> would bind to localhost or 127.0.01, which makes the Netty connections across 
> the TaskManagers fail.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)