[
https://issues.apache.org/jira/browse/TS-4642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15368750#comment-15368750
]
John Rushford edited comment on TS-4642 at 7/8/16 11:49 PM:
------------------------------------------------------------
@zwoop in our production environment we have a parent config line for every
delivery service, and they all use the same proxies. We do this precisely for
case #2, so that a misbehaving origin doesn't cause all parents for all
delivery services to be marked down, as would happen if the parent records were
shared. If they were shared, services unrelated to the misbehaving origin would
get 502 errors because the shared parents would eventually all be marked down.
Now for case #1, when a parent proxy becomes unreachable, the non-shared parents
are pretty quickly marked down by connection failures across all delivery
services. But don't forget that when a connection failure to a parent proxy is
detected, another parent is selected and that transaction is retried using
the new parent. In practice, Leif, I have not seen a problem with this. In
diags.log, you see the non-shared parent marked down maybe 20 or more times, but
that is because there are 20 or more non-shared records. You do not see any
502s because these transactions selected and used another parent proxy. I
don't see a problem here myself. What are your thoughts about this?
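
The behavior described above can be illustrated with a small model. This is only a sketch under stated assumptions, not Traffic Server code: the names ParentRecord, fetch, simulate and FAIL_THRESHOLD are hypothetical, the threshold value is made up, and parents are simply tried in listed order. It assumes each parent.config line keeps its own failure counts and that a failed connection causes the same transaction to be retried on the next available parent.

```python
from dataclasses import dataclass, field

# Hypothetical stand-in for the failure threshold configured in records.config.
FAIL_THRESHOLD = 3


@dataclass
class ParentRecord:
    """Models one parent.config line with its own private health state."""
    service: str
    parents: list
    failures: dict = field(default_factory=dict)
    down: set = field(default_factory=set)

    def report_failure(self, parent, log):
        # Count the failure against this record only, not against the host globally.
        self.failures[parent] = self.failures.get(parent, 0) + 1
        if self.failures[parent] >= FAIL_THRESHOLD and parent not in self.down:
            self.down.add(parent)
            log.append(f"{self.service}: {parent} marked down")


def fetch(record, unreachable, log):
    """Run one transaction: skip parents marked down, retry on connect failure."""
    for parent in record.parents:
        if parent in record.down:
            continue
        if parent in unreachable:
            record.report_failure(parent, log)
            continue  # retry the same transaction on the next parent
        return 200
    return 502  # no usable parent left for this record


def simulate(num_services, requests_per_service):
    """All services list the same two parents; p1 is unreachable."""
    log = []
    records = [ParentRecord(f"ds{i}", ["p1", "p2"]) for i in range(num_services)]
    statuses = [fetch(r, {"p1"}, log)
                for r in records for _ in range(requests_per_service)]
    total_failures = sum(r.failures.get("p1", 0) for r in records)
    return statuses, log, total_failures


statuses, log, total_failures = simulate(num_services=20, requests_per_service=5)
print(len(log), "mark-down log lines;", total_failures, "failed connects;",
      statuses.count(502), "502 responses")
```

In this model, 20 non-shared records produce 20 mark-down messages for the same unreachable parent and no 502s, since every transaction falls through to the second parent. It also shows the cost raised in the issue: the unreachable parent absorbs FAIL_THRESHOLD failed connects per record rather than per host.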
> Parent host healths across multiple configurations is inefficient
> -----------------------------------------------------------------
>
> Key: TS-4642
> URL: https://issues.apache.org/jira/browse/TS-4642
> Project: Traffic Server
> Issue Type: Improvement
> Components: Parent Proxy
> Reporter: Leif Hedstrom
> Assignee: John Rushford
> Fix For: sometime
>
>
> It seems that if I have multiple config lines in parent.config, where I use
> the same list of parents for multiple rules, each such rule keeps its own
> "health" status? That means that if you repeat a line 100 times, the number
> of "failures" until marked down is 100x more than what we configured in
> records.config.
> It seems that the health and failure counts should be per host, such that
> regardless of how many times I use each host in parent.config, it'll
> consistently use the settings from records.config.
> So, I'm not 100% sure this is how things work, but if they are not, I blame
> Phil.