Hi guys,

It’s really exciting that we are moving to one flexible solution to have the IP 
and interface tables separately. Based on this, I have some ideas for the 
change of the Traffic Monitor(TM). Actually, the TM have two different purposes 
to polling the cache servers. One is to make sure the connectivity is ok, the 
other is to query the stat of the interface. The connectivity is actually for 
the IP (with port) connectivity, while the stat is for the interface. Since we 
will have multiple interfaces and may have multiple IPs for each interface, I 
think the polling to the cache server should be like:
1) Check the connectivity of all the configured IPs of the cache server
  This function works like the keep alive to make sure all the configured IPs 
is reachable.
  
2) Query the stat for all the configured interfaces of the cache server
     This function is to query the interfaces stat, it doesn’t care which IP 
will be used to query the stat.

So I suggest the change to the current TM to be like:
1) Separate the current polling of cache servers into two different pollings, 
one for the keep alive, the other for the stat query.
  For the keep alive polling, we can use some specific url to check if the IP 
with port is opened or not; For the stat query, we will reuse the astats 
plugin, but need a little change to the uri to support returning multiple 
interfaces stat in the same request.
  If we don’t separate it, we will have to issue each stat query for each IP, 
and all the IPs of one interface will return the same big response, which is 
unnecessary, and will bring much more unnecessary load to the ATS and TM. If we 
separate it, we could make the keep alive check very light weighted, and also 
can use one request to query all the configured interfaces stat. Besides, we 
could also use different polling intervals for the keep alive and stat query.

2) The health availability of the cache server will be based on the configured 
IPs of the cache server
     Currently, we only have the availability of one cache server, but it will 
not be enough when we add multiple interfaces and multiple IPs support. We need 
record the availability for each configured IP so that if it’s assigned, the 
router can check if it can redirect the client request to that assigned IP or 
not.

3) The max available bandwidth check should be changed to per interface.
  Currently, the max available bandwidth is only for the primary interface of 
the cache server, if we have multiple interfaces support, we should check the 
bandwidth availability for each interface. If one interface is overload, we 
will mark all the IPs of that interface to be unavailable.

4) We will pick up one available IP to query the stat.
     Currently, we use the fqdn of the cache server to query the stat, since we 
will have multiple IPs, we could pick up one available IP to query.
 

Best regards,
Neil

On 4/4/18, 1:23 PM, "Nir Sopher" <n...@qwilt.com> wrote:

    +1
    Note that beyond the DB, the change should also be reflected into the
    cr-config.
    As I see it, a flexible model may be built of the below items:
    1 - Edge server.
    2- Interface
    3. IPs
    
    The Interface (or should it be called "delivery unit") is the element we
    redirect the traffic to and which is monitored by the traffic-monitor:
    * Each server may have multiple interfaces
    * Each interface may have multiple IPs
    * Interfaces has priorities (abstraction for primary/secondary)
    * Each interface is given a seperate DNS name by the router. Single name
    for the multiple IPs.
    * Each interface is monitored and reported seperately by the traffic
    monitor, including health an stats.
    
    
    The router "redirect target decision" may look as follows
    0. Select cache as we do today taking into account the consistent hash. A
    server is in the selection group only if one of its interfaces is found to
    be healthy
    1. Once we have server selected, select an interface out of all interfaces
    of the server with max available priority.
    
    An additional improvement, may assign DS to interfaces instead of servers.
    A server serves DS X iff one of its interfaces is assigned to the DS.
    
    Nir
    
    
    On Apr 4, 2018 6:56 AM, "Zhilin Huang (zhilhuan)" <zhilh...@cisco.com>
    wrote:
    
    Updated the DB schema in section 3.1.1.4
    
    Thanks,
    Zhilin
    
    
    
    On 04/04/2018, 11:02 AM, "Zhilin Huang (zhilhuan)" <zhilh...@cisco.com>
    wrote:
    
        Good points. I am happy to make this change in the design doc.
    
        Thanks,
        Zhilin
    
    
        On 03/04/2018, 8:17 PM, "Eric Friedrich (efriedri)" <efrie...@cisco.com>
    wrote:
    
            I would prefer a consistent way to store all interface and IP
    address information. Its good database design practice to store similar
    information in similar tables (i.e. all IP info in 1 table) rather than
    keep some IPs in the server table and some IPs in another table.
    
            I also think this refactoring will give us greater flexibility for
    more changes in the future. Outside of this particular use case, we might
    have additional features like sharing edges between public/private networks
    or having multiple (equal priority) streaming interfaces on a cache.
    
            These future features would be easier if the interface data and IP
    data is all organized into separate tables.
    
            I’d also like to see the delivery service to IP mapping be a many
    to many mapping in the DB. For this particular feature we will only assign
    a single IP (and we can restrict that in the API if we want), but I am near
    certain that in the future we would like the ability to assign a DS to
    multiple IPs on the same cache.
    
    
            —Eric
    
    
    
            > On Apr 3, 2018, at 2:42 AM, Zhilin Huang (zhilhuan) <
    zhilh...@cisco.com> wrote:
            >
            > Hi Mark,
            >
            > Thanks for your comments. Please check my reply in another thread:
            >
            > If we all agreed to use unified tables for all IPs and/or
    interfaces: primary, management, secondary, then there need to be two
    tables: IP and interface.
            > And in the server table, we need to replace the original
    "interface_xxx", "ip_xxx", "ip6_xxx" fields with a "primary_ip_id" field.
    And do similar things to management IP.
            >
            > Thanks,
            > Zhilin
            >
            >
            > On 03/04/2018, 7:08 AM, "Mark Torluemke" <mtorlue...@apache.org>
    wrote:
            >
            >    I would support an 'interfaces' table (adding some sort of a
    'type' column)
            >    that would include moving the management and lights out
    management
            >    interfaces to that table as well.
            >
            >    Cheers,
            >    Mark
            >
            >    On Mon, Apr 2, 2018 at 2:39 PM, Nir Sopher <n...@qwilt.com>
    wrote:
            >
            >> Hi Zhilin,
            >>
            >> I took a quick look into the spec. Hope to have the opportunity
    to dive
            >> deeper into it soon so we can further discuss it.
            >>
            >> For now I have a 2 questions.
            >> In the spec, you refer to "secondary interfaces", and you have a
    list of
            >> secondary interfaces added.
            >> IIUC the secondary interfaces are used as long as they are
    available, and
            >> when down, you move to the primary interface.
            >>
            >> Why not, instead of holding a secondary interfaces table, move
    all
            >> interfaces to a separate table? Primary and secondary.
            >> For each interface you can hold:
            >>
            >>   - Server id
            >>   - name (e.g. eth0)
            >>   - IPv6
            >>   - IPv4
            >>   - Priority (Integer as flexible as you wish: e.g. "1" for
    "secondary",
            >>   "2" for "primary" in your example,)
            >>
            >>
            >> Additionally, it is not clear to me what happens if one of the
    interfaces
            >> fails?
            >> Does every interface has a unique DNS name? If an interface
    fails, are
            >> redirects
            >> sent only to the available (secondary) interfaces?
            >>
            >> Thanks,
            >> Nir
            >>
            >>
            >> On Mon, Apr 2, 2018 at 10:21 AM, Zhilin Huang (zhilhuan) <
            >> zhilh...@cisco.com
            >>> wrote:
            >>
            >>> Hi Guys,
            >>>
            >>> This was originally posted in another discussion. Resend this
    in a
            >>> standalone topic to catch more awareness. The link for the
    design doc:
            >>>
    https://docs.google.com/document/d/1vgq-pGNoLLYf7Y3cu5hWu67TUKpN5hucrp
            >>> -ZS9nSsd4/edit?usp=sharing
            >>>
            >>>
            >>> Short summary for the feature design:
            >>> ---
            >>> There is feature request from market to add secondary IPs
    support on edge
            >>> cache servers, and the functionality to assign a delivery
    service to a
            >>> secondary IP of an edge cache.
            >>>
            >>> This feature requires Traffic Ops implementation to support
    secondary IP
            >>> configuration for edge cache, and delivery service assignment to
            >> secondary
            >>> IP.
            >>>
            >>> Traffic Monitor should also monitor connectivity of secondary
    IPs
            >>> configured. And Traffic Router needs support to resolve
    streamer FQDN to
            >>> secondary IP assigned in a delivery service.
            >>>
            >>> Traffic Server should record the IP serving client request. And
    should
            >>> reject request to an unassigned IP for a delivery service.
            >>>
            >>> This design has taken compatibility into consideration: if no
    secondary
            >> IP
            >>> configured, or some parts of the system has not been upgraded
    to the
            >>> version supports this feature, the traffic will be served by
    primary IPs
            >> as
            >>> before.
            >>> ---
            >>>
            >>> Much appreciated and welcome to any comments. If no major
    objections, we
            >>> planned to start coding this week.
            >>>
            >>> Thanks,
            >>> Zhilin
            >>>
            >>>
            >>
            >
            >
    

Reply via email to