Great.
If any of you think this PR is ready, please approve(but don’t merge) the
PR.

han liu <[email protected]>于2020年4月9日 周四下午10:21写道:

> Hi, Sheng Wu
>
> I have test the alarm module, and it works well.
>
> Sheng Wu <[email protected]> 于2020年4月9日周四 下午8:26写道:
>
>> Summary
>>
> This pull request is open for 10 days, and I worked on this 2 weeks ago.
>> Thanks everyone to help making this PR and tests work now.
>>
>> Here are the list of changes we made in this pull request. Top 3 are the
>> principle changes, others are following these principles.
>>
>>    1. New agent and mesh report protocol.
>>    2. New agent header protocol.
>>    3. Service register, instance register and network address register
>>    have been removed permanently.
>>    4. Service traffic, instance traffic and network alias metrics are
>>    added to replace the service, instance and network address inventory.
>>    5. Register process has been removed.
>>    6. Metrics stream process supports insert only mode, especially for
>>    traffic entities.
>>    7. Metrics stream process supports no-downsampling mode for traffic
>>    entities and network alias.
>>    8. Remove all register mechanism and cache in the java agent.
>>    9. Remove MONTH step in GraphQL query.
>>    10. Update UI to remove MONTH step query, the max query range is 60
>>    days now.
>>    11. Simplify the TTL to metrics and record. And the unit has been
>>    formatted in Day unit. No specific TTL for ElasticSearch storage.
>>    12. Buffer mechanism of trace receiver and mesh receiver has been
>>    removed due to no register.
>>    13. New service id, instance id and endpoint id rules, including
>>    service relation, instance relation and endpoint relation id rules.
>>    14. Java agent support keep tracing mode, meaning, agent generating
>>    tracing context even the backend is unconnected/unavailable.
>>    15. Plugin test tool up to date, in order to support new protocol.
>>    16. Plugin tests expected data files updated.
>>    17. E2E tests updated.
>>    18. [TBD] InfluxDB storage implementation is not available, need
>>    @dmsolr <https://github.com/dmsolr> to fix later, in order to reduce
>>    the master change block by this PR.
>>
>> If anyone plans to review the codes in the blocking mode(meaning don't
>> merge the PR), please let me know. Otherwise, I will try to merge this
>> tomorrow due to unblock the new changes in the master branch.
>>
>> Han Liu [email protected] <[email protected]>
>> I just want to wait for your alarm test result, due to e2e doesn't
>> include this.
>>
>> Sheng Wu 吴晟
>> Twitter, wusheng1108
>>
>>
>> kezhenxu94@apache <[email protected]> 于2020年4月9日周四 下午7:32写道:
>>
>>> The E2E should be fixed according the new codes now, good luck
>>>
>>>
>>>
>>> GitHub @kezhenxu94
>>> Apache SkyWalking, Apache Dubbo
>>>
>>> > On Apr 6, 2020, at 20:45, Sheng Wu <[email protected]> wrote:
>>> >
>>> > Zhenxu Ke
>>> > My PR is ready locally, but e2e seems still failing. Please help with
>>> > locating what are the issues.
>>> >
>>> > Haochao Zhuang
>>> > I noticed you have upgraded the test tool for v3 protocol. Please move
>>> on
>>> > making the plugin tests passed in the v8-core branch.
>>> >
>>> > Sheng Wu 吴晟
>>> > Twitter, wusheng1108
>>> >
>>> >
>>> > Sheng Wu <[email protected]> 于2020年3月31日周二 上午11:29写道:
>>> >
>>> >> Hi Dev Team
>>> >>
>>> >> After the experiences of removing endpoint_inventory, I found out this
>>> >> strategy is successful.
>>> >> Especially, we totally get rid of register, so I want to do more.
>>> >>
>>> >> *SkyWalking 8.0.0*
>>> >> First, it is already unexpected for me, we have to move to 8.0.0 so
>>> >> quickly, but after the discussion with +高洪涛@skywalking
>>> >> <[email protected]>, and thinking about this for several days, I
>>> think
>>> >> we have to.
>>> >>
>>> >> The key chances are following
>>> >> 1. Remove service, service instance, and network address register.
>>> The old
>>> >> register protocols are totally going to be removed.
>>> >> 2. The agent doesn't need to do register anymore. Service name and
>>> Service
>>> >> Instance name are generated by the agent itself, but the extra
>>> information,
>>> >> such as IP, hostname, language, should report to backend separately.
>>> >> 3. Service Traffic should be added just like the endpoint traffic but
>>> keep
>>> >> the time bucket as we need accurate service name in the given duration
>>> >> 4. Service Instance Traffic should be added too, with external
>>> >> information, such as language, hostname.
>>> >> 5. Trace context propagation context should be changed to accept
>>> string in
>>> >> service instance name, endpoint name and network address. This could
>>> ease
>>> >> the agent logic, but also, requires changes in all language agent and
>>> >> plugin test tool,
>>> >> 6. Trace report protocol requires to change too, in order to adopt the
>>> >> string.
>>> >> 7. e2e tests have to ignore PHP and LUA at first, and remove the 6.x
>>> >> compatibility test(doesn't support anymore).
>>> >>
>>> >> The benefits we will get are
>>> >> 1. Don't worry about the inventory(s) that has been deleted randomly
>>> by
>>> >> end users. (We received a lot of issue reports about this)
>>> >> 2. The upgrade could be easier erasing the whole storage and reboot
>>> the
>>> >> new one. (Users don't feel comfortable about upgrade)
>>> >> 3. No hot-reboot case in the agent side
>>> >> 4. No cache of network address register information in the agent.
>>> >> 5. No service and service instance cache in the OAP
>>> >> 6. No register lock in the OAP
>>> >> 7. No file buffer mechanism in the OAP too, same as no register
>>> happens.
>>> >>
>>> >> In my mind, I think this totally break upgrade is super meaningful and
>>> >> will be good change. Even we break many things, they are easy to
>>> follow.
>>> >> [email protected] <[email protected]> I think by following this,
>>> we
>>> >> need to change the collaboration header to `sw8` :) As no 7.1.0
>>> release
>>> >> will happen.
>>> >>
>>> >> Sheng Wu 吴晟
>>> >> Twitter, wusheng1108
>>> >>
>>>
>>> --
Sheng Wu 吴晟

Apache SkyWalking
Apache Incubator
Apache ShardingSphere, ECharts, DolphinScheduler podlings
Zipkin
Twitter, wusheng1108

Reply via email to