hanahmily commented on issue #13811:
URL: https://github.com/apache/skywalking/issues/13811#issuecomment-4230398545

   Neither "intra-node truncation" nor "cross-node truncation" is a real 
problem for the current architecture, because entity-to-shard affinity 
guarantees that all data for a given entity
     is processed by a single node's streaming TopN processor. There's no 
partial aggregation across nodes to worry about.
   
   > Cross-node truncation: An entity that ranks low on one data node might 
rank high globally after merging partial results across all nodes.
   
   This would only be true if the same entity's data were split across multiple 
nodes — for example, if COUNT aggregation required summing partial counts from 
node A and node B. But since entity routing is deterministic by shard:
   
    - Entity5's all data points land on exactly one node
    - That node's streaming processor computes the correct TopN ranking for 
entity5 based on all its data
    - If entity5 doesn't make the local top-N, it genuinely has a lower value 
than the N entities that did — no missing partial data exists on other nodes
   
   > Intra-node truncation: A single data node may have multiple data points 
for the same entity (e.g., across different shards),
   
   It's not true. The same entity's data points always go to the same shard. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to