Hans Zeller created TRAFODION-2437:
--------------------------------------

             Summary: Implement a monitor for communications between ESPs of a 
fragment
                 Key: TRAFODION-2437
                 URL: https://issues.apache.org/jira/browse/TRAFODION-2437
             Project: Apache Trafodion
          Issue Type: New Feature
          Components: sql-exe
            Reporter: Hans Zeller
            Assignee: Hans Zeller
            Priority: Minor


Today, the ESPs that are executing parallel fragment instances can't talk to 
each other. They all do their work independently and may finish at different 
times. This works fine for normal parallel algorithms, but it somewhat 
restricts what we can do in parallel. Sometimes, a communication or maybe just 
a simple synchronization would be desirable. Examples:

- Make all ESPs wait until the last ESP has finished before proceeding. This 
could create a global blocking operator, for example when we need a blocking 
operator to prevent the Halloween problem.
- Dynamic range-repartitioning: After doing local sorts, negotiate common split 
boundaries to perform a range-repartitioning, so that we can produce a globally 
sorted result in parallel.
- Communication between parallel instances of TMUDFs. Some TMUDFs may need 
similar communication, which could allow them to do more things in parallel.

When implementing this, we may also need to pay some attention to potential 
deadlocks, although with blocking operators like sorts there should not be any 
deadlock issues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to