Hi Preston,
Thanks for creating those diagrams!
A few comments/proposals:
1) I think that it would be good clarify the meaning of the shapes and
lines. For the first diagram I read regular rectangles as machines,
round rectangles as processes and the rectangle with the wavy bottom as
files. On the second one I'm not sure if the rounded rectangle around
HDFS is a process. Maybe we could add a legend for the diagrams?
2) When naming the machines I would replace "laptop" with "client" as
that's more generic and potentially fix the spelling of controller.
However, I think that the naming of the "Hyracks machines" doesn't add a
lot. Maybe we could just expand on the name of the processes to
NodeController and ClusterController and not have names for the
individual cluster nodes. Having he long process names would also ease
the connection between the diagrams and the code.
Does this make sense?
Cheers,
Till
On 17 Aug 2015, at 12:05, Eldon Carman wrote:
The following diagrams are intended to be used on our documentation
site
(as images in the HTML). I think they will be helpful in discussing
the
actual architecture of the VXQuery cluster, especially in Yarn.
Please post questions or suggestions on how to clarify or improve the
diagrams or cluster architecture.
VXQuery Cluster:
https://docs.google.com/drawings/d/1PZbvJk-G0J3hQffd-fFr2n893bXSNg3xfXFexM5c2A8/edit?usp=sharing
VXQuery Cluster using HDFS:
https://docs.google.com/drawings/d/1ge-0h8wa0Epio42Wor-SeBoafQdLSZxfKZFFQtcN1w0/edit?usp=sharing
VXQuery Yarn Cluster using HDFS: