Greetings everyone,
This week the implementation for the miniDFSCluster that will run
the tests of HDFS for vxquery is completed.In the cluster.properties
configuration file of the vxquery server I added another property that
gives the path to the configuration file of the HDFS.By default this
value is set to the configuration folder of the miniDFSCluster.The user
that want to run his queries on his DFS cluster will have to change that
value to the configuration path of his HDFS cluster.
This functionality along with some minor changes in the code will
be added in my next pull request.
Also, the split scheduler is added to the my vxquery codebase and I
am currently trying to make the XMLParser parse the data blocks from
HDFS.It needs more work since the parser expects well formed xml
documents and the blocks returned from the HDFS are just parts of the
complete file.This is the part that I will focus on completing for the
next week.
Thank you,
Efi