[Supporting Hadoop data and cluster management] weekly update

2015-07-26 Thread Efi
Hello everyone, The update for this week consists of two parts. The first is the CollectionWithTagRule, which is about reading the blocks from HDFS using the XMLInputFormat class. This rule informs the parser that it needs to read its data in blocks from HDFS and passes some additional information that are

Re: [Supporting Hadoop data and cluster management] weekly update

2015-07-04 Thread Efi
Hello everyone, This week's update is about the changes that I mentioned in my last update. The JUnit test is not complete yet. I am using a MiniDFSCluster implementation for the tests, but I haven't managed to get it to work correctly yet. I believe the problems are trivial and have not

Re: [Supporting Hadoop data and cluster management] weekly update

2015-06-16 Thread Eldon Carman
Looks good. One quick comment: take a look at our code format and style guidelines. You can set up Eclipse to format your code for you using our sister project's code format profile [1]. [1] http://vxquery.apache.org/development_eclipse_setup.html On Sat, Jun 13, 2015 at 11:03 AM, Michael Carey

Re: [Supporting Hadoop data and cluster management] weekly update

2015-06-13 Thread Efi
Hello everyone, The reading of a single document and a collection of documents from HDFS is completed and tested. New JUnit tests are added in the xtest project; they are just copies of the aggregate tests that I changed a bit to run for the collection reading from HDFS. I added another

Re: [Supporting Hadoop data and cluster management] weekly update

2015-06-13 Thread Michael Carey
Very cool!! On 6/13/15 9:38 AM, Efi wrote: Hello everyone, The reading of a single document and a collection of documents from HDFS is completed and tested. New JUnit tests are added in the xtest project; they are just copies of the aggregate tests that I changed a bit to run for the

Re: [Supporting Hadoop data and cluster management] weekly update

2015-06-04 Thread Eldon Carman
We have a set of JUnit tests to validate VXQuery. I think it would be a good idea to add test cases that validate the HDFS code you're adding to the code base. Take a look at the vxquery-xtest sub-project. The VXQuery Catalog holds all the VXQuery test cases [1]. You could add a new HDFS test group

[Supporting Hadoop data and cluster management] weekly update

2015-06-04 Thread Efi
Hello everyone, This week Preston and Steven helped me with the VXQuery code, specifically with where my parser and two more functionalities will fit in the code. Along with the HDFS parallel parser that I have been working on these past weeks, two more methods will be implemented. They will

[Supporting Hadoop data and cluster management] weekly update

2015-05-28 Thread Efi
This week I studied the VXQuery and Hyracks code in detail in order to add my parser to the project. I will continue working on adding my code to VXQuery and will try to implement some tests for it as well. I am also looking into ways to use the Hyracks HDFS code for the HDFS parser. Thank