[ 
https://issues.apache.org/jira/browse/VXQUERY-131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14354036#comment-14354036
 ] 

Preston Carman commented on VXQUERY-131:
----------------------------------------

Python Scripts -> 
https://git-wip-us.apache.org/repos/asf?p=vxquery.git;a=tree;f=vxquery-server/src/main/resources/scripts;h=aa6f2b49a285702bbdd695f1751fd49945c64880;hb=b1109faba960ef07cb6bd55b5285db057eb4d831

CLI -> 
https://git-wip-us.apache.org/repos/asf?p=vxquery.git;a=blob;f=vxquery-cli/src/main/java/org/apache/vxquery/cli/VXQuery.java;h=080f8a12db0189d5d3d705953a84eedc1b474f53;hb=b1109faba960ef07cb6bd55b5285db057eb4d831

XML Parser -> 
https://git-wip-us.apache.org/repos/asf?p=vxquery.git;a=tree;f=vxquery-core/src/main/java/org/apache/vxquery/xmlparser;h=27b267a29886bbcefc3c82ce13769b4afe53b421;hb=b1109faba960ef07cb6bd55b5285db057eb4d831

> Supporting Hadoop data and cluster management
> ---------------------------------------------
>
>                 Key: VXQUERY-131
>                 URL: https://issues.apache.org/jira/browse/VXQUERY-131
>             Project: VXQuery
>          Issue Type: Improvement
>            Reporter: Preston Carman
>            Assignee: Preston Carman
>              Labels: gsoc, gsoc2015, hadoop, java, mentor, xml
>
> Many organizations support Hadoop. It would be nice to be able to read data 
> from this source. The project will include creating a strategy (with the 
> mentor's guidance) for reading XML data from HDFS and implementing it. When 
> connecting VXQuery to HDFS, the strategy may need to consider how to read 
> sections of an XML file. 
> In addition, we could use Yarn as our cluster manager. The Apache Hadoop YARN 
> (Yet Another Resource Negotiator) would be a good cluster management tool for 
> VXQuery. If VXQuery can read data from HDFS, then why not also manage the 
> cluster with a tool provided by Hadoop. The solution would replace the 
> current custom python scripts for cluster management.
> Goal
> - Read XML from HDFS
> - Manage the VXQuery cluster with Yarn



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to