Thanks for your interest in the project. We (or just me) are here to help. Do you have any questions about it?
On Wed, Mar 4, 2015 at 1:30 PM, sagarsharma (JIRA) <[email protected]> wrote:
>
> [
> https://issues.apache.org/jira/browse/VXQUERY-131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14347597#comment-14347597
> ]
>
> sagarsharma commented on VXQUERY-131:
> -------------------------------------
>
> i know some kind of Big-Data techniques like Hadoop , H-Base , Cassendra ,
> Hive and i really want to do this project so can anybody help me please
> .....
>
>
> > Supporting Hadoop data and cluster management
> > ---------------------------------------------
> >
> > Key: VXQUERY-131
> > URL: https://issues.apache.org/jira/browse/VXQUERY-131
> > Project: VXQuery
> > Issue Type: Improvement
> > Reporter: Preston Carman
> > Assignee: Preston Carman
> > Labels: gsoc, gsoc2015, hadoop, java, mentor, xml
> >
> > Many organizations support Hadoop. It would be nice to be able to read
> data from this source. The project will include creating a strategy (with
> the mentor's guidance) for reading XML data from HDFS and implementing it.
> When connecting VXQuery to HDFS, the strategy may need to consider how to
> read sections of an XML file.
> > In addition, we could use Yarn as our cluster manager. The Apache Hadoop
> YARN (Yet Another Resource Negotiator) would be a good cluster management
> tool for VXQuery. If VXQuery can read data from HDFS, then why not also
> manage the cluster with a tool provided by Hadoop. The solution would
> replace the current custom python scripts for cluster management.
> > Goal
> > - Read XML from HDFS
> > - Manage the VXQuery cluster with Yarn
> >
>
> --
> This message was sent by Atlassian JIRA
> (v6.3.4#6332)
>
