Hi Tom,

We recently added something like this: please take a look at
MultiVertexInputFormat. It can wrap any number of vertex input formats, each
reading from a different source. You can also look at HiveGiraphRunner to see
how it's used there. As for multiple vertex types, we don't support that
directly, but you can keep a field describing the vertex type inside your
vertex value.
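
To illustrate the vertex-type suggestion, here is a minimal sketch of a vertex
value that carries a type tag. The class name, the enum, and its two constants
are hypothetical and chosen only to match the HDFS/MySQL case in the question.
The sketch uses only the JDK so it can be run on its own; in a real job the
class would presumably also declare "implements org.apache.hadoop.io.Writable",
whose write/readFields signatures the two methods below already match.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

// Hypothetical vertex value that records which kind of vertex it belongs to.
// Assumption: in Giraph this would additionally implement Hadoop's Writable.
class TypedVertexValue {
    // Hypothetical vertex types, one per data source from the question.
    enum VertexType { HDFS, MYSQL }

    private VertexType type = VertexType.HDFS;
    private double value;

    // No-arg constructor so an empty instance can be created and then
    // filled by readFields().
    TypedVertexValue() { }

    TypedVertexValue(VertexType type, double value) {
        this.type = type;
        this.value = value;
    }

    VertexType getType() { return type; }

    double getValue() { return value; }

    // Serialize the type tag as one byte, followed by the payload.
    public void write(DataOutput out) throws IOException {
        out.writeByte(type.ordinal());
        out.writeDouble(value);
    }

    // Deserialize in the same order as write().
    public void readFields(DataInput in) throws IOException {
        type = VertexType.values()[in.readByte()];
        value = in.readDouble();
    }

    // Helper for trying the round trip locally, without a cluster.
    byte[] toBytes() {
        try {
            ByteArrayOutputStream buffer = new ByteArrayOutputStream();
            write(new DataOutputStream(buffer));
            return buffer.toByteArray();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Helper that rebuilds a value from bytes produced by toBytes().
    static TypedVertexValue fromBytes(byte[] bytes) {
        try {
            TypedVertexValue result = new TypedVertexValue();
            result.readFields(new DataInputStream(new ByteArrayInputStream(bytes)));
            return result;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

Each input format would then set the tag when it creates a vertex, and the
compute code can branch on getType() to treat the two kinds of vertices
differently.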

Hope this helps, please let us know if you have any questions!

Maja

From: Tom M <[email protected]>
Reply-To: [email protected]
Date: Monday, July 15, 2013 9:54 AM
To: [email protected]
Subject: Multiple Data Sources

Hi,

    I am new to Giraph. I am working on implementing a graph algorithm that
first reads vertex values from multiple sources (HDFS, MySQL). So basically, I
would have two types of vertices, and the values of each vertex type can be
read from a different data source. I know that in MR we can use DBInputFormat
to retrieve tuples from an RDBMS, for example, and then join them with data
read from HDFS. My question: can we do that in Giraph, i.e. can the graph be
constructed from different data sources? Thanks a lot in advance.

Best,
Tom
