Have you checked that your hashing function doesn't produce any collisions? Node ids have to be unique, so if your hasher returns the same id for two different pages, the duplicate ids can be dropped and you lose vertices.
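A quick way to test both suspicions is to run the ids through the hasher on the driver before building the graph. This is only a rough sketch in plain Scala: the `pageHash` below is a hypothetical stand-in (the first 8 bytes of an MD5 digest), since I haven't seen your actual definition, and `IdCheck`, `collisions`, and `danglingEndpoints` are names I made up for illustration.

```scala
import java.nio.ByteBuffer
import java.nio.charset.StandardCharsets
import java.security.MessageDigest

object IdCheck {
  // Hypothetical stand-in for the poster's pageHash: the first 8 bytes
  // of the MD5 digest, read as a Long (GraphX vertex ids are Longs).
  def pageHash(title: String): Long = {
    val digest = MessageDigest.getInstance("MD5")
      .digest(title.getBytes(StandardCharsets.UTF_8))
    ByteBuffer.wrap(digest).getLong
  }

  // Every hashed id that is produced by more than one distinct string.
  def collisions(ids: Iterable[String], hash: String => Long): Map[Long, Seq[String]] =
    ids.toSeq.distinct.groupBy(hash).filter { case (_, names) => names.size > 1 }

  // Edge endpoints whose hashed id matches no vertex id. A non-empty result
  // means the vertices and edges were not keyed consistently.
  def danglingEndpoints(
      vertexNames: Iterable[String],
      edges: Iterable[(String, String)],
      hash: String => Long): Set[String] = {
    val vertexIds = vertexNames.map(hash).toSet
    edges.flatMap { case (src, dst) => Seq(src, dst) }
      .filterNot(name => vertexIds.contains(hash(name)))
      .toSet
  }
}
```

If `collisions` comes back non-empty, the hasher is the problem. If `danglingEndpoints` comes back non-empty, the ids differ between the 'vertices' and 'edges' objects, which usually means the two were not built from the same strings or with the same function.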
On Sat, Sep 26, 2015 at 10:37 AM JJ <jessjeff...@gmail.com> wrote:

> Here is all of my code. My first post had a simplified version. As I post
> this, I realize one issue may be that when I convert my Ids to long (I
> define a pageHash function to convert string Ids to long), the nodeIds are
> no longer the same between the 'vertices' object and the 'edges' object. Do
> you think this is what is causing the issue?
>
> --
> View this message in context:
> http://apache-spark-user-list.1001560.n3.nabble.com/GraphX-create-graph-with-multiple-node-attributes-tp24827p24832.html
> Sent from the Apache Spark User List mailing list archive at Nabble.com.