Re: Numbering RDD members Sequentially

2015-03-11 Thread Mark Hamstra
om: Steve Lewis > Date: Wed, Mar 11, 2015 at 9:13 AM > Subject: Re: Numbering RDD members Sequentially > To: "Daniel, Ronald (ELS-SDG)" > > > perfect - exactly what I was looking for, not quite sure why it is called > zipWithIndex > since zipping is not involved &

Fwd: Numbering RDD members Sequentially

2015-03-11 Thread Steve Lewis
-- Forwarded message -- From: Steve Lewis Date: Wed, Mar 11, 2015 at 9:13 AM Subject: Re: Numbering RDD members Sequentially To: "Daniel, Ronald (ELS-SDG)" perfect - exactly what I was looking for, not quite sure why it is called zipWithIndex since zipping is not i

Numbering RDD members Sequentially

2015-03-10 Thread Steve Lewis
I have Hadoop Input Format which reads records and produces JavaPairRDD locatedData where _1() is a formatted version of the file location - like "12690",, "24386 ."27523 ... _2() is data to be processed For historical reasons I want to convert _1() into in integer representing the