Where's your driver code (the code interacting with the RDDs)? Are you getting serialization errors?
2014년 5월 17일 토요일, Samarth Mailinglist<mailinglistsama...@gmail.com>님이 작성한 메시지: > Hi all, > > I am trying to store the results of a reduce into mongo. > I want to share the variable "collection" in the mappers. > > > Here's what I have so far (I'm using pymongo) > > db = MongoClient()['spark_test_db'] > collec = db['programs'] > db = MongoClient()['spark_test_db'] > *collec = db['programs']* > > def mapper(val): > asc = val.encode('ascii','ignore') > json = convertToJSON(asc, indexMap) > collec.insert(json) # *this is not working* > > def convertToJSON(string, indexMap): > values = string.strip().split(",") > json = {} > for i in range(len(values)): > json[indexMap[i]] = values[i] > return json > > How do I do this? >