hello,
i hope this is the right place for this question.
i'm currently experimenting with and comparing flink/stratosphere and
apache spark.
my goal is to analyse large json files of twitter data, and i'm
looking for a way to parse the json tuples in a map function and put
them into a dataset.
for this i'm using the flink scala api and json4s.
but in flink, parsing the json file is where i run into a problem:

val words = cleaned.map { line => parse(line) }
the error message is:

Error analyzing UDT org.json4s.JValue:
Subtype org.json4s.JsonAST.JInt - Field num: BigInt - Unsupported type BigInt
Subtype org.json4s.JsonAST.JArray - Field arr: List[org.json4s.JsonAST.JValue] - Subtype org.json4s.JsonAST.JInt - Field num: BigInt - Unsupported type BigInt
Subtype org.json4s.JsonAST.JArray - Field arr: List[org.json4s.JsonAST.JValue] - Subtype org.json4s.JsonAST.JDecimal - Field num: BigDecimal - Unsupported type BigDecimal
Subtype org.json4s.JsonAST.JDecimal - Field num: BigDecimal - Unsupported type BigDecimal
Subtype org.json4s.JsonAST.JObject - Field obj: List[(String, org.json4s.JsonAST.JValue)] - Field _2: org.json4s.JsonAST.JValue - Subtype org.json4s.JsonAST.JInt - Field num: BigInt - Unsupported type BigInt
Subtype org.json4s.JsonAST.JObject - Field obj: List[(String, org.json4s.JsonAST.JValue)] - Field _2: org.json4s.JsonAST.JValue - Subtype org.json4s.JsonAST.JDecimal - Field num: BigDecimal - Unsupported type BigDecimal
in spark i found a working approach based on
https://gist.github.com/cotdp/b471cfff183b59d65ae1:
val user_interest = lines.map(line => parse(line))
  .map(json => {
    implicit lazy val formats = org.json4s.DefaultFormats
    val name = (json \ "name").extract[String]
    val location_x = (json \ "location" \ "x").extract[Double]
    val location_y = (json \ "location" \ "y").extract[Double]
    val likes = (json \ "likes").extract[Seq[String]].map(_.toLowerCase()).mkString(";")
    UserInterest(name, location_x, location_y, likes)
  })
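if i read the error correctly, flink only complains because the map's
return type is JValue itself (with its BigInt/BigDecimal fields), so my
idea would be to fuse parsing and extraction into a single map, so that
only a case class with supported field types ever leaves the udf. a
rough, untested sketch (UserInterest and the field names are just taken
over from the spark example above):

```scala
import org.json4s._
import org.json4s.native.JsonMethods._

// a plain case class with primitive/string fields, which flink's
// type analyzer should accept (unlike the json4s JValue tree)
case class UserInterest(name: String, locationX: Double, locationY: Double, likes: String)

val userInterest = cleaned.map { line =>
  implicit lazy val formats = org.json4s.DefaultFormats
  val json = parse(line) // the JValue stays local to the udf and is never serialized
  val name = (json \ "name").extract[String]
  val locationX = (json \ "location" \ "x").extract[Double]
  val locationY = (json \ "location" \ "y").extract[Double]
  val likes = (json \ "likes").extract[Seq[String]].map(_.toLowerCase).mkString(";")
  // only the case class with supported field types is returned to flink
  UserInterest(name, locationX, locationY, likes)
}
```

would something along these lines work, or does the type analyzer still
reject it?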
this works fine in spark, but is it possible to do the same with flink?
kind regards,
norman