Is there a way in spark to parse wikipedia xml dump? It seems like the freebase dump is longer available. Also does the spark shell support the xml load file sax parser that is present in scala.
Thanks AJ
Is there a way in spark to parse wikipedia xml dump? It seems like the freebase dump is longer available. Also does the spark shell support the xml load file sax parser that is present in scala.
Thanks AJ