I opened a JIRA issue with Spark, filed as an improvement rather than as a bug. Hopefully someone there will notice it.
From: Tobias Pfeiffer <t...@preferred.jp>
Reply-To: "user@spark.apache.org" <user@spark.apache.org>
Date: Thursday, July 3, 2014 at 9:41 PM
To: "user@spark.apache.org" <user@spark.apache.org>
Subject: Re: Kafka - streaming from multiple topics

Sergey,

On Fri, Jul 4, 2014 at 1:06 AM, Sergey Malov <sma...@collective.com> wrote:

On the other hand, under the hood, KafkaInputDStream, which is created by this KafkaUtils call, calls ConsumerConnector.createMessageStreams, which returns a Map[String, List[KafkaStream]] keyed by topic. It is, however, not exposed.

I wonder if this is a bug. After all, KafkaUtils.createStream() returns a DStream[(String, String)], which pretty much looks like it should be a (topic -> message) mapping. However, for me, the key is always null. Maybe you could consider filing a bug/wishlist report?

Tobias