GitHub user yanlin-Lynn opened a pull request:
https://github.com/apache/bahir/pull/78
Bahir 183 branch
Currently in spark-sql-streaming-mqtt, the received mqtt message is saved
in a local file by driver, this will have the risks of losing data for cluster
mode when application master failover occurs. So add a hdfs-based mqtt source
to solve this problem.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/yanlin-Lynn/bahir bahir-183-branch
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/bahir/pull/78.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #78
----
commit 3a08b3f61cc7a3d8d6b142d2700fed9258194dee
Author: wangyanlin01 <wangyanlin01@...>
Date: 2018-12-02T03:00:21Z
[BAHIR-183]Using HDFS for saving message for mqtt source.
commit 515ffc7f62fcbb26f5c601401872bebcd882e170
Author: wangyanlin01 <wangyanlin01@...>
Date: 2018-12-09T08:13:25Z
code reuse and add unit test.
commit f0b93c0e9042279b388ff1bac9c7ecc0ddbfc696
Author: wangyanlin01 <wangyanlin01@...>
Date: 2018-12-09T13:06:27Z
rebase and add unit test.
----
---