Re: spark-deployer 3.0.1 released

2016-09-09 Thread Bhupendra Mishra
Grt will test and share feedback Sent from my iPhone > On 09-Sep-2016, at 9:37 PM, pishen tsai wrote: > > spark-deployer is a sbt plugin that help deploying a Spark stand-alone > cluster on EC2 and submit your Spark jobs. All the works are done in sbt. > > We just

Please help: Spark job hung/stop writing after exceeding the folder size

2016-08-10 Thread Bhupendra Mishra
Dear All, I have struggling with an issue where spark steam job gets hung after exceeding size of output folder path. here is more details: I have Flume sending and configuration agent1.sources = source1 agent1.sinks = sink1 agent1.channels = channel2 # Describe/configure source1

flume.thrift.queuesize

2016-08-05 Thread Bhupendra Mishra
Please suggest me where/in which file I should set/configure " flume.thrift.queuesize" Many Thanks!

Spark steaming with Flume jobs failing

2016-08-03 Thread Bhupendra Mishra
Hi team, I have integrated SparkSteam with Flume and my flume as well spark job gets failed and getting following error. Your help will be highly appreciative. Many Thanks my flume configuration is as follows flume.conf *** agent1.sources = source1 agent1.sinks = sink1 agent1.channels =

How to read content of hdfs files

2016-07-24 Thread Bhupendra Mishra
I have hdfs data in zip formate which includes data, name and nameseconday folder. Pretty much structure is like datanode, name node and secondary node. How to read the content of data. would be great if some can suggest tips/steps. Thanks

Re: ImportError: No module named numpy

2016-06-17 Thread Bhupendra Mishra
If on many, are you sure numpy is correctly installed on all machines? >> >> To check that the environment is set-up correctly, you can try something >> like >> >> import os >> pythonpaths = sc.range(10).map(lambda i: >> os.environ.get("PYTHONPATH&quo

Re: ImportError: No module named numpy

2016-06-02 Thread Bhupendra Mishra
did not resolved. :( On Thu, Jun 2, 2016 at 3:01 PM, Sergio Fernández <wik...@apache.org> wrote: > > On Thu, Jun 2, 2016 at 9:59 AM, Bhupendra Mishra < > bhupendra.mis...@gmail.com> wrote: >> >> and i have already exported environment variable in spark-e

Re: ImportError: No module named numpy

2016-06-02 Thread Bhupendra Mishra
ry adding to spark-env.sh (renaming if you still have it with .template > at the end): > > PYSPARK_PYTHON=/path/to/your/bin/python > > Where your bin/python is your actual Python environment with Numpy > installed. > > > El 1 jun 2016, a las 20:16, Bhupendra Mishra <

Re: ImportError: No module named numpy

2016-06-01 Thread Bhupendra Mishra
I have numpy installed but where I should setup PYTHONPATH? On Wed, Jun 1, 2016 at 11:39 PM, Sergio Fernández <wik...@apache.org> wrote: > sudo pip install numpy > > On Wed, Jun 1, 2016 at 5:56 PM, Bhupendra Mishra < > bhupendra.mis...@gmail.com> wrote: > >> Tha

Re: ImportError: No module named numpy

2016-06-01 Thread Bhupendra Mishra
Thanks . How can this be resolved? On Wed, Jun 1, 2016 at 9:02 PM, Holden Karau <hol...@pigscanfly.ca> wrote: > Generally this means numpy isn't installed on the system or your > PYTHONPATH has somehow gotten pointed somewhere odd, > > On Wed, Jun 1, 2016 at 8:31 AM

ImportError: No module named numpy

2016-06-01 Thread Bhupendra Mishra
If any one please can help me with following error. File "/opt/mapr/spark/spark-1.6.1/python/lib/pyspark.zip/pyspark/mllib/__init__.py", line 25, in ImportError: No module named numpy Thanks in advance!

spark-ts

2016-04-27 Thread Bhupendra Mishra
Guys, please help me with following question on Spark-TS liabrary You’ve just acquired a new dataset showing the purchases of stock from market resellers during the day over a ten month period. You’ve looked at the daily data and have decided that you can model this using a time series analysis.