Re: running pyspark on kubernetes - no space left on device

2022-09-01 Thread Qian SUN
Hi, Spark provides the spark.local.dir configuration to specify the work folder on the pod. You can specify spark.local.dir as your mount path. Best regards. Manoj GEORGE wrote on Thu, Sep 1, 2022 at 21:16: > CONFIDENTIAL & RESTRICTED > > Hi Team, > > > > I am new to spark, so please excuse my ignorance. > > > >

Re: running pyspark on kubernetes - no space left on device

2022-09-01 Thread Matt Proetsch
Hi George, You can try mounting a larger PersistentVolume to the work directory as described here instead of using localdir which might have site-specific size constraints: https://spark.apache.org/docs/latest/running-on-kubernetes.html#using-kubernetes-volumes -Matt > On Sep 1, 2022, at
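A minimal sketch of the volume-mount suggestion in this thread, written as a Python helper that only assembles `--conf` flags for spark-submit. The property names follow the `spark.kubernetes.executor.volumes.[VolumeType].[VolumeName]...` pattern from the page linked above, which also states that a volume whose name starts with `spark-local-dir-` is used as local (spill) storage. The claim name `spark-scratch-pvc` and the mount path `/data/spark-scratch` are hypothetical placeholders, and whether one claim can be shared by several executors depends on the cluster's storage setup.

```python
LOCAL_DIR_PREFIX = "spark-local-dir-"


def k8s_scratch_conf(claim_name, mount_path, volume_name="spark-local-dir-1"):
    """Return spark-submit --conf flags that mount a PVC on executors as scratch space.

    claim_name and mount_path are site-specific values (assumptions in this sketch).
    """
    if not volume_name.startswith(LOCAL_DIR_PREFIX):
        raise ValueError("volume name must start with " + LOCAL_DIR_PREFIX)
    prefix = (
        "spark.kubernetes.executor.volumes.persistentVolumeClaim." + volume_name
    )
    conf = {
        prefix + ".options.claimName": claim_name,
        prefix + ".mount.path": mount_path,
        prefix + ".mount.readOnly": "false",
    }
    flags = []
    for key, value in conf.items():
        flags += ["--conf", key + "=" + value]
    return flags


if __name__ == "__main__":
    # Hypothetical claim name and mount path, for illustration only.
    print(" ".join(k8s_scratch_conf("spark-scratch-pvc", "/data/spark-scratch")))
```

The helper returns a flat list so it can be spliced into a `subprocess` argument list for spark-submit without shell quoting.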

running pyspark on kubernetes - no space left on device

2022-09-01 Thread Manoj GEORGE
CONFIDENTIAL & RESTRICTED Hi Team, I am new to spark, so please excuse my ignorance. Currently we are trying to run PySpark on a Kubernetes cluster. The setup is working fine for some jobs, but when we are processing a large file (36 GB), we run into out-of-space issues. Based on what was

Re: com.esotericsoftware.kryo.KryoException: java.io.IOException: No space left on device

2021-03-08 Thread Sachit Murarka
Thanks Sean. Kind Regards, Sachit Murarka On Mon, Mar 8, 2021 at 6:23 PM Sean Owen wrote: > It's there in the error: No space left on device > You ran out of disk space (local disk) on one of your machines. > > On Mon, Mar 8, 2021 at 2:02 AM Sachit Murarka > wrote: > >

Re: com.esotericsoftware.kryo.KryoException: java.io.IOException: No space left on device

2021-03-08 Thread Sean Owen
It's there in the error: No space left on device You ran out of disk space (local disk) on one of your machines. On Mon, Mar 8, 2021 at 2:02 AM Sachit Murarka wrote: > Hi All, > > I am getting the following error in my spark job. > > Can someone ple

Re: com.esotericsoftware.kryo.KryoException: java.io.IOException: No space left on device

2021-03-08 Thread Sachit Murarka
failed 4 times, most recent failure: Lost task 0.3 in stage >> 41.0 (TID 80817, executor 193): com.esotericsoftware.kryo.KryoException: >> java.io.IOException: No space left on device\n\tat >> com.esotericsoftware.kryo.io.Output.flush(Output.java:188)\n\tat >> com.esotericsoftware.k

Re: com.esotericsoftware.kryo.KryoException: java.io.IOException: No space left on device

2021-03-08 Thread Gourav Sengupta
t failure: Lost task 0.3 in stage > 41.0 (TID 80817, executor 193): com.esotericsoftware.kryo.KryoException: > java.io.IOException: No space left on device\n\tat > com.esotericsoftware.kryo.io.Output.flush(Output.java:188)\n\tat > com.esotericsoftware.kryo.io.O

com.esotericsoftware.kryo.KryoException: java.io.IOException: No space left on device

2021-03-08 Thread Sachit Murarka
): com.esotericsoftware.kryo.KryoException: java.io.IOException: No space left on device\n\tat com.esotericsoftware.kryo.io.Output.flush(Output.java:188)\n\tat com.esotericsoftware.kryo.io.Output.require(Output.java:164)\n\tat com.esotericsoftware.kryo.io.Output.writeBytes(Output.java:251)\n\tat

Re: No space left on device

2018-08-22 Thread Gourav Sengupta
cause though. I'll be happy to >>> hear deeper insight as well. >>> >>> On Mon, Aug 20, 2018 at 7:08 PM, Steve Lewis >>> wrote: >>> >>>> >>>> We are trying to run a job that has previously run on Spark 1.3 on a >>>> diffe

Re: No space left on device

2018-08-22 Thread Vitaliy Pisarev
o run a job that has previously run on Spark 1.3 on a >>> different cluster. The job was converted to 2.3 spark and this is a new >>> cluster. >>> >>> The job dies after completing about a half dozen stages with >>> >>> java.io.IOException

Re: No space left on device

2018-08-22 Thread Gourav Sengupta
reviously run on Spark 1.3 on a >> different cluster. The job was converted to 2.3 spark and this is a new >> cluster. >> >> The job dies after completing about a half dozen stages with >> >> java.io.IOException: No space left on device >> >> >>

Re: No space left on device

2018-08-21 Thread Vitaliy Pisarev
at has previously run on Spark 1.3 on a > different cluster. The job was converted to 2.3 spark and this is a new > cluster. > > The job dies after completing about a half dozen stages with > > java.io.IOException: No space left on device > > >It appears that

No space left on device

2018-08-20 Thread Steve Lewis
We are trying to run a job that has previously run on Spark 1.3 on a different cluster. The job was converted to 2.3 spark and this is a new cluster. The job dies after completing about a half dozen stages with java.io.IOException: No space left on device It appears that the nodes

Re: java.nio.file.FileSystemException: /tmp/spark- .._cache : No space left on device

2018-08-19 Thread naresh Goud
java.nio.file.FileSystemException: >> /tmp/spark-523d5331-3884-440c-ac0d-f46838c2029f/executor-390c9cd7-217e-42f3-97cb-fa2734405585/spark-206d92c0-f0d3-443c-97b2-39494e2c5fdd/-4230744641534510169119_cache >> -> ./PublishGainersandL

Re: java.nio.file.FileSystemException: /tmp/spark- .._cache : No space left on device

2018-08-17 Thread Jeevan K. Srivatsa
ge > 0.0 (TID 6, 172.29.62.145, executor 0): java.nio.file.FileSystemException: > /tmp/spark-523d5331-3884-440c-ac0d-f46838c2029f/executor-390c9cd7-217e-42f3-97cb-fa2734405585/spark-206d92c0-f0d3-443c-97b2-39494e2c5fdd/-4230744641534510169119_cache > -> ./PublishGainersandLosers-1.

java.nio.file.FileSystemException: /tmp/spark- .._cache : No space left on device

2018-08-17 Thread Polisetti, Venkata Siva Rama Gopala Krishna
/-4230744641534510169119_cache -> ./PublishGainersandLosers-1.0-SNAPSHOT-shaded-Gopal.jar: No space left on device at sun.nio.fs.UnixException.translateToIOException(UnixException.java:91) at sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:

Re: Iterative rdd union + reduceByKey operations on small dataset leads to "No space left on device" error on account of lot of shuffle spill.

2018-07-27 Thread Dinesh Dharme
rectory, i.e. SPARK_LOCAL_DIRS, to be one > > having 100 GB space. > > The hierarchical dataset, whose size is (< 400kB), remains constant > > throughout the iterations. > > I have tried the worker cleanup flag but it has no effect i.e. > > "spark.worker.

Re: Iterative rdd union + reduceByKey operations on small dataset leads to "No space left on device" error on account of lot of shuffle spill.

2018-07-27 Thread Vadim Semenov
I have tried the worker cleanup flag but it has no effect i.e. > "spark.worker.cleanup.enabled=true" > > > > Error : > Caused by: java.io.IOException: No space left on device > at java.io.FileOutputStream.writeBytes(Native Me

Iterative rdd union + reduceByKey operations on small dataset leads to "No space left on device" error on account of lot of shuffle spill.

2018-07-27 Thread dineshdharme
. SPARK_LOCAL_DIRS, to be one having 100 GB space. The hierarchical dataset, whose size is (< 400kB), remains constant throughout the iterations. I have tried the worker cleanup flag but it has no effect i.e. "spark.worker.cleanup.enabled=true" Error : Caused by: java.io.IOException: No space l

Re: No space left on device

2017-10-17 Thread Imran Rajjad
<raj...@gmail.com> wrote: > >> did you try calling rdd.unpersist() >> >> On Wed, Oct 18, 2017 at 10:04 AM, Mina Aslani <aslanim...@gmail.com> >> wrote: >> >>> Hi, >>> >>> I get "No space left on device" error

Re: No space left on device

2017-10-17 Thread Mina Aslani
wrote: > >> Hi, >> >> I get "No space left on device" error in my spark worker: >> >> Error writing stream to file /usr/spark-2.2.0/work/app-.../0/stderr >> java.io.IOException: No space left on device >> >> In my spark cluste

Re: No space left on device

2017-10-17 Thread Imran Rajjad
did you try calling rdd.unpersist() On Wed, Oct 18, 2017 at 10:04 AM, Mina Aslani <aslanim...@gmail.com> wrote: > Hi, > > I get "No space left on device" error in my spark worker: > > Error writing stream to file /usr/spark-2.2.0/work/app-.../0/stderr >

Re: No space left on device

2017-10-17 Thread Chetan Khatri
Process data in micro batches. On 18-Oct-2017 10:36 AM, "Chetan Khatri" <chetan.opensou...@gmail.com> wrote: > Your hard drive doesn't have much space > On 18-Oct-2017 10:35 AM, "Mina Aslani" <aslanim...@gmail.com> wrote: > >> Hi, >> >>

Re: No space left on device

2017-10-17 Thread Chetan Khatri
Your hard drive doesn't have much space. On 18-Oct-2017 10:35 AM, "Mina Aslani" <aslanim...@gmail.com> wrote: > Hi, > > I get "No space left on device" error in my spark worker: > > Error writing stream to file /usr/spark-2.2.0/work/app-.../0/stderr >

No space left on device

2017-10-17 Thread Mina Aslani
Hi, I get "No space left on device" error in my spark worker: Error writing stream to file /usr/spark-2.2.0/work/app-.../0/stderr java.io.IOException: No space left on device In my spark cluster, I have one worker and one master. My program consumes stream of data from kafka and

RE: No space left on device when running graphx job

2015-10-05 Thread Jack Yang
September 2015 12:27 AM To: Jack Yang Cc: Ted Yu; Andy Huang; user@spark.apache.org Subject: Re: No space left on device when running graphx job Would you mind sharing what your solution was? It would help those on the forum who might run into the same problem. Even if it’s a silly ‘gotcha

Re: No space left on device when running graphx job

2015-09-24 Thread Ted Yu
4 machines + 4G memory + 4 CPU >> cores) >> >> Basically, I load data using GraphLoader.edgeListFile method and then count >> number of nodes using: graph.vertices.count() method. >> >> The problem is : >> >> >> >> Lost task 11972.0 in sta

No space left on device when running graphx job

2015-09-24 Thread Jack Yang
): java.io.IOException: No space left on device at java.io.FileOutputStream.writeBytes(Native Method) at java.io.FileOutputStream.write(FileOutputStream.java:345) when I try a small amount of data, the code is working. So I guess the error comes from the amount of data. This is how

Re: No space left on device when running graphx job

2015-09-24 Thread Andy Huang
4.0 + 4 machines + 4G memory + 4 CPU > cores) > > Basically, I load data using GraphLoader.edgeListFile method and then count > number of nodes using: graph.vertices.count() method. > > The problem is : > > > > *Lost task 11972.0 in stage 6.0 (TID 54585, 192.168.70.1

RE: No space left on device when running graphx job

2015-09-24 Thread Jack Yang
Hi all, I resolved the problems. Thanks folk. Jack From: Jack Yang [mailto:j...@uow.edu.au] Sent: Friday, 25 September 2015 9:57 AM To: Ted Yu; Andy Huang Cc: user@spark.apache.org Subject: RE: No space left on device when running graphx job Also, please see the screenshot below from spark web

Re: SparkContext initialization error- java.io.IOException: No space left on device

2015-09-06 Thread shenyan zhen
>> >> 2015-09-05 9:50 GMT+08:00 shenyan zhen <shenya...@gmail.com>: >> >>> Has anyone seen this error? Not sure which dir the program was trying to >>> write to. >>> >>> I am running Spark 1.4.1, submitting Spark job to Yarn, in yarn-cli

Re: SparkContext initialization error- java.io.IOException: No space left on device

2015-09-06 Thread Shixiong Zhu
to. > > I am running Spark 1.4.1, submitting Spark job to Yarn, in yarn-client > mode. > > 15/09/04 21:36:06 ERROR SparkContext: Error adding jar > (java.io.IOException: No space left on device), was the --addJars option > used? > > 15/09/04 21:36:08 ERROR SparkContex

Re: SparkContext initialization error- java.io.IOException: No space left on device

2015-09-06 Thread Ted Yu
>> I am running Spark 1.4.1, submitting Spark job to Yarn, in yarn-client >> mode. >> >> 15/09/04 21:36:06 ERROR SparkContext: Error adding jar >> (java.io.IOException: No space left on device), was the --addJars option >> used? >> >>

SparkContext initialization error- java.io.IOException: No space left on device

2015-09-04 Thread shenyan zhen
Has anyone seen this error? Not sure which dir the program was trying to write to. I am running Spark 1.4.1, submitting Spark job to Yarn, in yarn-client mode. 15/09/04 21:36:06 ERROR SparkContext: Error adding jar (java.io.IOException: No space left on device), was the --addJars option used

Re: java.io.IOException: No space left on device--regd.

2015-07-06 Thread Akhil Das
on Spark which involves an expensive shuffle operation. Basically I require a self-join to be performed on a sparkDataFrame RDD. The job runs fine for around 15 hours, and when the stage (which performs the self-join) is about to complete, I get a *java.io.IOException: No space left on device*. I

Re: java.io.IOException: No space left on device--regd.

2015-07-06 Thread Akhil Das
require a self-join to be performed on a sparkDataFrame RDD. The job runs fine for around 15 hours, and when the stage (which performs the self-join) is about to complete, I get a *java.io.IOException: No space left on device*. I initially thought this could be due to *spark.local.dir* pointing

java.io.IOException: No space left on device--regd.

2015-07-05 Thread Devarajan Srinivasan
: No space left on device*. I initially thought this could be due to *spark.local.dir* pointing to the */tmp* directory, which was configured with *2GB* of space; since this job requires expensive shuffles, Spark requires more space to write the shuffle files. Hence I configured *spark.local.dir* to point

No space left on device??

2015-05-06 Thread Yifan LI
Hi, I am running my GraphX application on Spark, but it failed since there is an error on one executor node (on which the available HDFS space is small) that “no space left on device”. I can understand why it happened, because my vertex(-attribute) RDD was becoming bigger and bigger during

Re: No space left on device??

2015-05-06 Thread Saisai Shao
since there is an error on one executor node (on which the available HDFS space is small) that “no space left on device”. I can understand why it happened, because my vertex(-attribute) RDD was becoming bigger and bigger during computation…, so maybe sometime the request on that node was too big

Re: No space left on device??

2015-05-06 Thread Yifan LI
:35 GMT+08:00 Yifan LI iamyifa...@gmail.com: Hi, I am running my GraphX application on Spark, but it failed since there is an error on one executor node (on which the available HDFS space is small) that “no space left on device”. I can understand why it happened

Re: No space left on device??

2015-05-06 Thread Saisai Shao
manager. 2015-05-06 20:35 GMT+08:00 Yifan LI iamyifa...@gmail.com: Hi, I am running my GraphX application on Spark, but it failed since there is an error on one executor node (on which the available HDFS space is small) that “no space left on device”. I can understand why it happened, because my

Re: No space left on device??

2015-05-06 Thread Yifan LI
, but it failed since there is an error on one executor node (on which the available HDFS space is small) that “no space left on device”. I can understand why it happened, because my vertex(-attribute) RDD was becoming bigger and bigger during computation…, so maybe sometime the request on that node

Re: No space left on device??

2015-05-06 Thread Saisai Shao
set by the cluster manager. 2015-05-06 20:35 GMT+08:00 Yifan LI iamyifa...@gmail.com: Hi, I am running my GraphX application on Spark, but it failed since there is an error on one executor node (on which the available HDFS space is small) that “no space left on device”. I can understand why

Re: java.io.IOException: No space left on device while doing repartitioning in Spark

2015-05-05 Thread Akhil Das
on device exception when repartitioning approx. 285 MB of data while there is still 2 GB of space left. Does it mean that repartitioning needs more space (more than 2 GB) for 285 MB of data? best, /Shahab java.io.IOException: No space left on device

java.io.IOException: No space left on device while doing repartitioning in Spark

2015-05-04 Thread shahab
Hi, I am getting a No space left on device exception when repartitioning approx. 285 MB of data while there is still 2 GB of space left. Does it mean that repartitioning needs more space (more than 2 GB) for 285 MB of data? best, /Shahab java.io.IOException

Re: java.io.IOException: No space left on device while doing repartitioning in Spark

2015-05-04 Thread Ted Yu
See https://wiki.gentoo.org/wiki/Knowledge_Base:No_space_left_on_device_while_there_is_plenty_of_space_available What's the value for spark.local.dir property ? Cheers On Mon, May 4, 2015 at 6:57 AM, shahab shahab.mok...@gmail.com wrote: Hi, I am getting No space left on device exception

Re: java.io.IOException: No space left on device

2015-04-29 Thread Dean Wampler
8.1M). The problem is that when trying to persist a partition into memory and there is not enough memory, the partition is persisted on disk, and despite having 229G of free disk space, I got No space left on device. This is how I'm running the program : ./spark-submit --class

Re: java.io.IOException: No space left on device

2015-04-29 Thread Dean Wampler
of free disk space, I got No space left on device.. This is how I'm running the program : ./spark-submit --class com.custom.sentimentAnalysis.MainPipeline --master local[2] --driver-memory 5g ml_pipeline.jar labeledTrainData.tsv testData.tsv And this is a part of the log: If you need more

java.io.IOException: No space left on device

2015-04-29 Thread Selim Namsi
229G of free disk space, I got No space left on device.. This is how I'm running the program : ./spark-submit --class com.custom.sentimentAnalysis.MainPipeline --master local[2] --driver-memory 5g ml_pipeline.jar labeledTrainData.tsv testData.tsv And this is a part of the log: If you need more

Re: java.io.IOException: No space left on device

2015-04-29 Thread Anshul Singhle
). The problem is that when trying to persist a partition into memory and there is not enough memory, the partition is persisted on disk, and despite having 229G of free disk space, I got No space left on device. This is how I'm running the program : ./spark-submit --class

Re: java.io.IOException: No space left on device

2015-04-29 Thread selim namsi
got No space left on device. This is how I'm running the program : ./spark-submit --class com.custom.sentimentAnalysis.MainPipeline --master local[2] --driver-memory 5g ml_pipeline.jar labeledTrainData.tsv testData.tsv And this is a part of the log: If you need more information

Re: java.io.IOException: No space left on device

2015-04-29 Thread selim namsi
Sorry I put the log messages when creating the thread in http://apache-spark-user-list.1001560.n3.nabble.com/java-io-IOException-No-space-left-on-device-td22702.html but I forgot that raw messages will not be sent in emails. So this is the log related to the error : 15/04/29 02:48:50 INFO

FileNotFoundException (No space left on device) writing to S3

2014-08-27 Thread Daniil Osipov
): java.io.FileNotFoundException: /mnt/spa$ k/spark-local-20140827191008-05ae/0c/shuffle_1_7570_5768 (No space left on device) java.io.FileOutputStream.open(Native Method) java.io.FileOutputStream.<init>(FileOutputStream.java:221) org.apache.spark.storage.DiskBlockObjectWriter.open

No space left on device

2014-08-09 Thread kmatzen
I need some configuration / debugging recommendations to work around no space left on device. I am completely new to Spark, but I have some experience with Hadoop. I have a task where I read images stored in sequence files from s3://, process them with a map in scala, and write the result back

Re: No space left on device

2014-08-09 Thread Matei Zaharia
directory to someplace on /mnt instead. If it's /tmp, you can set java.io.tmpdir to another directory in Spark's JVM options. Matei On August 8, 2014 at 11:02:48 PM, kmatzen (kmat...@gmail.com) wrote: I need some configuration / debugging recommendations to work around no space left on device. I am

Re: No space left on device

2014-08-09 Thread Jim Donahue
matei.zaha...@gmail.com Date: Saturday, August 9, 2014 at 1:48 PM To: u...@spark.incubator.apache.org, kmatzen kmat...@gmail.com Subject: Re: No space left on device

Re: Error: No space left on device

2014-07-17 Thread Chris DuBois
, Chris DuBois chris.dub...@gmail.com wrote: Hi all, I am encountering the following error: INFO scheduler.TaskSetManager: Loss was due to java.io.IOException: No space left on device [duplicate 4] For each slave, df -h looks roughly

Re: Error: No space left on device

2014-07-17 Thread Bill Jay
...@gmail.com wrote: Hi all, I am encountering the following error: INFO scheduler.TaskSetManager: Loss was due to java.io.IOException: No space left on device [duplicate 4] For each slave, df -h looks roughly like this, which makes

Error: No space left on device

2014-07-16 Thread Chris DuBois
Hi all, I am encountering the following error: INFO scheduler.TaskSetManager: Loss was due to java.io.IOException: No space left on device [duplicate 4] For each slave, df -h looks roughly like this, which makes the above error surprising. Filesystem Size Used Avail Use% Mounted

Re: Error: No space left on device

2014-07-16 Thread Xiangrui Meng
: No space left on device [duplicate 4] For each slave, df -h looks roughly like this, which makes the above error surprising. Filesystem Size Used Avail Use% Mounted on /dev/xvda1 7.9G 4.4G 3.5G 57% / tmpfs 7.4G 4.0K 7.4G 1% /dev/shm /dev/xvdb

Re: Error: No space left on device

2014-07-16 Thread Chris DuBois
(df -i). The assembly build may create many small files. -Xiangrui On Tue, Jul 15, 2014 at 11:35 PM, Chris DuBois chris.dub...@gmail.com wrote: Hi all, I am encountering the following error: INFO scheduler.TaskSetManager: Loss was due to java.io.IOException: No space left on device
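The inode check suggested in this reply (`df -i` next to `df -h`) can also be done programmatically. The following is a rough sketch using only the Python standard library; the function name `fs_usage` is made up for illustration, and the percentages are computed from raw `statvfs` counters, so they may differ slightly from what `df` prints.

```python
import os


def fs_usage(path):
    """Return the percentage of blocks and inodes in use on the filesystem holding `path`.

    A value of None means the filesystem does not report that counter.
    """
    st = os.statvfs(path)

    def pct(total, free):
        # Some filesystems report 0 total inodes; avoid dividing by zero.
        if total == 0:
            return None
        return round(100.0 * (total - free) / total, 1)

    return {
        "blocks_used_pct": pct(st.f_blocks, st.f_bfree),
        "inodes_used_pct": pct(st.f_files, st.f_ffree),
    }


if __name__ == "__main__":
    # A disk can show free space in blocks while inodes are exhausted.
    print(fs_usage("/tmp"))
```

If `inodes_used_pct` is close to 100 while `blocks_used_pct` is low, the "No space left on device" error is most likely caused by many small files rather than by data volume.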

Re: Error: No space left on device

2014-07-16 Thread Chris Gore
scheduler.TaskSetManager: Loss was due to java.io.IOException: No space left on device [duplicate 4] For each slave, df -h looks roughly like this, which makes the above error surprising. Filesystem Size Used Avail Use% Mounted on /dev/xvda1 7.9G 4.4G 3.5G 57

Re: Error: No space left on device

2014-07-16 Thread Chris DuBois
, Chris DuBois chris.dub...@gmail.com wrote: Hi all, I am encountering the following error: INFO scheduler.TaskSetManager: Loss was due to java.io.IOException: No space left on device [duplicate 4] For each slave, df -h looks roughly like this, which makes the above error

Re: Error: No space left on device

2014-07-16 Thread Xiangrui Meng
encountering the following error: INFO scheduler.TaskSetManager: Loss was due to java.io.IOException: No space left on device [duplicate 4] For each slave, df -h looks roughly like this, which makes the above error surprising. Filesystem Size Used Avail Use% Mounted

Re: Error: No space left on device

2014-07-16 Thread Chris DuBois
: INFO scheduler.TaskSetManager: Loss was due to java.io.IOException: No space left on device [duplicate 4] For each slave, df -h looks roughly like this, which makes the above error surprising. Filesystem Size Used Avail Use% Mounted on /dev/xvda1 7.9G

Re: Error: No space left on device

2014-07-16 Thread Chris DuBois
, Chris DuBois chris.dub...@gmail.com wrote: Hi all, I am encountering the following error: INFO scheduler.TaskSetManager: Loss was due to java.io.IOException: No space left on device [duplicate 4] For each slave, df -h looks roughly like this, which makes

Re: Error: No space left on device

2014-07-16 Thread Chris DuBois
left on device [duplicate 4] For each slave, df -h looks roughly like this, which makes the above error surprising. Filesystem Size Used Avail Use% Mounted on /dev/xvda1 7.9G 4.4G 3.5G 57% / tmpfs 7.4G 4.0K 7.4G

Re: No space left on device error when pulling data from s3

2014-05-15 Thread darkjh
this in the spark-ec2 script. Writing lots of tmp files in the 8GB `/` is not a great idea. -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/No-space-left-on-device-error-when-pulling-data-from-s3-tp5450p5518.html Sent from the Apache Spark User List mailing list archive

No space left on device error when pulling data from s3

2014-05-06 Thread Han JU
Hi, I've a `no space left on device` exception when pulling some 22GB of data from s3 block storage to the ephemeral HDFS. The cluster is on EC2 using the spark-ec2 script with 4 m1.large. The code is basically: `val in = sc.textFile("s3://...")` `in.saveAsTextFile("hdfs://...")` Spark creates 750 input

Re: No space left on device error when pulling data from s3

2014-05-06 Thread Akhil Das
I wonder why your / is full. Try clearing out /tmp, and also make sure that in spark-env.sh you have put SPARK_JAVA_OPTS+=" -Dspark.local.dir=/mnt/spark" Thanks Best Regards On Tue, May 6, 2014 at 9:35 PM, Han JU ju.han.fe...@gmail.com wrote: Hi, I've a `no space left on device` exception

Re: No space left on device error when pulling data from s3

2014-05-06 Thread Han JU
write temp files to /tmp/hadoop-root ? 2014-05-06 18:05 GMT+02:00 Han JU ju.han.fe...@gmail.com: Hi, I've a `no space left on device` exception when pulling some 22GB data from s3 block storage to the ephemeral HDFS. The cluster is on EC2 using spark-ec2 script with 4 m1.large. The code

Re: No space left on device exception

2014-03-24 Thread Ognen Duzlevski
this thread). What is interesting is that only two out of the 16 slaves had this problem :) Ognen On 3/24/14, 12:57 AM, Patrick Wendell wrote: Ognen - just so I understand. The issue is that there weren't enough inodes and this was causing a No space left on device error? Is that correct

Re: No space left on device exception

2014-03-24 Thread Ognen Duzlevski
is that there weren't enough inodes and this was causing a No space left on device error? Is that correct? If so, that's good to know because it's definitely counter intuitive. On Sun, Mar 23, 2014 at 8:36 PM, Ognen Duzlevski og...@nengoiksvelzud.com wrote: I would love to work on this (and other) stuff if I

No space left on device exception

2014-03-23 Thread Ognen Duzlevski
4 times (most recent failure: Exception failure: java.io.FileNotFoundException: /tmp/spark-local-20140323214638-72df/31/shuffle_31_3_127 (No space left on device)) org.apache.spark.SparkException: Job aborted: Task 167.0:3 failed 4 times (most recent failure: Exception failure

Re: No space left on device exception

2014-03-23 Thread Aaron Davidson
: java.io.FileNotFoundException: /tmp/spark-local-20140323214638-72df/31/shuffle_31_3_127 (No space left on device)) org.apache.spark.SparkException: Job aborted: Task 167.0:3 failed 4 times (most recent failure: Exception failure: java.io.FileNotFoundException: /tmp/spark-local-20140323214638-72df/31

Re: No space left on device exception

2014-03-23 Thread Ognen Duzlevski
On 3/23/14, 5:49 PM, Matei Zaharia wrote: You can set spark.local.dir to put this data somewhere other than /tmp if /tmp is full. Actually it’s recommended to have multiple local disks and set it to a comma-separated list of directories, one per disk. Matei, does the number of tasks/partitions
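As a small illustration of the comma-separated form described in this reply, the sketch below builds a `spark.local.dir` setting from a list of directories, one per disk. The helper name and the example paths `/mnt/disk1/spark` and `/mnt/disk2/spark` are hypothetical. Note also that, as other replies in this listing mention, the value may be overridden by settings from the cluster manager.

```python
def local_dir_setting(dirs):
    """Build a `spark.local.dir=...` setting from a list of scratch directories."""
    cleaned = [d.strip() for d in dirs if d.strip()]
    if not cleaned:
        raise ValueError("at least one directory is required")
    if len(set(cleaned)) != len(cleaned):
        raise ValueError("directories must be distinct (one per disk)")
    return "spark.local.dir=" + ",".join(cleaned)


if __name__ == "__main__":
    # Hypothetical mount points, one per physical disk.
    print(local_dir_setting(["/mnt/disk1/spark", "/mnt/disk2/spark"]))
```

The resulting string can be passed to spark-submit after `--conf`.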

Re: No space left on device exception

2014-03-23 Thread Ognen Duzlevski
Aaron, thanks for replying. I am very much puzzled as to what is going on. A job that used to run on the same cluster is failing with this mysterious message about not having enough disk space when in fact I can see through watch df -h that the free space is always hovering around 3+GB on the

Re: No space left on device exception

2014-03-23 Thread Ognen Duzlevski
Bleh, strike that, one of my slaves was at 100% inode utilization on the file system. It was /tmp/spark* leftovers that apparently did not get cleaned up properly after failed or interrupted jobs. Mental note - run a cron job on all slaves and master to clean up /tmp/spark* regularly. Thanks
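The periodic cleanup mentioned in this message could look roughly like the sketch below, meant to be run from cron on each node. Everything here is an assumption for illustration: the function name, the 24-hour threshold, and the use of the top-level directory's modification time as a staleness signal. That signal is only a rough proxy, so a long-running job's directory could look stale; the default is therefore a dry run that only lists candidates.

```python
import os
import shutil
import time


def remove_stale_spark_dirs(root="/tmp", prefix="spark", max_age_hours=24.0,
                            dry_run=True, now=None):
    """List (and optionally delete) directories under `root` whose name starts
    with `prefix` and whose mtime is older than `max_age_hours`."""
    now = time.time() if now is None else now
    cutoff = now - max_age_hours * 3600
    stale = []
    # First pass: collect candidates without modifying the directory.
    with os.scandir(root) as entries:
        for entry in entries:
            if not entry.name.startswith(prefix):
                continue
            if not entry.is_dir(follow_symlinks=False):
                continue
            if entry.stat(follow_symlinks=False).st_mtime < cutoff:
                stale.append(entry.path)
    # Second pass: delete only when explicitly requested.
    if not dry_run:
        for path in stale:
            shutil.rmtree(path, ignore_errors=True)
    return sorted(stale)


if __name__ == "__main__":
    for path in remove_stale_spark_dirs(dry_run=True):
        print("would remove", path)
```

Running it with `dry_run=True` first shows what would be deleted before anything is removed.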

Re: No space left on device exception

2014-03-23 Thread Aaron Davidson
Thanks for bringing this up, 100% inode utilization is an issue I haven't seen raised before and this raises another issue which is not on our current roadmap for state cleanup (cleaning up data which was not fully cleaned up from a crashed process). On Sun, Mar 23, 2014 at 7:57 PM, Ognen

Re: No space left on device exception

2014-03-23 Thread Ognen Duzlevski
I would love to work on this (and other) stuff if I can bother someone with questions offline or on a dev mailing list. Ognen On 3/23/14, 10:04 PM, Aaron Davidson wrote: Thanks for bringing this up, 100% inode utilization is an issue I haven't seen raised before and this raises another issue

Re: No space left on device exception

2014-03-23 Thread Patrick Wendell
Ognen - just so I understand. The issue is that there weren't enough inodes and this was causing a No space left on device error? Is that correct? If so, that's good to know because it's definitely counter intuitive. On Sun, Mar 23, 2014 at 8:36 PM, Ognen Duzlevski og...@nengoiksvelzud.com wrote