user

Messages by Thread

- Re: Help with Shuffle Read performance Igor Calabria
depolying stage-level scheduling for Spark SQL and how to expose RDD code from Spark SQL? Chenghao Lyu
Does 'Stage cancelled because SparkContext was shut down' is a error lk_spark
[Spark Kubernetes] Question about Configurability of Labeling Driver Service Shiqi Sun
- Re: [Spark Kubernetes] Question about Configurability of Labeling Driver Service Shiqi Sun
Kyro Serializer not getting set : Spark3 rajat kumar
- Re: Kyro Serializer not getting set : Spark3 Qian SUN
- Re: Kyro Serializer not getting set : Spark3 rajat kumar
HELP, Populating an empty pyspark dataframe with auto-generated dates Jamie Arodi
Query regarding Proleptic Gregorian Calendar Spark3 Sachit Murarka
- Re: Query regarding Proleptic Gregorian Calendar Spark3 Sachit Murarka
Error - Spark STREAMING Akash Vellukai
- Re: Error - Spark STREAMING Anupam Singh
Re: Issue with SparkContext Bjørn Jørgensen
- Re: Issue with SparkContext javacaoyu
NoClassDefError and SparkSession should only be created and accessed on the driver. rajat kumar
- 答复: NoClassDefError and SparkSession should only be created and accessed on the driver. Xiao, Alton
- Re: NoClassDefError and SparkSession should only be created and accessed on the driver. rajat kumar
- Re: NoClassDefError and SparkSession should only be created and accessed on the driver. Paul Rogalinski
Spark Structured Streaming - stderr getting filled up karan alang
- Re: Spark Structured Streaming - stderr getting filled up karan alang
- Re: Spark Structured Streaming - stderr getting filled up karan alang
[how to]RDD using JDBC data source in PySpark [email protected]
- 答复: [how to]RDD using JDBC data source in PySpark Xiao, Alton
- 回复: 答复: [how to]RDD using JDBC data source in PySpark [email protected]
- Re: 答复: [how to]RDD using JDBC data source in PySpark Bjørn Jørgensen
- Re: Re: [how to]RDD using JDBC data source in PySpark [email protected]
- Re: Re: [how to]RDD using JDBC data source in PySpark Bjørn Jørgensen
- Re: 答复: [how to]RDD using JDBC data source in PySpark Sean Owen
Driver throws exception every few hours Kiran Biswal
[Spark Core] Joining Same DataFrame Multiple Times Results in Column not getting dropped Shahban Riaz
[Spark Internals]: Is sort order preserved after partitioned write? Swetha Baskaran
- Re: [Spark Internals]: Is sort order preserved after partitioned write? Enrico Minack
- Re: [Spark Internals]: Is sort order preserved after partitioned write? Swetha Baskaran
- Re: [Spark Internals]: Is sort order preserved after partitioned write? Enrico Minack
- Re: [Spark Internals]: Is sort order preserved after partitioned write? Swetha Baskaran
Big Data Contract Roles ? sri hari kali charan Tummala
Splittable or not? Sid
- Re: Splittable or not? Amit Joshi
- Re: Splittable or not? Sid
- Re: Splittable or not? Enrico Minack
- Re: Splittable or not? Sid
- Re: Splittable or not? Jack Goodson
Network time out property is not getting set in Spark Sachit Murarka
- Re: EXT: Network time out property is not getting set in Spark Vibhor Gupta
- Re: EXT: Network time out property is not getting set in Spark Sachit Murarka
Long running task in spark rajat kumar
- Re: Long running task in spark Sid
[SPARK STRUCTURED STREAMING] : Rocks DB uses off-heap usage akshit marwah
- Re: [SPARK STRUCTURED STREAMING] : Rocks DB uses off-heap usage Artemis User
- Re: [SPARK STRUCTURED STREAMING] : Rocks DB uses off-heap usage Adam Binford
Dynamic shuffle partitions in a single job Vibhor Gupta
- Re: Dynamic shuffle partitions in a single job Anupam Singh
- RE: [EXTERNAL] Re: Dynamic shuffle partitions in a single job Kapil Kumar Singh
Spark SQL Mayur Benodekar
- Re: Spark SQL Gourav Sengupta
- Re: Spark SQL Mayur Benodekar
- Re: Spark SQL Gourav Sengupta
- Re: EXT: Re: Spark SQL Vibhor Gupta
Pipelined execution in Spark (???) Sungwoo Park
- Re: Pipelined execution in Spark (???) Russell Jurney
- Re: Pipelined execution in Spark (???) Sungwoo Park
- Re: Pipelined execution in Spark (???) Sean Owen
- Re: Pipelined execution in Spark (???) Sungwoo Park
- Re: Pipelined execution in Spark (???) Russell Jurney
- Re: Pipelined execution in Spark (???) Gourav Sengupta
- Re: Pipelined execution in Spark (???) Russell Jurney
- Re: Pipelined execution in Spark (???) Russell Jurney
Spark equivalent to hdfs groups phiroc
- Re: Spark equivalent to hdfs groups Sean Owen
- Re: Spark equivalent to hdfs groups phiroc
- Re: Spark equivalent to hdfs groups Sean Owen
- Re: Spark equivalent to hdfs groups phiroc
Spark Structured Streaming - unable to change max.poll.records (showing as 1) karan alang
[ANNOUNCE] Apache Kyuubi (Incubating) released 1.6.0-incubating Nicholas Jiang
Error in Spark in Jupyter Notebook Mamata Shee
- Re: Error in Spark in Jupyter Notebook Sean Owen
Apache Spark - How to concert DataFrame json string to structured element and using schema_of_json M Singh
Jupyter notebook on Dataproc versus GKE Mich Talebzadeh
- Re: Jupyter notebook on Dataproc versus GKE Holden Karau
- Re: Jupyter notebook on Dataproc versus GKE Mich Talebzadeh
- Re: Jupyter notebook on Dataproc versus GKE Holden Karau
- Re: Jupyter notebook on Dataproc versus GKE Bjørn Jørgensen
- Re: Jupyter notebook on Dataproc versus GKE Mich Talebzadeh
- Re: Jupyter notebook on Dataproc versus GKE Bjørn Jørgensen
- Re: Jupyter notebook on Dataproc versus GKE Mich Talebzadeh
- Re: Jupyter notebook on Dataproc versus GKE Holden Karau
- Re: Jupyter notebook on Dataproc versus GKE Bjørn Jørgensen
Spark Issue with Istio in Distributed Mode Deepak Sharma
- Re: Spark Issue with Istio in Distributed Mode Deepak Sharma
- Re: Spark Issue with Istio in Distributed Mode Deepak Sharma
Data Type Issue while upgrading to Spark3 rajat kumar
Creating Custom Broadcast Join Murali S
- ERROR MicroBatchExecution Ravi Chandran
running pyspark on kubernetes - no space left on device Manoj GEORGE
- Re: running pyspark on kubernetes - no space left on device Matt Proetsch
- Re: running pyspark on kubernetes - no space left on device Qian SUN
Spark 3.3.0/3.2.2: java.io.IOException: can not read class org.apache.parquet.format.PageHeader: don't know what type: 15 FengYu Cao
- Re: Spark 3.3.0/3.2.2: java.io.IOException: can not read class org.apache.parquet.format.PageHeader: don't know what type: 15 Chao Sun
- Re: Spark 3.3.0/3.2.2: java.io.IOException: can not read class org.apache.parquet.format.PageHeader: don't know what type: 15 FengYu Cao
Moving to Spark 3x from Spark2 rajat kumar
- Re: Moving to Spark 3x from Spark2 Khalid Mammadov
- Re: Moving to Spark 3x from Spark2 Martin Andersson
deciding Spark tasks & optimization resource rajat kumar
- Re: deciding Spark tasks & optimization resource Gibson
Spark 3.3.0 with Structure Streaming from Kafka Issue on commons-pools2 Raymond Tang
Spark SQL Predict Pushdown for Hive Bucketed Table Raymond Tang
Structured Streaming - data not being read (offsets not getting committed ?) karan alang
回复：Re: Spark got incorrect scala version while using spark 3.2.1 and spark 3.2.2 ckgppl_yan
Spark got incorrect scala version while using spark 3.2.1 and spark 3.2.2 ckgppl_yan
- Re: Spark got incorrect scala version while using spark 3.2.1 and spark 3.2.2 Sean Owen
- Re: Spark got incorrect scala version while using spark 3.2.1 and spark 3.2.2 pengyh
Profiling PySpark Pandas UDF Subash Prabanantham
- Re: Profiling PySpark Pandas UDF Gourav Sengupta
- Re: Profiling PySpark Pandas UDF Andrew Melo
- Re: Profiling PySpark Pandas UDF Sean Owen
- Re: Profiling PySpark Pandas UDF Russell Jurney
- Re: Profiling PySpark Pandas UDF Takuya UESHIN
- Re: Profiling PySpark Pandas UDF Sean Owen
- Re: Profiling PySpark Pandas UDF Russell Jurney
- Re: Profiling PySpark Pandas UDF Subash Prabanantham
- Re: Profiling PySpark Pandas UDF Abdeali Kothari
- RE: Profiling PySpark Pandas UDF Luca Canali
- Re: Profiling PySpark Pandas UDF Abdeali Kothari
- RE: Profiling PySpark Pandas UDF Luca Canali
- Re: Profiling PySpark Pandas UDF Gourav Sengupta
spark-3.2.2-bin-without-hadoop : NoClassDefFoundError: org/apache/log4j/spi/Filter when starting the master FLORANCE Grégory
- Re: spark-3.2.2-bin-without-hadoop : NoClassDefFoundError: org/apache/log4j/spi/Filter when starting the master Sean Owen
Question regarding checkpointing with kafka structured streaming Martin Andersson
[Spark SQL]: Does Spark preserve the order in a nested ORDER BY? Vinay Londhe
Filtering by job group in the Spark UI / API Yeachan Park
Spark streaming Prajith Vellukkai
- Re: Spark streaming ミユナ (alice)
- Spark streaming sandra sukumaran
- Re: Spark streaming Ajit Kumar Amit
- Re: [EXTERNAL] Re: Spark streaming Saurabh Gulati
- Re: [EXTERNAL] Re: Spark streaming sandra sukumaran
- Re: Spark streaming Gourav Sengupta
Data ingestion Akash Vellukai
- Re: Data ingestion Pasha Finkelshtein
- Re: Data ingestion Yuri Oleynikov (‫יורי אולייניקוב‬‎)
- Re: Data ingestion pengyh
- Re: Data ingestion Pasha Finkelshtein
Spark streaming - Data Ingestion Akash Vellukai
- Re: Spark streaming - Data Ingestion Gibson
- Re: [EXTERNAL] Re: Spark streaming - Data Ingestion Saurabh Gulati
- Re: [EXTERNAL] Re: Spark streaming - Data Ingestion Akash Vellukai
- Re: [EXTERNAL] Re: Spark streaming - Data Ingestion Gibson
Supported Hadoop versions for Spark 3.3 Håkan Nordgren
- Re: Supported Hadoop versions for Spark 3.3 pengyh
PySpark schema sanitization Shay Elbaz
- Unsubscribe Peter Kovgan
Spark with GPU rajat kumar
- Re: Spark with GPU Sean Owen
- Re: Spark with GPU rajat kumar
- Re: Spark with GPU Sean Owen
- Re: Spark with GPU Alessandro Bellina
- Re: Spark with GPU Gourav Sengupta
- Spark with GPU Irene Markelic
- Re: Spark with GPU Mich Talebzadeh
- Re: Spark with GPU Jack Goodson
- Re: Spark with GPU Alessandro Bellina
[no subject] GAURAV GUPTA
pyspark not starting Kelum Perera
Joins internally Sid
Memory leak while caching in foreachBatch block kineret M
[Spark SQL] Omit Create Table Statement in Spark Sql 阿强
- Re: [Spark SQL] Omit Create Table Statement in Spark Sql pengyh
Spark program not receiving messages from Cloud Pubsub Pramod Biligiri
- Re: Spark program not receiving messages from Cloud Pubsub Pramod Biligiri
- High number of tasks when ran on a hybrid cluster murat migdisoglu
Spark Scala API still not updated for 2.13 or it's a mistake? Roman I
- Re: Spark Scala API still not updated for 2.13 or it's a mistake? Sean Owen
- Re: Spark Scala API still not updated for 2.13 or it's a mistake? Roman I
- Re: Spark Scala API still not updated for 2.13 or it's a mistake? Sean Owen
- Re: Spark Scala API still not updated for 2.13 or it's a mistake? pengyh
log transfering into hadoop/spark pengyh
- Re: log transfering into hadoop/spark ayan guha
- Re: log transfering into hadoop/spark Gourav Sengupta
[pyspark delta] [delta][Spark SQL]: Getting an Analysis Exception. The associated location (path) is not empty Kumba Janga
- Re: [pyspark delta] [delta][Spark SQL]: Getting an Analysis Exception. The associated location (path) is not empty Sean Owen
- Re: [pyspark delta] [delta][Spark SQL]: Getting an Analysis Exception. The associated location (path) is not empty Kumba Janga
- Re: [pyspark delta] [delta][Spark SQL]: Getting an Analysis Exception. The associated location (path) is not empty ayan guha
- Re: [pyspark delta] [delta][Spark SQL]: Getting an Analysis Exception. The associated location (path) is not empty Sean Owen
- Re: [pyspark delta] [delta][Spark SQL]: Getting an Analysis Exception. The associated location (path) is not empty Stelios Philippou
- Re: [pyspark delta] [delta][Spark SQL]: Getting an Analysis Exception. The associated location (path) is not empty Sean Owen
WARN: netlib.BLAS 陈刚
- Re: WARN: netlib.BLAS Sean Owen
Use case idea Gioele Sal. Perri
- Re: Use case idea pengyh
- Re: Use case idea Gourav Sengupta
- Re: Use case idea pengyh
- Re: Use case idea Gourav Sengupta
- Re: Use case idea pengyh
Salting technique doubt Sid
- Re: Salting technique doubt Amit Joshi
- Re: Salting technique doubt Sid
- Re: Salting technique doubt Amit Joshi
- Re: Salting technique doubt Jacob Lynn
- Re: Salting technique doubt ayan guha