user
Thread
Date
Earlier messages
Later messages
Messages by Thread
Re: [EXTERNAL] Re: Spark-submit without access to HDFS
Eugene Miretsky
Re: [EXTERNAL] Re: Spark-submit without access to HDFS
Eugene Miretsky
Re: [EXTERNAL] Re: Spark-submit without access to HDFS
Mich Talebzadeh
Re: [EXTERNAL] Re: [EXTERNAL] Re: Spark-submit without access to HDFS
Eugene Miretsky
[Spark Structured Streaming] Two sink from Single stream
Subash Prabanantham
The job failed when we upgraded from spark 3.3.1 to spark3.4.1
Hanyu Huang
The job failed when we upgraded from spark 3.3.1 to spark3.4.1
Hanyu Huang
RE: The job failed when we upgraded from spark 3.3.1 to spark3.4.1
Stevens, Clay
The job failed when we upgraded from spark 3.3.1 to spark3.4.1
Hanyu Huang
Why create/drop/alter/rename partition does not post listener event in ExternalCatalogWithListener?
李响
Pass xmx values to SparkLauncher launched Java process
Deepthi Sathia Raj
How grouping rows without shuffle
Yoel Benharrous
help needed with SPARK-45598 and SPARK-45769
Maksym M
Storage Partition Joins only works for buckets?
Arwin Tio
org.apache.ranger.authorization.hive.authorizer.RangerHiveAuthorizerFactory ClassNotFoundException
Yi Zheng
[ANNOUNCE] Apache Kyuubi released 1.8.0
Cheng Pan
Spark master shuts down when one of zookeeper dies
Kaustubh Ghode
Re: Spark master shuts down when one of zookeeper dies
Mich Talebzadeh
How to configure authentication from a pySpark client to a Spark Connect server ?
Xiaolong Wang
[Spark SQL] [Bug] Adding `checkpoint()` causes "column [...] cannot be resolved" error
Robin Zimmerman
Parser error when running PySpark on Windows connecting to GCS
Richard Smith
Re: Parser error when running PySpark on Windows connecting to GCS
Mich Talebzadeh
Data analysis issues
Jauru Lin
Re: Data analysis issues
Mich Talebzadeh
Spark / Scala conflict
Harry Jamison
Re: Spark / Scala conflict
Aironman DirtDiver
Re: Spark / Scala conflict
Harry Jamison
Fixed byte array issue
KhajaAsmath Mohammed
jackson-databind version mismatch
moshik.vitas
Re: jackson-databind version mismatch
eab...@163.com
Re: jackson-databind version mismatch
Bjørn Jørgensen
Re: jackson-databind version mismatch
Bjørn Jørgensen
Re: Re: jackson-databind version mismatch
eab...@163.com
RE: jackson-databind version mismatch
moshik.vitas
Elasticity and scalability for Spark in Kubernetes
Mich Talebzadeh
[Structured Streaming] Joins after aggregation don't work in streaming
Andrzej Zera
Re: [Structured Streaming] Joins after aggregation don't work in streaming
Jungtaek Lim
Re: [Structured Streaming] Joins after aggregation don't work in streaming
Andrzej Zera
spark schema conflict behavior records being silently dropped
Carlos Aguni
submitting tasks failed in Spark standalone mode due to missing failureaccess jar file
eab...@163.com
Contribution Recommendations
Phil Dakin
Maximum executors in EC2 Machine
KhajaAsmath Mohammed
Re: Maximum executors in EC2 Machine
Riccardo Ferrari
automatically/dinamically renew aws temporary token
Carlos Aguni
Re: automatically/dinamically renew aws temporary token
Jörn Franke
Re: automatically/dinamically renew aws temporary token
Pol Santamaria
Re: automatically/dinamically renew aws temporary token
Carlos Aguni
Spark join produce duplicate rows in resultset
Meena Rajani
Re: Spark join produce duplicate rows in resultset
Patrick Tucci
Re: Spark join produce duplicate rows in resultset
Sadha Chilukoori
Re: Spark join produce duplicate rows in resultset
Bjørn Jørgensen
Re: Spark join produce duplicate rows in resultset
Meena Rajani
Error when trying to get the data from Hive Materialized View
Siva Sankar Reddy
spark.stop() cannot stop spark connect session
eab...@163.com
[Resolved] Re: spark.stop() cannot stop spark connect session
eab...@163.com
"Premature end of Content-Length" Error
Sandhya Bala
hive: spark as execution engine. class not found problem
Amirhossein Kabiri
Re: hive: spark as execution engine. class not found problem
Vijay Shankar
[ANNOUNCE] Apache Celeborn(incubating) 0.3.1 available
Cheng Pan
[ SPARK SQL ]: PPER in WHERE condition is not working in Apache Spark 3.5.0 for Mysql ENUM Column
Suyash Ajmera
Re: [ SPARK SQL ]: UPPER in WHERE condition is not working in Apache Spark 3.5.0 for Mysql ENUM Column
Suyash Ajmera
Re: [ SPARK SQL ]: UPPER in WHERE condition is not working in Apache Spark 3.5.0 for Mysql ENUM Column
Suyash Ajmera
Can not complete the read csv task
Kelum Perera
Fw: Can not complete the read csv task
Kelum Perera
Fwd: Fw: Can not complete the read csv task
KP Youtuber
Re: Can not complete the read csv task
Khalid Mammadov
Autoscaling in Spark
Kiran Biswal
Re: Autoscaling in Spark
Mich Talebzadeh
Log file location in Spark on K8s
Agrawal, Sanket
Re: Log file location in Spark on K8s
Prashant Sharma
Clarification with Spark Structured Streaming
ashok34...@yahoo.com.INVALID
Re: Clarification with Spark Structured Streaming
Mich Talebzadeh
Re: Clarification with Spark Structured Streaming
ashok34...@yahoo.com.INVALID
Re: Clarification with Spark Structured Streaming
Mich Talebzadeh
Re: Clarification with Spark Structured Streaming
Danilo Sousa
Spark Compatibility with Spring Boot 3.x
Ahmed Albalawi
Re: Spark Compatibility with Spring Boot 3.x
Sean Owen
Re: Spark Compatibility with Spring Boot 3.x
Angshuman Bhattacharya
RE: Re: Spark Compatibility with Spring Boot 3.x
Guru Panda
Connection pool shut down in Spark Iceberg Streaming Connector
Agrawal, Sanket
Re: Connection pool shut down in Spark Iceberg Streaming Connector
Prashant Sharma
Re: Connection pool shut down in Spark Iceberg Streaming Connector
Igor Calabria
[PySpark Structured Streaming] How to tune .repartition(N) ?
Shao Yang Hong
Re: [PySpark Structured Streaming] How to tune .repartition(N) ?
Raghavendra Ganesh
Re: [PySpark Structured Streaming] How to tune .repartition(N) ?
Shao Yang Hong
Re: [PySpark Structured Streaming] How to tune .repartition(N) ?
Perez
[PySpark Structured Streaming] How to tune .repartition(N) ?
Shao Yang Hong
Re: [PySpark Structured Streaming] How to tune .repartition(N) ?
Mich Talebzadeh
[Spark Core]: Recomputation cost of a job due to executor failures
Faiz Halde
Updating delta file column data
Karthick Nk
Re: Updating delta file column data
Karthick Nk
Re: Updating delta file column data
Mich Talebzadeh
Re: Updating delta file column data
Mich Talebzadeh
using facebook Prophet + pyspark for forecasting - Dataframe has less than 2 non-NaN rows
karan alang
Seeking Guidance on Spark on Kubernetes Secrets Configuration
Jon Rodríguez Aranguren
Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration
Jörn Franke
Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration
Jayabindu Singh
Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration
Mich Talebzadeh
Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration
Jörn Franke
Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration
Jon Rodríguez Aranguren
Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration
Jörn Franke
Thread dump only shows 10 shuffle clients
Nebi Aydin
Files io threads vs shuffle io threads
Nebi Aydin
Inquiry about Processing Speed
Haseeb Khalid
Re: Inquiry about Processing Speed
Deepak Goel
Re: Inquiry about Processing Speed
Jack Goodson
Reading Glue Catalog Views through Spark.
Agrawal, Sanket
[PySpark][Spark logs] Is it possible to dynamically customize Spark logs?
Ayman Rekik
[ANNOUNCE] Apache Kyuubi released 1.7.3
Zhen Wang
Spark Connect Multi-tenant Support
Kezhi Xiong
Parallel write to different partitions
Shrikant Prasad
Re: Parallel write to different partitions
Shrikant Prasad
Need to split incoming data into PM on time column and find the top 5 by volume of data
ashok34...@yahoo.com.INVALID
Re: Need to split incoming data into PM on time column and find the top 5 by volume of data
Mich Talebzadeh
PySpark 3.5.0 on PyPI
Kezhi Xiong
Re: PySpark 3.5.0 on PyPI
Sean Owen
Re: PySpark 3.5.0 on PyPI
Kezhi Xiong
[Spark 3.5.0] Is the protobuf-java JAR no longer shipped with Spark?
Gijs Hendriksen
Create an external table with DataFrameWriterV2
Christophe Préaud
Spark streaming sourceArchiveDir does not move file to archive directory
Yunus Emre G?rses
Discriptency sample standard deviation pyspark and Excel
Helene Bøe
Re: Discriptency sample standard deviation pyspark and Excel
Sean Owen
Re: Discriptency sample standard deviation pyspark and Excel
Mich Talebzadeh
Re: Discriptency sample standard deviation pyspark and Excel
Sean Owen
Re: Discriptency sample standard deviation pyspark and Excel
Bjørn Jørgensen
Re: Discriptency sample standard deviation pyspark and Excel
Mich Talebzadeh
Urgent: Seeking Guidance on Kafka Slow Consumer and Data Skew Problem
Karthick
Re: Urgent: Seeking Guidance on Kafka Slow Consumer and Data Skew Problem
Gowtham S
Re: Urgent: Seeking Guidance on Kafka Slow Consumer and Data Skew Problem
Karthick
getting emails in different order!
Mich Talebzadeh
Re: getting emails in different order!
Sean Owen
Re: getting emails in different order!
Mich Talebzadeh
[ANNOUNCE] Apache Kyuubi released 1.7.2
Zhen Wang
About Peak Jvm Memory Onheap
Nebi Aydin
Fwd: First Time contribution.
ram manickam
Re: First Time contribution.
Denny Lee
Re: First Time contribution.
Haejoon Lee
[Spark Core]: How does rpc threads influence shuffle?
Nebi Aydin
Re: Filter out 20% of rows
Bjørn Jørgensen
Re: Filter out 20% of rows
Mich Talebzadeh
Re: Filter out 20% of rows
Bjørn Jørgensen
Re: Filter out 20% of rows
Mich Talebzadeh
Re: Filter out 20% of rows
Mich Talebzadeh
Re: Filter out 20% of rows
Bjørn Jørgensen
Re: Filter out 20% of rows
Bjørn Jørgensen
Re: Filter out 20% of rows
ashok34...@yahoo.com.INVALID
Spark stand-alone mode
Ilango
Re: Spark stand-alone mode
Patrick Tucci
Re: Spark stand-alone mode
Sean Owen
Re: Spark stand-alone mode
Mich Talebzadeh
Re: Spark stand-alone mode
Bjørn Jørgensen
Re: Spark stand-alone mode
Ilango
Re: Spark stand-alone mode
Patrick Tucci
Re: Spark stand-alone mode
Ilango
Re: Data Duplication Bug Found - Structured Streaming Versions 3..4.1, 3.2.4, and 3.3.2
Craig Alfieri
Re: Data Duplication Bug Found - Structured Streaming Versions 3..4.1, 3.2.4, and 3.3.2
Jerry Peng
Re: Data Duplication Bug Found - Structured Streaming Versions 3..4.1, 3.2.4, and 3.3.2
russell . spitzer
Re: Data Duplication Bug Found - Structured Streaming Versions 3..4.1, 3.2.4, and 3.3.2
Craig Alfieri
Re: Data Duplication Bug Found - Structured Streaming Versions 3..4.1, 3.2.4, and 3.3.2
Jerry Peng
APACHE Spark adoption/growth chart
Andrew Petersen
Write Spark Connection client application in Go
bo yang
Re: Write Spark Connection client application in Go
Holden Karau
Re: Write Spark Connection client application in Go
Martin Grund
Re: Write Spark Connection client application in Go
bo yang
Feedback on Testing Guidelines for Data Stream Processing Applications
Alexandre Strapacao Guedes Vianna
Re: IDEA compile fail but sbt test succeed
Pasha Finkelshteyn
About /mnt/hdfs/current/BP directories
Nebi Aydin
Re: About /mnt/hdfs/current/BP directories
Jack Wells
Re: [External Email] Re: About /mnt/hdfs/current/BP directories
Nebi Aydin
Re: [External Email] Re: About /mnt/hdfs/current/BP directories
Jack Wells
Re: [External Email] Re: About /mnt/hdfs/current/BP directories
Nebi Aydin
RE: Spark 3.4.1 and Hive 3.1.3
Agrawal, Sanket
Re: Spark 3.4.1 and Hive 3.1.3
Yeachan Park
Re: Spark 3.4.1 and Hive 3.1.3
Chao Sun
RE: Spark 3.4.1 and Hive 3.1.3
Agrawal, Sanket
Re: Spark 3.4.1 and Hive 3.1.3
Nagatomi Yasukazu
RE: Spark 3.4.1 and Hive 3.1.3
Agrawal, Sanket
RE: Spark 3.4.1 and Hive 3.1.3
Agrawal, Sanket
how can i use spark with yarn cluster in java
BCMS
Re: how can i use spark with yarn cluster in java
Mich Talebzadeh
Change default timestamp offset on data load
Jack Goodson
Re: Change default timestamp offset on data load
Mich Talebzadeh
Re: Change default timestamp offset on data load
Jack Goodson
Re: Change default timestamp offset on data load
Mich Talebzadeh
Re: Change default timestamp offset on data load
Jack Goodson
Seeking Professional Advice on Career and Personal Growth in the Apache Spark Community
Varun Shah
Re: Seeking Professional Advice on Career and Personal Growth in the Apache Spark Community
Mich Talebzadeh
Re: Seeking Professional Advice on Career and Personal Growth in the Apache Spark Community
ashok34...@yahoo.com.INVALID
Re: Seeking Professional Advice on Career and Personal Growth in the Apache Spark Community
Mich Talebzadeh
pyspark.ml.recommendation is using the wrong python version
Harry Jamison
Re: pyspark.ml.recommendation is using the wrong python version
Harry Jamison
Re: pyspark.ml.recommendation is using the wrong python version
Mich Talebzadeh
Running Spark Connect Server in Cluster Mode on Kubernetes
Nagatomi Yasukazu
Re: Running Spark Connect Server in Cluster Mode on Kubernetes
Cleyson Barros
Re: Running Spark Connect Server in Cluster Mode on Kubernetes
Nagatomi Yasukazu
Re: Running Spark Connect Server in Cluster Mode on Kubernetes
Mich Talebzadeh
Re: Running Spark Connect Server in Cluster Mode on Kubernetes
Nagatomi Yasukazu
Re: Running Spark Connect Server in Cluster Mode on Kubernetes
Nagatomi Yasukazu
Re: Re: Running Spark Connect Server in Cluster Mode on Kubernetes
eab...@163.com
Re: Re: Running Spark Connect Server in Cluster Mode on Kubernetes
eab...@163.com
Earlier messages
Later messages