user
Earlier messages
Messages by Thread
SSH Tunneling issue with Apache Spark
Venkatesan Muniappan
Re: SSH Tunneling issue with Apache Spark
Venkatesan Muniappan
Re: SSH Tunneling issue with Apache Spark
Nicholas Chammas
Re: SSH Tunneling issue with Apache Spark
Venkatesan Muniappan
ordering of rows in dataframe
Som Lima
Re: ordering of rows in dataframe
Enrico Minack
ML advice
Zahid Rahman
Do we have any mechanism to control requests per second for a Kafka connect sink?
Yeikel Santana
Re: Do we have any mechanism to control requests per second for a Kafka connect sink?
Yeikel Santana
Spark-Connect: Param `--packages` does not take effect for executors.
Xiaolong Wang
Re: Spark-Connect: Param `--packages` does not take effect for executors.
Aironman DirtDiver
Re: Spark-Connect: Param `--packages` does not take effect for executors.
Holden Karau
[PySpark][Spark Dataframe][Observation] Why empty dataframe join doesn't let you get metrics from observation?
Михаил Кулаков
Re: [PySpark][Spark Dataframe][Observation] Why empty dataframe join doesn't let you get metrics from observation?
Enrico Minack
Re: [PySpark][Spark Dataframe][Observation] Why empty dataframe join doesn't let you get metrics from observation?
Enrico Minack
ML using Spark Connect
Faiz Halde
[FYI] SPARK-45981: Improve Python language test coverage
Dongjoon Hyun
Re: [FYI] SPARK-45981: Improve Python language test coverage
Hyukjin Kwon
[Streaming (DStream) ] : Does Spark Streaming supports pause/resume consumption of message from Kafka?
Saurabh Agrawal (180813)
Re: [Streaming (DStream) ] : Does Spark Streaming supports pause/resume consumption of message from Kafka?
Mich Talebzadeh
[ANNOUNCE] Apache Spark 3.4.2 released
Dongjoon Hyun
Re:[ANNOUNCE] Apache Spark 3.4.2 released
beliefer
[sql] how to connect query stage to Spark job/stages?
Chenghao Lyu
Tuning Best Practices
Bryant Wright
Re: Tuning Best Practices
Jack Goodson
Re: Tuning Best Practices
Bryant Wright
Classpath isolation per SparkSession without Spark Connect
Faiz Halde
Re: Classpath isolation per SparkSession without Spark Connect
Holden Karau
Re: Classpath isolation per SparkSession without Spark Connect
Faiz Halde
Re: Classpath isolation per SparkSession without Spark Connect
Pasha Finkelshtein
Re: Classpath isolation per SparkSession without Spark Connect
Faiz Halde
Re: Classpath isolation per SparkSession without Spark Connect
Pasha Finkelshtein
Re: Spark structured streaming tab is missing from spark web UI
Jungtaek Lim
[Spark-sql 3.2.4] Wrong Statistic INFO From 'ANALYZE TABLE' Command
Nick Luo
Query fails on CASE statement depending on order of summed columns
Evgenii Ignatev
How exactly does dropDuplicatesWithinWatermark work?
Perfect Stranger
Re: How exactly does dropDuplicatesWithinWatermark work?
Jungtaek Lim
Setting fs.s3a.aws.credentials.provider through a connect server.
Leandro Martelli
Spark-submit without access to HDFS
Eugene Miretsky
Re: Spark-submit without access to HDFS
eab...@163.com
Re: [EXTERNAL] Re: Spark-submit without access to HDFS
Eugene Miretsky
Re: Re: [EXTERNAL] Re: Spark-submit without access to HDFS
eab...@163.com
Re: Spark-submit without access to HDFS
Jörn Franke
Re: Spark-submit without access to HDFS
Mich Talebzadeh
[Spark Structured Streaming] Two sink from Single stream
Subash Prabanantham
The job failed when we upgraded from spark 3.3.1 to spark3.4.1
Hanyu Huang
The job failed when we upgraded from spark 3.3.1 to spark3.4.1
Hanyu Huang
RE: The job failed when we upgraded from spark 3.3.1 to spark3.4.1
Stevens, Clay
The job failed when we upgraded from spark 3.3.1 to spark3.4.1
Hanyu Huang
Why create/drop/alter/rename partition does not post listener event in ExternalCatalogWithListener?
李响
Pass xmx values to SparkLauncher launched Java process
Deepthi Sathia Raj
How grouping rows without shuffle
Yoel Benharrous
help needed with SPARK-45598 and SPARK-45769
Maksym M
Storage Partition Joins only works for buckets?
Arwin Tio
org.apache.ranger.authorization.hive.authorizer.RangerHiveAuthorizerFactory ClassNotFoundException
Yi Zheng
[ANNOUNCE] Apache Kyuubi released 1.8.0
Cheng Pan
Spark master shuts down when one of zookeeper dies
Kaustubh Ghode
Re: Spark master shuts down when one of zookeeper dies
Mich Talebzadeh
How to configure authentication from a pySpark client to a Spark Connect server ?
Xiaolong Wang
[Spark SQL] [Bug] Adding `checkpoint()` causes "column [...] cannot be resolved" error
Robin Zimmerman
Parser error when running PySpark on Windows connecting to GCS
Richard Smith
Re: Parser error when running PySpark on Windows connecting to GCS
Mich Talebzadeh
Data analysis issues
Jauru Lin
Re: Data analysis issues
Mich Talebzadeh
Spark / Scala conflict
Harry Jamison
Re: Spark / Scala conflict
Aironman DirtDiver
Re: Spark / Scala conflict
Harry Jamison
Fixed byte array issue
KhajaAsmath Mohammed
jackson-databind version mismatch
moshik.vitas
Re: jackson-databind version mismatch
eab...@163.com
Re: jackson-databind version mismatch
Bjørn Jørgensen
Re: jackson-databind version mismatch
Bjørn Jørgensen
Re: Re: jackson-databind version mismatch
eab...@163.com
RE: jackson-databind version mismatch
moshik.vitas
Elasticity and scalability for Spark in Kubernetes
Mich Talebzadeh
[Structured Streaming] Joins after aggregation don't work in streaming
Andrzej Zera
Re: [Structured Streaming] Joins after aggregation don't work in streaming
Jungtaek Lim
Re: [Structured Streaming] Joins after aggregation don't work in streaming
Andrzej Zera
spark schema conflict behavior records being silently dropped
Carlos Aguni
submitting tasks failed in Spark standalone mode due to missing failureaccess jar file
eab...@163.com
Contribution Recommendations
Phil Dakin
Maximum executors in EC2 Machine
KhajaAsmath Mohammed
Re: Maximum executors in EC2 Machine
Riccardo Ferrari
automatically/dinamically renew aws temporary token
Carlos Aguni
Re: automatically/dinamically renew aws temporary token
Jörn Franke
Re: automatically/dinamically renew aws temporary token
Pol Santamaria
Re: automatically/dinamically renew aws temporary token
Carlos Aguni
Spark join produce duplicate rows in resultset
Meena Rajani
Re: Spark join produce duplicate rows in resultset
Patrick Tucci
Re: Spark join produce duplicate rows in resultset
Sadha Chilukoori
Re: Spark join produce duplicate rows in resultset
Bjørn Jørgensen
Re: Spark join produce duplicate rows in resultset
Meena Rajani
Error when trying to get the data from Hive Materialized View
Siva Sankar Reddy
spark.stop() cannot stop spark connect session
eab...@163.com
[Resolved] Re: spark.stop() cannot stop spark connect session
eab...@163.com
"Premature end of Content-Length" Error
Sandhya Bala
hive: spark as execution engine. class not found problem
Amirhossein Kabiri
Re: hive: spark as execution engine. class not found problem
Vijay Shankar
[ANNOUNCE] Apache Celeborn(incubating) 0.3.1 available
Cheng Pan
[ SPARK SQL ]: UPPER in WHERE condition is not working in Apache Spark 3.5.0 for Mysql ENUM Column
Suyash Ajmera
Re: [ SPARK SQL ]: UPPER in WHERE condition is not working in Apache Spark 3.5.0 for Mysql ENUM Column
Suyash Ajmera
Re: [ SPARK SQL ]: UPPER in WHERE condition is not working in Apache Spark 3.5.0 for Mysql ENUM Column
Suyash Ajmera
Can not complete the read csv task
Kelum Perera
Fw: Can not complete the read csv task
Kelum Perera
Fwd: Fw: Can not complete the read csv task
KP Youtuber
Re: Can not complete the read csv task
Khalid Mammadov
Autoscaling in Spark
Kiran Biswal
Re: Autoscaling in Spark
Mich Talebzadeh
Log file location in Spark on K8s
Agrawal, Sanket
Re: Log file location in Spark on K8s
Prashant Sharma
Clarification with Spark Structured Streaming
ashok34...@yahoo.com.INVALID
Re: Clarification with Spark Structured Streaming
Mich Talebzadeh
Re: Clarification with Spark Structured Streaming
ashok34...@yahoo.com.INVALID
Re: Clarification with Spark Structured Streaming
Mich Talebzadeh
Re: Clarification with Spark Structured Streaming
Danilo Sousa
Spark Compatibility with Spring Boot 3.x
Ahmed Albalawi
Re: Spark Compatibility with Spring Boot 3.x
Sean Owen
Re: Spark Compatibility with Spring Boot 3.x
Angshuman Bhattacharya
RE: Re: Spark Compatibility with Spring Boot 3.x
Guru Panda
Connection pool shut down in Spark Iceberg Streaming Connector
Agrawal, Sanket
Re: Connection pool shut down in Spark Iceberg Streaming Connector
Prashant Sharma
Re: Connection pool shut down in Spark Iceberg Streaming Connector
Igor Calabria
[PySpark Structured Streaming] How to tune .repartition(N) ?
Shao Yang Hong
Re: [PySpark Structured Streaming] How to tune .repartition(N) ?
Raghavendra Ganesh
Re: [PySpark Structured Streaming] How to tune .repartition(N) ?
Shao Yang Hong
Re: [PySpark Structured Streaming] How to tune .repartition(N) ?
Perez
[PySpark Structured Streaming] How to tune .repartition(N) ?
Shao Yang Hong
Re: [PySpark Structured Streaming] How to tune .repartition(N) ?
Mich Talebzadeh
[Spark Core]: Recomputation cost of a job due to executor failures
Faiz Halde
Updating delta file column data
Karthick Nk
Re: Updating delta file column data
Karthick Nk
Re: Updating delta file column data
Mich Talebzadeh
Re: Updating delta file column data
Mich Talebzadeh
using facebook Prophet + pyspark for forecasting - Dataframe has less than 2 non-NaN rows
karan alang
Seeking Guidance on Spark on Kubernetes Secrets Configuration
Jon Rodríguez Aranguren
Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration
Jörn Franke
Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration
Jayabindu Singh
Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration
Mich Talebzadeh
Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration
Jörn Franke
Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration
Jon Rodríguez Aranguren
Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration
Jörn Franke
Thread dump only shows 10 shuffle clients
Nebi Aydin
Files io threads vs shuffle io threads
Nebi Aydin
Inquiry about Processing Speed
Haseeb Khalid
Re: Inquiry about Processing Speed
Deepak Goel
Re: Inquiry about Processing Speed
Jack Goodson
Reading Glue Catalog Views through Spark.
Agrawal, Sanket
[PySpark][Spark logs] Is it possible to dynamically customize Spark logs?
Ayman Rekik
[ANNOUNCE] Apache Kyuubi released 1.7.3
Zhen Wang
Spark Connect Multi-tenant Support
Kezhi Xiong
Parallel write to different partitions
Shrikant Prasad
Re: Parallel write to different partitions
Shrikant Prasad
Need to split incoming data into PM on time column and find the top 5 by volume of data
ashok34...@yahoo.com.INVALID
Re: Need to split incoming data into PM on time column and find the top 5 by volume of data
Mich Talebzadeh
PySpark 3.5.0 on PyPI
Kezhi Xiong
Re: PySpark 3.5.0 on PyPI
Sean Owen
Re: PySpark 3.5.0 on PyPI
Kezhi Xiong
[Spark 3.5.0] Is the protobuf-java JAR no longer shipped with Spark?
Gijs Hendriksen
Create an external table with DataFrameWriterV2
Christophe Préaud
Spark streaming sourceArchiveDir does not move file to archive directory
Yunus Emre Gürses
Discriptency sample standard deviation pyspark and Excel
Helene Bøe
Re: Discriptency sample standard deviation pyspark and Excel
Sean Owen
Re: Discriptency sample standard deviation pyspark and Excel
Mich Talebzadeh
Re: Discriptency sample standard deviation pyspark and Excel
Sean Owen
Re: Discriptency sample standard deviation pyspark and Excel
Bjørn Jørgensen
Re: Discriptency sample standard deviation pyspark and Excel
Mich Talebzadeh
Urgent: Seeking Guidance on Kafka Slow Consumer and Data Skew Problem
Karthick
Re: Urgent: Seeking Guidance on Kafka Slow Consumer and Data Skew Problem
Gowtham S
Re: Urgent: Seeking Guidance on Kafka Slow Consumer and Data Skew Problem
Karthick
getting emails in different order!
Mich Talebzadeh
Re: getting emails in different order!
Sean Owen
Re: getting emails in different order!
Mich Talebzadeh
[ANNOUNCE] Apache Kyuubi released 1.7.2
Zhen Wang
About Peak Jvm Memory Onheap
Nebi Aydin
Fwd: First Time contribution.
ram manickam
Re: First Time contribution.
Denny Lee
Re: First Time contribution.
Haejoon Lee
[Spark Core]: How does rpc threads influence shuffle?
Nebi Aydin
Re: Filter out 20% of rows
Bjørn Jørgensen
Re: Filter out 20% of rows
Mich Talebzadeh
Re: Filter out 20% of rows
Bjørn Jørgensen
Re: Filter out 20% of rows
Mich Talebzadeh
Re: Filter out 20% of rows
Mich Talebzadeh
Re: Filter out 20% of rows
Bjørn Jørgensen
Re: Filter out 20% of rows
Bjørn Jørgensen
Re: Filter out 20% of rows
ashok34...@yahoo.com.INVALID
Spark stand-alone mode
Ilango
Re: Spark stand-alone mode
Patrick Tucci
Re: Spark stand-alone mode
Sean Owen
Re: Spark stand-alone mode
Mich Talebzadeh
Re: Spark stand-alone mode
Bjørn Jørgensen
Re: Spark stand-alone mode
Ilango
Re: Spark stand-alone mode
Patrick Tucci
Re: Spark stand-alone mode
Ilango
Re: Data Duplication Bug Found - Structured Streaming Versions 3..4.1, 3.2.4, and 3.3.2
Craig Alfieri
Re: Data Duplication Bug Found - Structured Streaming Versions 3..4.1, 3.2.4, and 3.3.2
Jerry Peng
Re: Data Duplication Bug Found - Structured Streaming Versions 3..4.1, 3.2.4, and 3.3.2
russell . spitzer
Re: Data Duplication Bug Found - Structured Streaming Versions 3..4.1, 3.2.4, and 3.3.2
Craig Alfieri
Re: Data Duplication Bug Found - Structured Streaming Versions 3..4.1, 3.2.4, and 3.3.2
Jerry Peng
APACHE Spark adoption/growth chart
Andrew Petersen