user

Messages by Thread

Re: Python for the kids and now PySpark Farshid Ashouri
- Re: Python for the kids and now PySpark Meena Rajani
[Release Question]: Estimate on 3.5.2 release? Paul Gerver
[SparkListener] Accessing classes loaded via the '--packages' option Damien Hawes
DataFrameReader: timestampFormat default value keen
[spark-graphframes]: Generating incorrect edges Nijland, J.G.W. (Jelle, Student M-CS)
- Re: [spark-graphframes]: Generating incorrect edges Mich Talebzadeh
- Re: [spark-graphframes]: Generating incorrect edges Nijland, J.G.W. (Jelle, Student M-CS)
- Re: [spark-graphframes]: Generating incorrect edges Mich Talebzadeh
- Re: [spark-graphframes]: Generating incorrect edges Nijland, J.G.W. (Jelle, Student M-CS)
How to add MaxDOP option in spark mssql JDBC Elite
- RE: How to add MaxDOP option in spark mssql JDBC Appel, Kevin
- Re:RE: How to add MaxDOP option in spark mssql JDBC Elite
How to use Structured Streaming in Spark SQL ????
How to access the internal hidden columns of table by spark jdbc casel.chen
Accounting the impact of failures in spark jobs Faiz Halde
StreamingQueryListener integration with Spark native metric sink (JmxSink) Mason Chen
[ANNOUNCE] Apache Spark 3.4.3 released Dongjoon Hyun
[Spark SQL][How-To] Remove builtin function support from Spark Matthew McMillian
- [Spark SQL][How-To] Remove builtin function support from Spark Matthew McMillian
should OutputCommitCoordinator fail stages for authorized committer failures when using s3a optimized committers? Dylan McClelland
[Spark SQL] xxhash64 default seed of 42 confusion Igor Calabria
auto create event log directory if not exist second_co...@yahoo.com.INVALID
Spark streaming job for kafka transaction does not consume read_committed messages correctly. Kidong Lee
- Re: Spark streaming job for kafka transaction does not consume read_committed messages correctly. Mich Talebzadeh
- Re: Spark streaming job for kafka transaction does not consume read_committed messages correctly. Kidong Lee
- Re: Spark streaming job for kafka transaction does not consume read_committed messages correctly. Kidong Lee
- Re: Spark streaming job for kafka transaction does not consume read_committed messages correctly. Mich Talebzadeh
- Re: Spark streaming job for kafka transaction does not consume read_committed messages correctly. Kidong Lee
Spark column headings, camelCase or snake case? Mich Talebzadeh
[Spark SQL]: Source code for PartitionedFile Ashley McManamon
- Re: [Spark SQL]: Source code for PartitionedFile Mich Talebzadeh
- Re: [Spark SQL]: Source code for PartitionedFile Ashley McManamon
How to get db related metrics when use spark jdbc to read db table? casel.chen
- Re: How to get db related metrics when use spark jdbc to read db table? Mich Talebzadeh
- Re: How to get db related metrics when use spark jdbc to read db table? Femi Anthony
Spark UDAF in examples fail with not serializable error Owen Bell
Idiomatic way to rate-limit streaming sources to avoid OutOfMemoryError? Baran, Mert
- Re: Idiomatic way to rate-limit streaming sources to avoid OutOfMemoryError? Mich Talebzadeh
Example UDAF fails with "not serializable" exception Owen Bell
External Spark shuffle service for k8s Mich Talebzadeh
- Re: External Spark shuffle service for k8s Bjørn Jørgensen
- Re: External Spark shuffle service for k8s Mich Talebzadeh
- Re: External Spark shuffle service for k8s Vakaris Baškirov
- Re: External Spark shuffle service for k8s Mich Talebzadeh
- Re: External Spark shuffle service for k8s roryqi
- Re: External Spark shuffle service for k8s Vakaris Baškirov
- Re: External Spark shuffle service for k8s Mich Talebzadeh
- Re: External Spark shuffle service for k8s Arun Ravi
- Re: External Spark shuffle service for k8s Bjørn Jørgensen
- Re: External Spark shuffle service for k8s Bjørn Jørgensen
- Re: External Spark shuffle service for k8s Cheng Pan
- Re: External Spark shuffle service for k8s Mich Talebzadeh
- Re: External Spark shuffle service for k8s Enrico Minack
Clarification on what "[id=#]" refers to in Physical Plan Exchange hashpartitioning Tahj Anderson
- Clarification on what "[id=#]" refers to in Physical Plan Exchange hashpartitioning Tahj Anderson
Participate in the ASF 25th Anniversary Campaign Brian Proffitt
[Spark]: Spark / Iceberg / hadoop-aws compatibility matrix Oxlade, Dan
- Re: [Spark]: Spark / Iceberg / hadoop-aws compatibility matrix Aaron Grubb
- Re: [EXTERNAL] Re: [Spark]: Spark / Iceberg / hadoop-aws compatibility matrix Oxlade, Dan
- Re: [EXTERNAL] Re: [Spark]: Spark / Iceberg / hadoop-aws compatibility matrix Oxlade, Dan
- Re: [EXTERNAL] Re: [Spark]: Spark / Iceberg / hadoop-aws compatibility matrix Oxlade, Dan
[Spark SQL] How can I use .sql() in conjunction with watermarks? Chloe He
- Re: [Spark SQL] How can I use .sql() in conjunction with watermarks? Mich Talebzadeh
- RE: Re: [Spark SQL] How can I use .sql() in conjunction with watermarks? Chloe He
- Re: Re: [Spark SQL] How can I use .sql() in conjunction with watermarks? Mich Talebzadeh
- Re: Re: [Spark SQL] How can I use .sql() in conjunction with watermarks? 刘唯
- Re: Re: [Spark SQL] How can I use .sql() in conjunction with watermarks? 刘唯
- Re: Re: [Spark SQL] How can I use .sql() in conjunction with watermarks? Mich Talebzadeh
Apache Spark integration with Spring Boot 3.0.0+ Szymon Kasperkiewicz
Community Over Code NA 2024 Travel Assistance Applications now open! Gavin McDonald
[DISCUSS] MySQL version support policy Cheng Pan
- Re: [DISCUSS] MySQL version support policy Dongjoon Hyun
Is one Spark partition mapped to one and only Spark Task ? Sreyan Chakravarty
Feature article: Leveraging Generative AI with Apache Spark: Transforming Data Engineering Mich Talebzadeh
- Re: Feature article: Leveraging Generative AI with Apache Spark: Transforming Data Engineering Mich Talebzadeh
Bug in org.apache.spark.util.sketch.BloomFilter Nathan Conroy
[no subject] Рамик И
- Re: Mich Talebzadeh
Announcing the Community Over Code 2024 Streaming Track James Hughes
[ANNOUNCE] Apache Kyuubi released 1.9.0 Binjie Yang
pyspark - Use Spark to generate a large dataset on the fly Sreyan Chakravarty
- pyspark - Use Spark to generate a large dataset on the fly Sreyan Chakravarty
A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Mich Talebzadeh
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community ashok34...@yahoo.com.INVALID
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Parsian, Mahmoud
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Mich Talebzadeh
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Hyukjin Kwon
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Code Tutelage
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Deepak Sharma
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Bjørn Jørgensen
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Mich Talebzadeh
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Reynold Xin
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Mich Talebzadeh
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Joris Billen
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Mich Talebzadeh
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Varun Shah
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Farshid Ashouri
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Kiran Kumar Dusi
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Jay Han
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Winston Lai
[GraphX]: Prevent recomputation of DAG Marek Berith
- Re: [GraphX]: Prevent recomputation of DAG Mich Talebzadeh
Python library that generates fake data using Faker Mich Talebzadeh
Requesting further assistance with Spark Scala code coverage 里昂
pyspark - Where are Dataframes created from Python objects stored? Sreyan Chakravarty
- Re: pyspark - Where are Dataframes created from Python objects stored? Mich Talebzadeh
- Re: pyspark - Where are Dataframes created from Python objects stored? Sreyan Chakravarty
- Re: pyspark - Where are Dataframes created from Python objects stored? Mich Talebzadeh
- Re: pyspark - Where are Dataframes created from Python objects stored? Sreyan Chakravarty
- Re: pyspark - Where are Dataframes created from Python objects stored? Varun Shah
Data ingestion into elastic failing using pyspark Karthick Nk
Bug in How to Monitor Streaming Queries in PySpark Mich Talebzadeh
- Re: Bug in How to Monitor Streaming Queries in PySpark 刘唯
- Re: Bug in How to Monitor Streaming Queries in PySpark 刘唯
- Re: Bug in How to Monitor Streaming Queries in PySpark Mich Talebzadeh
- Re: Bug in How to Monitor Streaming Queries in PySpark 刘唯
- Re: Bug in How to Monitor Streaming Queries in PySpark Mich Talebzadeh
Spark on Kubenets, execute dataset.show raise exceptions BODY NO
Spark-UI stages and other tabs not accessible in standalone mode when reverse-proxy is enabled sharad mishra
- Spark-UI stages and other tabs not accessible in standalone mode when reverse-proxy is enabled sharad mishra
Creating remote tables using PySpark Tom Barber
- Re: Creating remote tables using PySpark Tom Barber
- Re: Creating remote tables using PySpark Tom Barber
- Re: Creating remote tables using PySpark Mich Talebzadeh
Dark mode logo Mike Drob
S3 committer for dynamic partitioning Nikhil Goyal
It seems --py-files only takes the first two arguments. Can someone please confirm? Pedro, Chuck
- Re: It seems --py-files only takes the first two arguments. Can someone please confirm? Mich Talebzadeh
- Re: It seems --py-files only takes the first two arguments. Can someone please confirm? Mich Talebzadeh
Working with a text file that is both compressed by bz2 followed by zip in PySpark Mich Talebzadeh
pyspark dataframe join with two different data type Karthick Nk
- Re: pyspark dataframe join with two different data type Mich Talebzadeh
[ANNOUNCE] Apache Spark 3.5.1 released Jungtaek Lim
- Re:[ANNOUNCE] Apache Spark 3.5.1 released beliefer
- Re: [ANNOUNCE] Apache Spark 3.5.1 released Dongjoon Hyun
- Re: [ANNOUNCE] Apache Spark 3.5.1 released Xinrong Meng
- Re: [ANNOUNCE] Apache Spark 3.5.1 released Prem Sahoo
- Re: [ANNOUNCE] Apache Spark 3.5.1 released Peter Toth
- Re: [ANNOUNCE] Apache Spark 3.5.1 released John Zhuge
- Re: [ANNOUNCE] Apache Spark 3.5.1 released Dongjoon Hyun
- Re: [ANNOUNCE] Apache Spark 3.5.1 released Jungtaek Lim
- Re: [ANNOUNCE] Apache Spark 3.5.1 released Jungtaek Lim
- Re: [ANNOUNCE] Apache Spark 3.5.1 released Hyukjin Kwon
- Re: [ANNOUNCE] Apache Spark 3.5.1 released Jungtaek Lim
- Re: [ANNOUNCE] Apache Spark 3.5.1 released yangjie01
- 答复: [ANNOUNCE] Apache Spark 3.5.1 released Pan,Bingkun
- Re: [ANNOUNCE] Apache Spark 3.5.1 released Jungtaek Lim
- 答复: [ANNOUNCE] Apache Spark 3.5.1 released Pan,Bingkun
- Re: [ANNOUNCE] Apache Spark 3.5.1 released Jungtaek Lim
- 答复: [ANNOUNCE] Apache Spark 3.5.1 released Pan,Bingkun
- Re: [ANNOUNCE] Apache Spark 3.5.1 released Jungtaek Lim
- 答复: [ANNOUNCE] Apache Spark 3.5.1 released Pan,Bingkun
[Spark Core] Potential bug in JavaRDD#countByValue Stuart Fehr
- Re: [Spark Core] Potential bug in JavaRDD#countByValue Mich Talebzadeh
Bugs with joins and SQL in Structured Streaming Andrzej Zera
- Re: Bugs with joins and SQL in Structured Streaming Mich Talebzadeh
- Re: Bugs with joins and SQL in Structured Streaming Andrzej Zera
- Re: Bugs with joins and SQL in Structured Streaming Andrzej Zera
Re: Bintray replacement for spark-packages.org Richard Eggert
Issue of spark with antlr version Chawla, Parul
- RE: Issue of spark with antlr version Sahni, Ashima
- Re: Issue of spark with antlr version Mich Talebzadeh
- Re: Issue of spark with antlr version Bjørn Jørgensen
- Re: [External] Re: Issue of spark with antlr version Chawla, Parul
- Re: [External] Re: Issue of spark with antlr version Bjørn Jørgensen
- Re: [External] Re: Issue of spark with antlr version Chawla, Parul
- Re: [External] Re: Issue of spark with antlr version Bjørn Jørgensen
Re: AQE coalesce 60G shuffle data into a single partition Enrico Minack
[Beginner Debug]: Executor OutOfMemoryError Shawn Ligocki
- Re: [Beginner Debug]: Executor OutOfMemoryError Mich Talebzadeh
Kafka-based Spark Streaming and Vertex AI for Sentiment Analysis Mich Talebzadeh
[ANNOUNCE] Apache Kyuubi 1.8.1 is available Cheng Pan
Re: Spark 3.3 Query Analyzer Bug Report Sharma, Anup
Spark 4.0 Query Analyzer Bug Report Sharma, Anup
- Re: Spark 4.0 Query Analyzer Bug Report Holden Karau
- Re: Spark 4.0 Query Analyzer Bug Report Mich Talebzadeh
Community Over Code Asia 2024 Travel Assistance Applications now open! Gavin McDonald
[Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures Sri Potluri
- Re: [Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures Mich Talebzadeh
- Re: [Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures Mich Talebzadeh
- Re: [Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures Sri Potluri
- Re: [Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures Mich Talebzadeh
- Re: [Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures Cheng Pan
Regarding Spark on Kubernetes(EKS) Jagannath Majhi
- Re: Regarding Spark on Kubernetes(EKS) Richard Smith
- Re: Regarding Spark on Kubernetes(EKS) Mich Talebzadeh
- Re: Regarding Spark on Kubernetes(EKS) Jagannath Majhi
- Re: Regarding Spark on Kubernetes(EKS) Mich Talebzadeh
- Re: Regarding Spark on Kubernetes(EKS) Mich Talebzadeh
- Re: Regarding Spark on Kubernetes(EKS) Jagannath Majhi
Re: Re-create SparkContext of SparkSession inside long-lived Spark app Adam Binford
- Re: Re-create SparkContext of SparkSession inside long-lived Spark app Jörn Franke
- Re: Re-create SparkContext of SparkSession inside long-lived Spark app Mich Talebzadeh
- Re: Re-create SparkContext of SparkSession inside long-lived Spark app Saha, Daniel
- Re: Re-create SparkContext of SparkSession inside long-lived Spark app Mich Talebzadeh
Re: job uuid not unique Mich Talebzadeh
- Re: job uuid not unique Xin Zhang
Effectively append the dataset to avro directory Rushikesh Kavar
Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow Chao Sun

Earlier messages