Messages by Date
-
2022/06/17
Re: how to properly filter a dataset by dates ?
marc nicole
-
2022/06/17
Re: how to properly filter a dataset by dates ?
Stelios Philippou
-
2022/06/17
Re: how to properly filter a dataset by dates ?
marc nicole
-
2022/06/17
Re: how to properly filter a dataset by dates ?
Stelios Philippou
-
2022/06/17
Re: how to properly filter a dataset by dates ?
marc nicole
-
2022/06/17
Re: how to properly filter a dataset by dates ?
Sean Owen
-
2022/06/17
Re: how to properly filter a dataset by dates ?
marc nicole
-
2022/06/17
Re: how to properly filter a dataset by dates ?
Sean Owen
-
2022/06/17
how to properly filter a dataset by dates ?
marc nicole
-
2022/06/16
How to update TaskMetrics from Python?
Shay Elbaz
-
2022/06/15
Spark Structured streaming(batch mode) - running dependent jobs concurrently
karan alang
-
2022/06/15
Re: How to recognize and get the min of a date/string column in Java?
marc nicole
-
2022/06/14
Re: Stickers and Swag
Qian Sun
-
2022/06/14
Re: Stickers and Swag
Reynold Xin
-
2022/06/14
Re: Stickers and Swag
Gengliang Wang
-
2022/06/14
Re: Stickers and Swag
Hyukjin Kwon
-
2022/06/14
Re: How to recognize and get the min of a date/string column in Java?
marc nicole
-
2022/06/14
Re: How to recognize and get the min of a date/string column in Java?
Sean Owen
-
2022/06/14
Re: How to recognize and get the min of a date/string column in Java?
marc nicole
-
2022/06/14
Re: How to recognize and get the min of a date/string column in Java?
marc nicole
-
2022/06/14
Re: How to recognize and get the min of a date/string column in Java?
Sean Owen
-
2022/06/14
How to recognize and get the min of a date/string column in Java?
marc nicole
-
2022/06/13
Stickers and Swag
Xiao Li
-
2022/06/13
Re: API Problem
Enrico Minack
-
2022/06/13
Re: Redesign approach for hitting the APIs using PySpark
Sid
-
2022/06/13
Re: Redesign approach for hitting the APIs using PySpark
Gourav Sengupta
-
2022/06/13
Re: Redesign approach for hitting the APIs using PySpark
Sid
-
2022/06/13
Re: Redesign approach for hitting the APIs using PySpark
Gourav Sengupta
-
2022/06/13
Re: Redesign approach for hitting the APIs using PySpark
Sid
-
2022/06/13
Re: Redesign approach for hitting the APIs using PySpark
Gourav Sengupta
-
2022/06/13
Redesign approach for hitting the APIs using PySpark
Sid
-
2022/06/11
Re: API Problem
Sid
-
2022/06/10
Re: API Problem
Enrico Minack
-
2022/06/10
Re: API Problem
Sid
-
2022/06/10
Re: API Problem
Enrico Minack
-
2022/06/10
Re:
Aironman DirtDiver
-
2022/06/10
Re: API Problem
Enrico Minack
-
2022/06/10
Re: API Problem
Sid
-
2022/06/10
[no subject]
Rodrigo
-
2022/06/10
Re: API Problem
Stelios Philippou
-
2022/06/10
Re: API Problem
Sid
-
2022/06/09
Re: API Problem
Sean Owen
-
2022/06/09
Spark streaming / confluent Kafka- messages are empty
KhajaAsmath Mohammed
-
2022/06/09
Re: API Problem
Stelios Philippou
-
2022/06/09
API Problem
Sid
-
2022/06/09
Re: to find Difference of locations in Spark Dataframe rows
Bjørn Jørgensen
-
2022/06/09
Re: Retrieve the count of spark nodes
Poorna Murali
-
2022/06/08
Re: Retrieve the count of spark nodes
Stephen Coy
-
2022/06/08
Retrieve the count of spark nodes
Poorna Murali
-
2022/06/07
to find Difference of locations in Spark Dataframe rows
Chetan Khatri
-
2022/06/07
Re: How the data is distributed
Sid
-
2022/06/06
Re: How the data is distributed
Sean Owen
-
2022/06/06
Re: How the data is distributed
Peyman Mohajerian
-
2022/06/06
How the data is distributed
Sid
-
2022/06/06
Structured streaming with protobuf proto3 schema registry
Kiran Biswal
-
2022/06/06
Re: How to convert a Dataset<Row> to a Dataset<String>?
Stelios Philippou
-
2022/06/06
Re: How to convert a Dataset<Row> to a Dataset<String>?
Christophe Préaud
-
2022/06/04
Re: partitionBy creating lot of small files
Enrico Minack
-
2022/06/04
Re: How to convert a Dataset<Row> to a Dataset<String>?
marc nicole
-
2022/06/04
Re: How to convert a Dataset<Row> to a Dataset<String>?
Enrico Minack
-
2022/06/04
Re: How to convert a Dataset<Row> to a Dataset<String>?
marc nicole
-
2022/06/04
partitionBy creating lot of small files
Nikhil Goyal
-
2022/06/04
Re: How to convert a Dataset<Row> to a Dataset<String>?
Enrico Minack
-
2022/06/04
Re: How to convert a Dataset<Row> to a Dataset<String>?
marc nicole
-
2022/06/04
Re: How to convert a Dataset<Row> to a Dataset<String>?
Sean Owen
-
2022/06/04
Re: How to convert a Dataset<Row> to a Dataset<String>?
marc nicole
-
2022/06/04
Re: How to convert a Dataset<Row> to a Dataset<String>?
Sean Owen
-
2022/06/04
How to convert a Dataset<Row> to a Dataset<String>?
marc nicole
-
2022/06/03
Re: PartitionBy and SortWithinPartitions
Nikhil Goyal
-
2022/06/03
Re: PartitionBy and SortWithinPartitions
Enrico Minack
-
2022/06/03
PartitionBy and SortWithinPartitions
Nikhil Goyal
-
2022/06/02
approx_count_distinct in spark always return 1
marc nicole
-
2022/06/02
Does adaptive auto broadcast respect spark.sql.autoBroadcastJoinThreshold
Henry Quan
-
2022/06/01
What's the expected Spark 3.1.4 release date ?
Sandeep Vinayak
-
2022/05/31
Re: Job migrated from EMR to Dataproc takes 20 hours instead of 90 minutes
Gourav Sengupta
-
2022/05/31
Kotlin API for Apache Spark feedback
finkel
-
2022/05/31
Unsubscribe
Daan Stroep
-
2022/05/30
Re: Job migrated from EMR to Dataproc takes 20 hours instead of 90 minutes
Ranadip Chatterjee
-
2022/05/30
Re: protobuf data as input to spark streaming
Kiran Biswal
-
2022/05/30
Re: Unable to format timestamp values in pyspark
Sid
-
2022/05/30
Re: Unable to format timestamp values in pyspark
Stelios Philippou
-
2022/05/30
Unable to format timestamp values in pyspark
Sid
-
2022/05/30
Re: Job migrated from EMR to Dataproc takes 20 hours instead of 90 minutes
Ori Popowski
-
2022/05/29
Re: Unable to convert double values
marc nicole
-
2022/05/29
Re: Unable to convert double values
marc nicole
-
2022/05/29
Re: Unable to convert double values
Stelios Philippou
-
2022/05/29
Unable to convert double values
Sid
-
2022/05/28
k-anonymity with Spark in Java
marc nicole
-
2022/05/28
Re: Spark Push-Based Shuffle causing multiple stage failures
Ye Zhou
-
2022/05/27
Re: Job migrated from EMR to Dataproc takes 20 hours instead of 90 minutes
Aniket Mokashi
-
2022/05/26
Re: Issues getting Apache Spark
Apostolos N. Papadopoulos
-
2022/05/26
Issues getting Apache Spark
Martin, Michael
-
2022/05/26
Re: Complexity with the data
Sid
-
2022/05/26
Re: Complexity with the data
Gourav Sengupta
-
2022/05/26
java.lang.NoSuchMethodError: org.apache.hadoop.hive.common.FileUtils.mkdir --> Spark to Hive
Prasanth M Sasidharan
-
2022/05/26
Re: Complexity with the data
Bjørn Jørgensen
-
2022/05/26
Re: Complexity with the data
Sid
-
2022/05/26
Fwd: java.lang.NoSuchMethodError: org.apache.hadoop.hive.common.FileUtils.mkdir --> Spark to Hive
Prasanth M Sasidharan
-
2022/05/26
Re: Complexity with the data
Bjørn Jørgensen
-
2022/05/26
Re: Complexity with the data
Sid
-
2022/05/26
Re: Complexity with the data
Apostolos N. Papadopoulos
-
2022/05/26
Re: Complexity with the data
Sid
-
2022/05/26
Re: Complexity with the data
Bjørn Jørgensen
-
2022/05/26
Re: Complexity with the data
Sid
-
2022/05/25
Re: Spark Push-Based Shuffle causing multiple stage failures
Han Altae-Tran
-
2022/05/25
Re: Complexity with the data
Bjørn Jørgensen
-
2022/05/25
Re: Complexity with the data
Sid
-
2022/05/25
Re: Complexity with the data
Sid
-
2022/05/25
Re: Complexity with the data
Gavin Ray
-
2022/05/25
Re: Complexity with the data
Sid
-
2022/05/25
Re: Complexity with the data
Apostolos N. Papadopoulos
-
2022/05/25
Complexity with the data
Sid
-
2022/05/25
[SPARK SQL] Spark Thrift server, It is not releasing memory.
Ramakrishna Chilaka
-
2022/05/25
Re: Job migrated from EMR to Dataproc takes 20 hours instead of 90 minutes
Ranadip Chatterjee
-
2022/05/24
Re: Job migrated from EMR to Dataproc takes 20 hours instead of 90 minutes
Ori Popowski
-
2022/05/24
Re: Spark Push-Based Shuffle causing multiple stage failures
Ye Zhou
-
2022/05/24
Re: Spark Push-Based Shuffle causing multiple stage failures
Mridul Muralidharan
-
2022/05/24
GCP Dataproc - adding multiple packages(kafka, mongodb) while submitting spark jobs not working
karan alang
-
2022/05/24
Re: Job migrated from EMR to Dataproc takes 20 hours instead of 90 minutes
Ranadip Chatterjee
-
2022/05/24
Re: Problem with implementing the Datasource V2 API for Salesforce
Gourav Sengupta
-
2022/05/23
Re: how to add a column for percent
Raghavendra Ganesh
-
2022/05/23
Spark Push-Based Shuffle causing multiple stage failures
Han Altae-Tran
-
2022/05/22
how to add a column for percent
wilson
-
2022/05/21
Problem with implementing the Datasource V2 API for Salesforce
Rohit Pant
-
2022/05/19
Re: [Spark SQL]: Does Spark SQL support WAITFOR?
Someshwar Kale
-
2022/05/19
Re: [Spark SQL]: Does Spark SQL support WAITFOR?
Artemis User
-
2022/05/19
Re: [Spark SQL]: Does Spark SQL support WAITFOR?
K. N. Ramachandran
-
2022/05/19
Final reminder: ApacheCon North America call for presentations closing soon
Rich Bowen
-
2022/05/18
Re: A scene with unstable Spark performance
Chang Chen
-
2022/05/18
Re: [Spark SQL]: Configuring/Using Spark + Catalyst optimally for read-heavy transactional workloads in JDBC sources?
Gavin Ray
-
2022/05/18
[SQL] Why does a small two-source JDBC query take ~150-200ms with all optimizations (AQE, CBO, pushdown, Kryo, unsafe) enabled? (v3.4.0-SNAPSHOT)
Gavin Ray
-
2022/05/18
Spark 3 migration question
Jason Xu
-
2022/05/18
Re: What does Apache Spark do?
Pasha Finkelshtein
-
2022/05/18
What does Apache Spark do?
Turritopsis Dohrnii Teo En Ming
-
2022/05/18
Stopping streaming after the write commit and before the read commit?
kineret M
-
2022/05/17
Re: A scene with unstable Spark performance
Sungwoo Park
-
2022/05/17
Re: A scene with unstable Spark performance
Bowen Song
-
2022/05/17
Re: A scene with unstable Spark performance
Qian SUN
-
2022/05/17
Re: [Spark SQL]: Does Spark SQL support WAITFOR?
Sean Owen
-
2022/05/17
Re: [Spark SQL]: Does Spark SQL support WAITFOR?
K. N. Ramachandran
-
2022/05/17
Re: Reverse proxy for Spark UI on Kubernetes
bo yang
-
2022/05/17
Re: Reverse proxy for Spark UI on Kubernetes
Holden Karau
-
2022/05/17
Re: Reverse proxy for Spark UI on Kubernetes
bo yang
-
2022/05/17
Re: Reverse proxy for Spark UI on Kubernetes
bo yang
-
2022/05/17
A scene with unstable Spark performance
Bowen Song
-
2022/05/16
Re: Reverse proxy for Spark UI on Kubernetes
wilson
-
2022/05/16
Re: Reverse proxy for Spark UI on Kubernetes
Holden Karau
-
2022/05/16
Reverse proxy for Spark UI on Kubernetes
bo yang
-
2022/05/16
[Spark SQL]: Configuring/Using Spark + Catalyst optimally for read-heavy transactional workloads in JDBC sources?
Gavin Ray
-
2022/05/15
[Spark SQL]: Does Spark SQL support WAITFOR?
K. N. Ramachandran
-
2022/05/15
RE: [EXTERNAL] Re: Spark on K8s - repeating annoying exception
Shay Elbaz
-
2022/05/13
Re: Spark on K8s - repeating annoying exception
Martin Grigorov
-
2022/05/09
Structured streaming help on releasing memory
Xavi Gervilla
-
2022/05/09
Spark on K8s - repeating annoying exception
Shay Elbaz
-
2022/05/09
Re: How do I read parquet with python object
Sean Owen
-
2022/05/09
How do I read parquet with python object
ben
-
2022/05/08
Need help on migrating Spark on Hortonworks to Kubernetes Cluster
Chetan Khatri
-
2022/05/07
Re: Count() action leading to errors | Pyspark
Bjørn Jørgensen
-
2022/05/06
Count() action leading to errors | Pyspark
Sid
-
2022/05/05
Re: groupby question
wilson
-
2022/05/05
Re: [EXTERNAL] Parse Execution Plan from PySpark
Pablo Alcain
-
2022/05/05
groupby question
Irene Markelic
-
2022/05/05
Re: Something about Spark which has bothered me for a very long time, which I've never understood
Lalwani, Jayesh
-
2022/05/05
Something about Spark which has bothered me for a very long time, which I've never understood
Denarian Kislata
-
2022/05/05
Kafka Spark Structure Streaming Error
nayan sharma
-
2022/05/05
Re: Disable/Remove datasources in Spark
wilson
-
2022/05/05
Re: Disable/Remove datasources in Spark
wilson
-
2022/05/05
Re: Disable/Remove datasources in Spark
Aditya
-
2022/05/05
Re: Disable/Remove datasources in Spark
wilson
-
2022/05/04
Disable/Remove datasources in Spark
Aditya
-
2022/05/04
Re: structured streaming- checkpoint metadata growing indefinetely
Wojciech Indyk
-
2022/05/04
Re: Spark error with jupyter
Gourav Sengupta
-
2022/05/03
trouble using spark in kubernetes
Andreas Klos
-
2022/05/03
Re: Spark error with jupyter
Bjørn Jørgensen
-
2022/05/03
REMINDER - Travel Assistance available for ApacheCon NA New Orleans 2022
Gavin McDonald
-
2022/05/03
Re: [EXTERNAL] Parse Execution Plan from PySpark
Walaa Eldin Moustafa
-
2022/05/03
RE: [EXTERNAL] Parse Execution Plan from PySpark
Shay Elbaz
-
2022/05/02
Parse Execution Plan from PySpark
Pablo Alcain
-
2022/05/02
unsubscribe
Ahmed Kamal Abdelfatah
-
2022/05/02
unsubscribe
Ray Qiu
-
2022/05/02
Re: Vulnerabilities in htrace-core4-4.1.0-incubating.jar jar used in spark.
Artemis User
-
2022/05/02
Unsubscribe
Sahil Bali
-
2022/05/02
Re: how spark handle the abnormal values
wilson
-
2022/05/02
Re: how spark handle the abnormal values
Mich Talebzadeh
-
2022/05/01
Re: Vulnerabilities in htrace-core4-4.1.0-incubating.jar jar used in spark.
HARSH TAKKAR
-
2022/05/01
Re: how spark handle the abnormal values
Artemis User
-
2022/05/01
Re: spark null values calculation
wilson
-
2022/05/01
Re: how spark handle the abnormal values
wilson
-
2022/05/01
Idea for improving performance when reading from hive-like partition folders and specifying a filter [Spark 3.2]
Martin
-
2022/05/01
how spark handle the abnormal values
wilson
-
2022/04/30
spark null values calculation
wilson
-
2022/04/30
Re: structured streaming- checkpoint metadata growing indefinetely
Wojciech Indyk
-
2022/04/29
Re: structured streaming- checkpoint metadata growing indefinetely
Gourav Sengupta
-
2022/04/29
Re: structured streaming- checkpoint metadata growing indefinetely
Wojciech Indyk
-
2022/04/28
structured streaming- checkpoint metadata growing indefinetely
Wojciech Indyk
-
2022/04/28
Unsubscribe
Sahil Bali
-
2022/04/28
Re: Unsubscribe
wilson
-
2022/04/28
Unsubscribe
Ajay Thompson
-
2022/04/28
Re: Reg: CVE-2020-9480
Sean Owen
-
2022/04/28
Reg: CVE-2020-9480
Sundar Sabapathi Meenakshi