Messages by Date
-
2023/02/10
How to improve efficiency of this piece of code (returning distinct column values)
sam smith
-
2023/02/10
Re:
Sunil Prabhakara
-
2023/02/09
Fwd: [Spark SQL] : Delete is only supported on V2 tables.
Jeevan Chhajed
-
2023/02/09
Executor metrics are missing on prometheus sink
Qian Sun
-
2023/02/09
Jira Account for Contributions
Jack Goodson
-
2023/02/09
Unsubscribe
Patrik Medvedev
-
2023/02/08
Re: Unsubscribe
LinuxGuy
-
2023/02/08
[Spark SQL]: Spark 3.2 generates different results to query when columns name have mixed casing vs when they have same casing
Amit Singh Rathore
-
2023/02/08
Unsubscribe
fuwei901
-
2023/02/08
Is sparkSession.sql now an action in Spark 3 and later?
Sayeh Roshan
-
2023/02/08
Re: Graceful shutdown SPARK Structured Streaming
Brian Wylie
-
2023/02/08
Unsubscribe
fuwei901
-
2023/02/07
Unsubscribe
Tushar Machavolu
-
2023/02/07
Re: Spark with GPU
Alessandro Bellina
-
2023/02/07
Re: How to upgrade a spark structure streaming application
Mich Talebzadeh
-
2023/02/07
Fwd: Graceful shutdown SPARK Structured Streaming
Mich Talebzadeh
-
2023/02/07
[Spark SQL] : Delete is only supported on V2 tables.
Jeevan Chhajed
-
2023/02/07
How to upgrade a spark structure streaming application
Yoel Benharrous
-
2023/02/07
SQL GROUP BY alias with dots, was: Spark SQL question
Enrico Minack
-
2023/02/07
Unsubscribe
Spyros Gasteratos
-
2023/02/05
big data products
LinuxGuy
-
2023/02/05
Re: Spark with GPU
Jack Goodson
-
2023/02/05
Re: Spark with GPU
Mich Talebzadeh
-
2023/02/05
Spark with GPU
Irene Markelic
-
2023/02/02
Re: Create table before inserting in SQL
Harut Martirosyan
-
2023/02/02
Re: Create table before inserting in SQL
Mich Talebzadeh
-
2023/02/02
Re: Create table before inserting in SQL
Harut Martirosyan
-
2023/02/02
Re: Create table before inserting in SQL
Harut Martirosyan
-
2023/02/01
Re: Create table before inserting in SQL
Mich Talebzadeh
-
2023/02/01
Create table before inserting in SQL
Harut Martirosyan
-
2023/02/01
Spark Thrift Server issue with external HDFS table
Kalhara Gurugamage
-
2023/01/31
What is DataFilters and while joining why is the filter isnotnull[joinKey] applied twice
Nitin Siwach
-
2023/01/31
Fwd: [Spark Standalone Mode] How to read from kerberised HDFS in spark standalone mode
Wei Yan
-
2023/01/31
[Spark/deeplyR] how come spark is caching tables read through jdbc connection from oracle, even when memory=false is chosen
Joris Billen
-
2023/01/30
Re: Help needed regarding error with 5 node Spark cluster (shuffle error)- Comcast
Artemis User
-
2023/01/30
Re: Help needed regarding error with 5 node Spark cluster (shuffle error)- Comcast
Mich Talebzadeh
-
2023/01/30
Help needed regarding error with 5 node Spark cluster (shuffle error)- Comcast
Jain, Sanchi
-
2023/01/30
Re: Re: spark+kafka+dynamic resource allocation
Mich Talebzadeh
-
2023/01/29
Re: Re: spark+kafka+dynamic resource allocation
Lingzhe Sun
-
2023/01/29
Re: Re: spark+kafka+dynamic resource allocation
Mich Talebzadeh
-
2023/01/28
Re: Re: spark+kafka+dynamic resource allocation
Lingzhe Sun
-
2023/01/28
Fwd: Spark-submit doesn't load all app classes in the classpath
Soheil Pourbafrani
-
2023/01/28
Re: spark+kafka+dynamic resource allocation
ashok34...@yahoo.com.INVALID
-
2023/01/28
Re: Spark SQL question
Bjørn Jørgensen
-
2023/01/28
Re: Spark SQL question
Mich Talebzadeh
-
2023/01/27
spark+kafka+dynamic resource allocation
Lingzhe Sun
-
2023/01/27
Spark SQL question
Kohki Nishio
-
2023/01/27
Re: Question regarding Spark 3.X performance
Athanasios Kordelas
-
2023/01/27
Re: Question regarding Spark 3.X performance
Mich Talebzadeh
-
2023/01/26
Re: Question regarding Spark 3.X performance
Mich Talebzadeh
-
2023/01/26
Re: Question regarding Spark 3.X performance
Mich Talebzadeh
-
2023/01/26
Question regarding Spark 3.X performance
Athanasios Kordelas
-
2023/01/23
Re: Dynamic Scaling without Kubernetes
Mich Talebzadeh
-
2023/01/23
Re: Duplicates in Collaborative Filtering Output
Kartik Ohri
-
2023/01/23
Unsubscribe
Calum
-
2023/01/22
Duplicates in Collaborative Filtering Output
Kartik Ohri
-
2023/01/22
Re: Any advantages of using sql.adaptive.autoBroadcastJoinThreshold over sql.autoBroadcastJoinThreshold?
Balakrishnan Ayyappan
-
2023/01/22
Any advantages of using sql.adaptive.autoBroadcastJoinThreshold over sql.autoBroadcastJoinThreshold?
Soumyadeep Mukhopadhyay
-
2023/01/21
Re: Table created with saveAsTable behaves differently than a table created with spark.sql("CREATE TABLE....)
krexos
-
2023/01/21
Re: Table created with saveAsTable behaves differently than a table created with spark.sql("CREATE TABLE....)
Peyman Mohajerian
-
2023/01/21
Table created with saveAsTable behaves differently than a table created with spark.sql("CREATE TABLE....)
krexos
-
2023/01/20
unsubscribe
peng
-
2023/01/20
Writing protobuf RDD to parquet
David Diebold
-
2023/01/19
unsubscribe
김병찬
-
2023/01/19
[Spark Standalone Mode] How to read from kerberised HDFS in spark standalone mode
Bansal, Jaimita
-
2023/01/19
How to check the liveness of a SparkSession
Yeachan Park
-
2023/01/18
Re: [PySPark] How to check if value of one column is in array of another column
Oliver Ruebenacker
-
2023/01/17
Re: [PySPark] How to check if value of one column is in array of another column
Sean Owen
-
2023/01/17
[PySPark] How to check if value of one column is in array of another column
Oliver Ruebenacker
-
2023/01/15
Is there any Job/Career channel
Chetan Khatri
-
2023/01/14
[Spark SQL] Data duplicate or data lost with non-deterministic function
李建伟
-
2023/01/13
Re: pyspark.sql.dataframe.DataFrame versus pyspark.pandas.frame.DataFrame
Sean Owen
-
2023/01/12
pyspark.sql.dataframe.DataFrame versus pyspark.pandas.frame.DataFrame
second_co...@yahoo.com.INVALID
-
2023/01/11
unsubscribe
Sebastian Schere
-
2023/01/11
[UNSUBSCRIBE]
Sebastian Schere
-
2023/01/10
[pyspark/pandas] Pandas UDF accepting more than 2 pandas dataframe when cogroup + applyInPandas?
pzm6...@hotmail.com
-
2023/01/08
Re: Hive 3 has big performance improvement from my test
Mich Talebzadeh
-
2023/01/07
Re: Hive 3 has big performance improvement from my test
Mich Talebzadeh
-
2023/01/06
Re: [pyspark/sparksql]: How to overcome redundant/repetitive code? Is a for loop over an sql statement with a variable a bad idea?
Sean Owen
-
2023/01/06
[pyspark/sparksql]: How to overcome redundant/repetitive code? Is a for loop over an sql statement with a variable a bad idea?
Joris Billen
-
2023/01/06
Re: [PySpark] Error using SciPy: ValueError: numpy.ndarray size changed, may indicate binary incompatibility. Expected 88 from C header, got 80 from PyObject
Oliver Ruebenacker
-
2023/01/06
Re: [PySpark] Error using SciPy: ValueError: numpy.ndarray size changed, may indicate binary incompatibility. Expected 88 from C header, got 80 from PyObject
Bjørn Jørgensen
-
2023/01/06
Re: [PySpark] Error using SciPy: ValueError: numpy.ndarray size changed, may indicate binary incompatibility. Expected 88 from C header, got 80 from PyObject
Mich Talebzadeh
-
2023/01/06
Re: [PySpark] Error using SciPy: ValueError: numpy.ndarray size changed, may indicate binary incompatibility. Expected 88 from C header, got 80 from PyObject
Oliver Ruebenacker
-
2023/01/06
Re: [PySpark] Error using SciPy: ValueError: numpy.ndarray size changed, may indicate binary incompatibility. Expected 88 from C header, got 80 from PyObject
Bjørn Jørgensen
-
2023/01/06
[PySpark] Error using SciPy: ValueError: numpy.ndarray size changed, may indicate binary incompatibility. Expected 88 from C header, got 80 from PyObject
Oliver Ruebenacker
-
2023/01/06
Re: Spark reading from HBase using hbase-connectors - any benefit from localization?
Aaron Grubb
-
2023/01/05
Re: Spark reading from HBase using hbase-connectors - any benefit from localization?
Mich Talebzadeh
-
2023/01/05
Re: Spark reading from HBase using hbase-connectors - any benefit from localization?
Aaron Grubb
-
2023/01/05
Re: Spark reading from HBase using hbase-connectors - any benefit from localization?
Mich Talebzadeh
-
2023/01/05
Re: Got Error Creating permanent view in Postgresql through Pyspark code
ayan guha
-
2023/01/05
Re: GPU Support
Sean Owen
-
2023/01/05
Re: Got Error Creating permanent view in Postgresql through Pyspark code
Stelios Philippou
-
2023/01/05
GPU Support
K B M Kaala Subhikshan
-
2023/01/05
Re: [EXTERNAL] Re: Incorrect csv parsing when delimiter used within the data
Saurabh Gulati
-
2023/01/05
Re: [EXTERNAL] Re: Incorrect csv parsing when delimiter used within the data
Saurabh Gulati
-
2023/01/05
Re: [EXTERNAL] Re: Re: Incorrect csv parsing when delimiter used within the data
Saurabh Gulati
-
2023/01/05
Spark reading from HBase using hbase-connectors - any benefit from localization?
Aaron Grubb
-
2023/01/05
Re: How to set a config for a single query?
Khalid Mammadov
-
2023/01/04
Re: [EXTERNAL] Re: Incorrect csv parsing when delimiter used within the data
Sean Owen
-
2023/01/04
Re: [EXTERNAL] Re: Re: Incorrect csv parsing when delimiter used within the data
Shay Elbaz
-
2023/01/04
Re: [EXTERNAL] Re: Incorrect csv parsing when delimiter used within the data
Saurabh Gulati
-
2023/01/04
Re: Got Error Creating permanent view in Postgresql through Pyspark code
Stelios Philippou
-
2023/01/04
Re: [EXTERNAL] Re: Incorrect csv parsing when delimiter used within the data
Sean Owen
-
2023/01/04
Got Error Creating permanent view in Postgresql through Pyspark code
Vajiha Begum S A
-
2023/01/04
[BUG?] How to handle with special characters or scape them on spark version 3.3.0?
Vieira, Thiago
-
2023/01/04
Re: [EXTERNAL] Re: Incorrect csv parsing when delimiter used within the data
Saurabh Gulati
-
2023/01/04
Re: How to set a config for a single query?
Shay Elbaz
-
2023/01/04
Re: How to set a config for a single query?
Saurabh Gulati
-
2023/01/04
Re: Incorrect csv parsing when delimiter used within the data
Mich Talebzadeh
-
2023/01/03
How to set a config for a single query?
Felipe Pessoto
-
2023/01/03
[SparkR] Compare datetime with Sys.time() throws error in R (>= 4.2.0)
Vivek Atal
-
2023/01/03
Re: Incorrect csv parsing when delimiter used within the data
Sean Owen
-
2023/01/03
Re: Incorrect csv parsing when delimiter used within the data
Mich Talebzadeh
-
2023/01/03
Re: Incorrect csv parsing when delimiter used within the data
Sean Owen
-
2023/01/03
Incorrect csv parsing when delimiter used within the data
Saurabh Gulati
-
2023/01/02
Re: Spark migration from 2.3 to 3.0.1
Shrikant Prasad
-
2023/01/02
Re: Spark migration from 2.3 to 3.0.1
Sean Owen
-
2023/01/02
Re: Spark migration from 2.3 to 3.0.1
Shrikant Prasad
-
2023/01/02
Re: Spark migration from 2.3 to 3.0.1
Sean Owen
-
2023/01/02
Re: Spark migration from 2.3 to 3.0.1
Shrikant Prasad
-
2023/01/02
Re: Spark migration from 2.3 to 3.0.1
Sean Owen
-
2023/01/02
Re: Spark migration from 2.3 to 3.0.1
Shrikant Prasad
-
2023/01/02
Re: Spark migration from 2.3 to 3.0.1
Sean Owen
-
2023/01/02
Re: Spark migration from 2.3 to 3.0.1
Shrikant Prasad
-
2023/01/02
Re: Spark migration from 2.3 to 3.0.1
Sean Owen
-
2023/01/02
Re: Spark migration from 2.3 to 3.0.1
Shrikant Prasad
-
2023/01/02
Re: Spark migration from 2.3 to 3.0.1
Stelios Philippou
-
2023/01/02
Spark migration from 2.3 to 3.0.1
Shrikant Prasad
-
2022/12/30
Help with ClassNotFoundException: org.apache.spark.internal.io.cloud.PathOutputCommitProtocol
Meharji Arumilli
-
2022/12/29
Re: Profiling data quality with Spark
Chitral Verma
-
2022/12/28
Re: Profiling data quality with Spark
infa elance
-
2022/12/28
Re: EXT: Re: Check if shuffle is caused for repartitioned pyspark dataframes
Vibhor Gupta
-
2022/12/28
Cannot build Apache Spark 3.3.1 with Apache Hive 3.1.2 and Apache Hadoop 3.1.1
שוהם יהודה
-
2022/12/28
Re: Profiling data quality with Spark
vaquar khan
-
2022/12/28
Re: Profiling data quality with Spark
vaquar khan
-
2022/12/28
Re: Profiling data quality with Spark
rajat kumar
-
2022/12/28
Re: [Spark Core] [Advanced] [How-to] How to map any external field to job ids spawned by Spark.
Gourav Sengupta
-
2022/12/28
Re: [Spark Core] [Advanced] [How-to] How to map any external field to job ids spawned by Spark.
Khalid Mammadov
-
2022/12/27
Re: Profiling data quality with Spark
Gourav Sengupta
-
2022/12/27
Re: Profiling data quality with Spark
vaquar khan
-
2022/12/27
Re: Profiling data quality with Spark
ayan guha
-
2022/12/27
Re: Profiling data quality with Spark
Walaa Eldin Moustafa
-
2022/12/27
Re: Profiling data quality with Spark
Sean Owen
-
2022/12/27
Re: Profiling data quality with Spark
Gourav Sengupta
-
2022/12/27
Re: Profiling data quality with Spark
Mich Talebzadeh
-
2022/12/27
Profiling data quality with Spark
rajat kumar
-
2022/12/27
[Spark Core] [Advanced] [How-to] How to map any external field to job ids spawned by Spark.
Dhruv Toshniwal
-
2022/12/27
Re: spark-submit fails in kubernetes 1.24.x cluster
Saurabh Gulati
-
2022/12/26
Re: Check if shuffle is caused for repartitioned pyspark dataframes
Shivam Verma
-
2022/12/25
RE: Re: RDD to InputStream
ayuio5799
-
2022/12/23
Re: Check if shuffle is caused for repartitioned pyspark dataframes
Russell Jurney
-
2022/12/23
Re: Check if shuffle is caused for repartitioned pyspark dataframes
Shivam Verma
-
2022/12/23
spark-submit fails in kubernetes 1.24.x cluster
Thimme Gowda TP (Nokia)
-
2022/12/21
Re: [PySpark] Getting the best row from each group
Oliver Ruebenacker
-
2022/12/21
Re: [PySpark] Getting the best row from each group
Mich Talebzadeh
-
2022/12/20
Re: [PySpark] Getting the best row from each group
Artemis User
-
2022/12/20
Re: [PySpark] Getting the best row from each group
Bjørn Jørgensen
-
2022/12/20
Re: [PySpark] Getting the best row from each group
Oliver Ruebenacker
-
2022/12/20
Re: [PySpark] Getting the best row from each group
Oliver Ruebenacker
-
2022/12/20
Re: [PySpark] Getting the best row from each group
Raghavendra Ganesh
-
2022/12/20
Re: [PySpark] Getting the best row from each group
Mich Talebzadeh
-
2022/12/19
Re: [Spark SQL]: unpredictable errors: java.io.IOException: can not read class org.apache.parquet.format.PageHeader
Eric Hanchrow
-
2022/12/19
Re: [PySpark] Getting the best row from each group
Bjørn Jørgensen
-
2022/12/19
Re: [PySpark] Getting the best row from each group
Oliver Ruebenacker
-
2022/12/19
Re: [PySpark] Getting the best row from each group
Bjørn Jørgensen
-
2022/12/19
Re: [PySpark] Getting the best row from each group
Patrick Tucci
-
2022/12/19
Re: [PySpark] Getting the best row from each group
Oliver Ruebenacker
-
2022/12/19
Re: [PySpark] Getting the best row from each group
Sean Owen
-
2022/12/19
Re: [PySpark] Getting the best row from each group
Oliver Ruebenacker
-
2022/12/19
Re: [PySpark] Getting the best row from each group
Mich Talebzadeh
-
2022/12/18
Re: Spark-on-Yarn ClassNotFound Exception
Hariharan
-
2022/12/16
Re: Unable to run Spark Job(3.3.2 SNAPSHOT) with Volcano scheduler in Kubernetes
Gnana Kumar
-
2022/12/16
Re: Unable to run Spark Job(3.3.2 SNAPSHOT) with Volcano scheduler in Kubernetes
Bjørn Jørgensen
-
2022/12/16
Re: Unable to run Spark Job(3.3.2 SNAPSHOT) with Volcano scheduler in Kubernetes
Sean Owen
-
2022/12/16
Re: Unable to run Spark Job(3.3.2 SNAPSHOT) with Volcano scheduler in Kubernetes
Sean Owen
-
2022/12/15
RE: [EXTERNAL] Re: [Spark vulnerability] replace jackson-mapper-asl
haibo.w...@morganstanley.com
-
2022/12/15
Re: [EXTERNAL] Re: [Spark vulnerability] replace jackson-mapper-asl
Sean Owen
-
2022/12/15
UNSUBSCRIBE
prashanth t
-
2022/12/15
Re: Spark-on-Yarn ClassNotFound Exception
scrypso
-
2022/12/15
Re: Query regarding Apache spark version 3.0.1
Sean Owen
-
2022/12/15
Query regarding Apache spark version 3.0.1
Pranav Kumar (EXT)
-
2022/12/14
UNSUBSCRIBE
Agostino Calamita
-
2022/12/14
Re: [EXTERNAL] Re: [Spark vulnerability] replace jackson-mapper-asl
Sean Owen
-
2022/12/14
RE: [EXTERNAL] Re: [Spark vulnerability] replace jackson-mapper-asl
haibo.w...@morganstanley.com
-
2022/12/14
Re: [Spark vulnerability] replace jackson-mapper-asl
Sean Owen
-
2022/12/14
[Spark vulnerability] replace jackson-mapper-asl
haibo.w...@morganstanley.com
-
2022/12/13
Check if shuffle is caused for repartitioned pyspark dataframes
Shivam Verma
-
2022/12/13
Re: Spark-on-Yarn ClassNotFound Exception
Hariharan
-
2022/12/13
UNSUBSCRIBE
yixu2...@163.com
-
2022/12/13
[no subject]
yixu2...@163.com
-
2022/12/13
UNSUBSCRIBE
Joji V J
-
2022/12/13
Re: Can we upload a csv dataset into Hive using SparkSQL?
Artemis User
-
2022/12/13
Re: Spark-on-Yarn ClassNotFound Exception
scrypso
-
2022/12/13
Re: Spark-on-Yarn ClassNotFound Exception
Hariharan
-
2022/12/13
Re: Spark-on-Yarn ClassNotFound Exception
scrypso
-
2022/12/13
Re: Spark-on-Yarn ClassNotFound Exception
Hariharan
-
2022/12/12
UNSUBSCRIBE
Ricardo Sardenberg
-
2022/12/12
How to see user defined variables in spark shell
Salil Surendran
-
2022/12/12
Spark-on-Yarn ClassNotFound Exception
Hariharan