[jira] [Created] (SPARK-21621) Reset numRecordsWritten after DiskBlockObjectWriter.commitAndGet called

2017-08-03 Thread Xianyang Liu (JIRA)
Xianyang Liu created SPARK-21621: Summary: Reset numRecordsWritten after DiskBlockObjectWriter.commitAndGet called Key: SPARK-21621 URL: https://issues.apache.org/jira/browse/SPARK-21621 Project:

[jira] [Updated] (SPARK-21455) RpcFailure should be call on RpcResponseCallback.onFailure

2017-07-18 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-21455: - Description: Currently, when there is a `RpcFailure` need be sent back to client, we call

[jira] [Updated] (SPARK-21455) RpcFailure should be call on RpcResponseCallback.onFailure

2017-07-18 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-21455: - Description: Currently, when there is a `RpcFailure` need be sent back to client, we call

[jira] [Updated] (SPARK-21455) RpcFailure should be call on RpcResponseCallback.onFailure

2017-07-18 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-21455: - Description: Currently, when there is a `RpcFailure` need be sent back to client, we call

[jira] [Created] (SPARK-21455) RpcFailure should be call on RpcResponseCallback.onFailure

2017-07-18 Thread Xianyang Liu (JIRA)
Xianyang Liu created SPARK-21455: Summary: RpcFailure should be call on RpcResponseCallback.onFailure Key: SPARK-21455 URL: https://issues.apache.org/jira/browse/SPARK-21455 Project: Spark

[jira] [Updated] (SPARK-21455) RpcFailure should be call on RpcResponseCallback.onFailure

2017-07-18 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-21455: - Description: Currently, when there is a `RpcFailure` need be sent back to client, we call

[jira] [Updated] (SPARK-21455) RpcFailure should be call on RpcResponseCallback.onFailure

2017-07-18 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-21455: - Description: Currently, when there is a `RpcFailure` need be sent back to client, we call

[jira] [Updated] (SPARK-21455) RpcFailure should be call on RpcResponseCallback.onFailure

2017-07-18 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-21455: - Description: Currently, when there is a `RpcFailure` need be sent back to client, we call

[jira] [Updated] (SPARK-21455) RpcFailure should be call on RpcResponseCallback.onFailure

2017-07-18 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-21455: - Description: Currently, when there is a `RpcFailure` need be sent back to client, we call

[jira] [Created] (SPARK-22068) Reduce the duplicate code between putIteratorAsValues and putIteratorAsBytes

2017-09-19 Thread Xianyang Liu (JIRA)
Xianyang Liu created SPARK-22068: Summary: Reduce the duplicate code between putIteratorAsValues and putIteratorAsBytes Key: SPARK-22068 URL: https://issues.apache.org/jira/browse/SPARK-22068

[jira] [Updated] (SPARK-22097) Request an accurate memory after we unrolled the block

2017-09-22 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-22097: - Summary: Request an accurate memory after we unrolled the block (was: Call

[jira] [Updated] (SPARK-22097) Request an accurate memory after we unrolled the block

2017-09-22 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-22097: - Description: We only need request bbos.size - unrollMemoryUsedByThisBlock after unrolled the

[jira] [Created] (SPARK-22097) Call serializationStream.close after we requested enough memory

2017-09-21 Thread Xianyang Liu (JIRA)
Xianyang Liu created SPARK-22097: Summary: Call serializationStream.close after we requested enough memory Key: SPARK-22097 URL: https://issues.apache.org/jira/browse/SPARK-22097 Project: Spark

[jira] [Created] (SPARK-22293) Avoid unnecessary traversal in ResolveReferences

2017-10-17 Thread Xianyang Liu (JIRA)
Xianyang Liu created SPARK-22293: Summary: Avoid unnecessary traversal in ResolveReferences Key: SPARK-22293 URL: https://issues.apache.org/jira/browse/SPARK-22293 Project: Spark Issue Type:

[jira] [Updated] (SPARK-21923) Avoid calling reserveUnrollMemoryForThisTask for every record

2017-09-11 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-21923: - Summary: Avoid calling reserveUnrollMemoryForThisTask for every record (was: Avoid call

[jira] [Created] (SPARK-21923) Avoid call reserveUnrollMemoryForThisTask every record

2017-09-05 Thread Xianyang Liu (JIRA)
Xianyang Liu created SPARK-21923: Summary: Avoid call reserveUnrollMemoryForThisTask every record Key: SPARK-21923 URL: https://issues.apache.org/jira/browse/SPARK-21923 Project: Spark Issue

[jira] [Updated] (SPARK-21923) Avoid call reserveUnrollMemoryForThisTask every record

2017-09-05 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-21923: - Description: When Spark persist data to Unsafe memory, we call the method

[jira] [Created] (SPARK-22838) Avoid unnecessary copying of data

2017-12-19 Thread Xianyang Liu (JIRA)
Xianyang Liu created SPARK-22838: Summary: Avoid unnecessary copying of data Key: SPARK-22838 URL: https://issues.apache.org/jira/browse/SPARK-22838 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-22450) Safely register class for mllib

2017-11-05 Thread Xianyang Liu (JIRA)
Xianyang Liu created SPARK-22450: Summary: Safely register class for mllib Key: SPARK-22450 URL: https://issues.apache.org/jira/browse/SPARK-22450 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-22367) Separate the serialization of class and object for iteraor

2017-10-27 Thread Xianyang Liu (JIRA)
Xianyang Liu created SPARK-22367: Summary: Separate the serialization of class and object for iteraor Key: SPARK-22367 URL: https://issues.apache.org/jira/browse/SPARK-22367 Project: Spark

[jira] [Updated] (SPARK-22367) Separate the serialization of class and object for iteraor

2017-10-27 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-22367: - Description: Becuase they are all the same class for an iterator. So there is no need write

[jira] [Created] (SPARK-25386) Don't need to synchronize the IndexShuffleBlockResolver for each writeIndexFileAndCommit

2018-09-09 Thread Xianyang Liu (JIRA)
Xianyang Liu created SPARK-25386: Summary: Don't need to synchronize the IndexShuffleBlockResolver for each writeIndexFileAndCommit Key: SPARK-25386 URL: https://issues.apache.org/jira/browse/SPARK-25386

[jira] [Updated] (SPARK-25386) Don't need to synchronize the IndexShuffleBlockResolver for each writeIndexFileAndCommit call

2018-09-09 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25386?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-25386: - Summary: Don't need to synchronize the IndexShuffleBlockResolver for each

[jira] [Updated] (SPARK-25386) Don't need to synchronize the IndexShuffleBlockResolver for each writeIndexFileAndCommit call

2018-09-09 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25386?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-25386: - Description: Now, we need synchronize the instance of IndexShuffleBlockResolver in order to

[jira] [Created] (SPARK-23707) Fresh 'initRange' name to avoid method name conflicts

2018-03-16 Thread Xianyang Liu (JIRA)
Xianyang Liu created SPARK-23707: Summary: Fresh 'initRange' name to avoid method name conflicts Key: SPARK-23707 URL: https://issues.apache.org/jira/browse/SPARK-23707 Project: Spark Issue

[jira] [Updated] (SPARK-23707) Don't need shuffle exchange with single partition for 'spark.range'

2018-03-21 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-23707: - Description: Just like #20726. There is no need 'Exchange' when `spark.range` produce only one

[jira] [Updated] (SPARK-23707) Don't need shuffle exchange with single partition for 'spark.range'

2018-03-21 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-23707: - Summary: Don't need shuffle exchange with single partition for 'spark.range' (was: Fresh

[jira] [Created] (SPARK-27350) Support create table on data source V2

2019-04-02 Thread Xianyang Liu (JIRA)
Xianyang Liu created SPARK-27350: Summary: Support create table on data source V2 Key: SPARK-27350 URL: https://issues.apache.org/jira/browse/SPARK-27350 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-27319) Filter out dir based on PathFilter before listing them

2019-03-28 Thread Xianyang Liu (JIRA)
Xianyang Liu created SPARK-27319: Summary: Filter out dir based on PathFilter before listing them Key: SPARK-27319 URL: https://issues.apache.org/jira/browse/SPARK-27319 Project: Spark Issue

[jira] [Created] (SPARK-26952) Row count statics should respect the data reported by data source

2019-02-21 Thread Xianyang Liu (JIRA)
Xianyang Liu created SPARK-26952: Summary: Row count statics should respect the data reported by data source Key: SPARK-26952 URL: https://issues.apache.org/jira/browse/SPARK-26952 Project: Spark

[jira] [Created] (SPARK-26763) Using fileStatus cache when filterPartitions

2019-01-28 Thread Xianyang Liu (JIRA)
Xianyang Liu created SPARK-26763: Summary: Using fileStatus cache when filterPartitions Key: SPARK-26763 URL: https://issues.apache.org/jira/browse/SPARK-26763 Project: Spark Issue Type:

[jira] [Created] (SPARK-28751) Imporve java serializer deserialization performance

2019-08-15 Thread Xianyang Liu (JIRA)
Xianyang Liu created SPARK-28751: Summary: Imporve java serializer deserialization performance Key: SPARK-28751 URL: https://issues.apache.org/jira/browse/SPARK-28751 Project: Spark Issue

[jira] [Updated] (SPARK-28751) Imporve java serializer deserialization performance

2019-08-15 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-28751: - Description: Improve the performance of java serializer deserialization by caching the

[jira] [Created] (SPARK-29582) Unify the behavior of pyspark.TaskContext with spark core

2019-10-23 Thread Xianyang Liu (Jira)
Xianyang Liu created SPARK-29582: Summary: Unify the behavior of pyspark.TaskContext with spark core Key: SPARK-29582 URL: https://issues.apache.org/jira/browse/SPARK-29582 Project: Spark

[jira] [Created] (SPARK-29499) Add mapPartitionsWithIndex for RDDBarrier

2019-10-17 Thread Xianyang Liu (Jira)
Xianyang Liu created SPARK-29499: Summary: Add mapPartitionsWithIndex for RDDBarrier Key: SPARK-29499 URL: https://issues.apache.org/jira/browse/SPARK-29499 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-29236) Access 'executorDataMap' out of 'DriverEndpoint' should be protected by lock

2019-09-24 Thread Xianyang Liu (Jira)
Xianyang Liu created SPARK-29236: Summary: Access 'executorDataMap' out of 'DriverEndpoint' should be protected by lock Key: SPARK-29236 URL: https://issues.apache.org/jira/browse/SPARK-29236

[jira] [Created] (SPARK-32899) Support submit application with user-defined cluster manager

2020-09-16 Thread Xianyang Liu (Jira)
Xianyang Liu created SPARK-32899: Summary: Support submit application with user-defined cluster manager Key: SPARK-32899 URL: https://issues.apache.org/jira/browse/SPARK-32899 Project: Spark

[jira] [Updated] (SPARK-32899) Support submit application with user-defined cluster manager

2021-07-01 Thread Xianyang Liu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-32899: - Description: We have supported users to define the customed cluster manager with

[jira] [Created] (SPARK-44930) Deterministic ApplyFunctionExpression should be foldable

2023-08-23 Thread Xianyang Liu (Jira)
Xianyang Liu created SPARK-44930: Summary: Deterministic ApplyFunctionExpression should be foldable Key: SPARK-44930 URL: https://issues.apache.org/jira/browse/SPARK-44930 Project: Spark

[jira] [Updated] (SPARK-44930) Deterministic ApplyFunctionExpression should be foldable

2023-08-23 Thread Xianyang Liu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-44930: - Description: Currently, ApplyFunctionExpression is unfoldable because inherits the default

[jira] [Updated] (SPARK-44930) Deterministic ApplyFunctionExpression should be foldable

2023-08-23 Thread Xianyang Liu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-44930: - Description: Currently, ApplyFunctionExpression is unfoldable because inherits the default

[jira] [Created] (SPARK-44913) DS V2 supports push down V2 UDF that has magic method

2023-08-22 Thread Xianyang Liu (Jira)
Xianyang Liu created SPARK-44913: Summary: DS V2 supports push down V2 UDF that has magic method Key: SPARK-44913 URL: https://issues.apache.org/jira/browse/SPARK-44913 Project: Spark Issue

[jira] [Created] (SPARK-40137) Combines limits after projection

2022-08-18 Thread Xianyang Liu (Jira)
Xianyang Liu created SPARK-40137: Summary: Combines limits after projection Key: SPARK-40137 URL: https://issues.apache.org/jira/browse/SPARK-40137 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-40227) Data Source V2: Support creating table with the duplicate transform with different arguments

2022-08-26 Thread Xianyang Liu (Jira)
Xianyang Liu created SPARK-40227: Summary: Data Source V2: Support creating table with the duplicate transform with different arguments Key: SPARK-40227 URL: https://issues.apache.org/jira/browse/SPARK-40227