[
https://issues.apache.org/jira/browse/SPARK-23366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-23366:
------------------------------------
Assignee: (was: Apache Spark)
> Improve hot reading path in ReadAheadInputStream
> ------------------------------------------------
>
> Key: SPARK-23366
> URL: https://issues.apache.org/jira/browse/SPARK-23366
> Project: Spark
> Issue Type: Improvement
> Components: Spark Core
> Affects Versions: 2.3.0
> Reporter: Juliusz Sompolski
> Priority: Major
>
> ReadAheadInputStream was introduced in
> [apache/spark#18317|https://github.com/apache/spark/pull/18317] to optimize
> reading spill files from disk.
> However, investigating flamegraphs of profiles from investigating some
> regressed workloads after switch to Spark 2.3, it seems that the hot path of
> reading small amounts of data (like readInt) is inefficient - it involves
> taking locks, and multiple checks.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]