I just tested 1.0.2-rc2 on EMR-7.8 and can confirm that HUDI-9119 (https://issues.apache.org/jira/browse/HUDI-9119) still persists.
This will critically impact Hudi users on EMR. Could we fix it for the 1.0.2 release?

On Tue, Apr 22, 2025 at 8:39 PM Danny Chan <danny0...@apache.org> wrote:

> -1 on this.
>
> we found the new fg reader just caches the Spark GenericInternalRow in
> records cache, which is 5x larger than the original avro bytes based
> payload records, thus, the records is more prone to spill, the spill
> is kind of a bottleneck of the compaction/regular reader read path,
> the spill causes performance regression actually. We should mark this
> as block of 1.0.2 I think. Also the cache takes a map metadata for
> each record which also takes a lot of memory(the map obj takes a lot
> of memory itself). To address this issue, I have fired a JIRA task:
> https://issues.apache.org/jira/browse/HUDI-9318
>
> Best,
> Danny
>
> Voon <v...@apache.org> wrote on Tue, Apr 22, 2025 at 09:59:
> >
> > Hi everyone,
> >
> > Please review and vote on the release candidate #2 for the version 1.0.2,
> > as follows:
> >
> > [ ] +1, Approve the release
> >
> > [ ] -1, Do not approve the release (please provide specific comments)
> >
> > The complete staging area is available for your review, which includes:
> >
> > * JIRA release notes [1],
> >
> > * the official Apache source release and binary convenience releases to be
> > deployed to dist.apache.org [2], which are signed with the key with
> > fingerprint B8DC892C439CCB5C0CCA3BEA68050B561D9AFB32 [3],
> >
> > * all artifacts to be deployed to the Maven Central Repository [4],
> >
> > * source code tag "1.0.2-rc2" [5],
> >
> > The vote will be open for at least 72 hours. It is adopted by majority
> > approval, with at least 3 PMC affirmative votes.
> >
> > Thanks,
> >
> > Release Manager
> >
> > [1] https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12322822&version=12355558
> >
> > [2] https://dist.apache.org/repos/dist/dev/hudi/hudi-1.0.2-rc2/
> >
> > [3] https://dist.apache.org/repos/dist/release/hudi/KEYS
> >
> > [4] https://repository.apache.org/content/repositories/orgapachehudi-1149/
> >
> > [5] https://github.com/apache/hudi/releases/tag/release-1.0.2-rc2