[
https://issues.apache.org/jira/browse/IMPALA-12449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17766172#comment-17766172
]
Quanlong Huang commented on IMPALA-12449:
-----------------------------------------
Thanks for reporting this!
Hi [~MikaelSmith], do you want to take this?
> SSE2NEON is slow compared to native NEON code
> ---------------------------------------------
>
> Key: IMPALA-12449
> URL: https://issues.apache.org/jira/browse/IMPALA-12449
> Project: IMPALA
> Issue Type: Bug
> Affects Versions: Impala 4.3.0
> Reporter: Sebastian Pop
> Assignee: Quanlong Huang
> Priority: Critical
> Fix For: Impala 4.3.0
>
>
> The arm64 port uses sse2neon (https://github.com/DLTcollab/sse2neon), a
> library that automatically translates x86_64 intrinsics to arm64 NEON.
> Because some x86_64 vector instructions have no one-to-one NEON
> equivalent, sse2neon falls back to a sequential emulation of the missing
> instructions.
> To avoid those slow paths, we need to rewrite every code path that uses
> x86_64 intrinsics with native NEON instructions.
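
To illustrate the kind of rewrite the description asks for, here is a minimal sketch. It is not taken from the Impala code base, and the function name ContainsByte16 is hypothetical. It assumes a 16-byte membership test: on x86_64 that is commonly written with _mm_cmpeq_epi8 followed by _mm_movemask_epi8, and the movemask step has no single NEON counterpart, so a translated version has to emulate it. A native NEON version can avoid building the mask altogether and reduce the comparison result with a horizontal maximum.

```cpp
#include <cstddef>
#include <cstdint>

#if defined(__aarch64__)
#include <arm_neon.h>
#elif defined(__SSE2__)
#include <emmintrin.h>
#endif

// Hypothetical helper (not an Impala function): returns true if any of the
// 16 bytes starting at `block` equals `target`.
inline bool ContainsByte16(const uint8_t* block, uint8_t target) {
#if defined(__aarch64__)
  // Native NEON: compare all lanes, then take the horizontal maximum.
  // Matching lanes are 0xFF, so a non-zero maximum means at least one match.
  uint8x16_t eq = vceqq_u8(vld1q_u8(block), vdupq_n_u8(target));
  return vmaxvq_u8(eq) != 0;
#elif defined(__SSE2__)
  // x86_64: compare all lanes, then collapse the lane results to a bitmask.
  __m128i v = _mm_loadu_si128(reinterpret_cast<const __m128i*>(block));
  __m128i eq = _mm_cmpeq_epi8(v, _mm_set1_epi8(static_cast<char>(target)));
  return _mm_movemask_epi8(eq) != 0;
#else
  // Portable scalar fallback for other targets.
  for (size_t i = 0; i < 16; ++i) {
    if (block[i] == target) return true;
  }
  return false;
#endif
}
```

Whether a particular code path in Impala benefits from this kind of change would need to be confirmed by benchmarking on arm64 hardware.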