[ 
https://issues.apache.org/jira/browse/IMPALA-12449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17766519#comment-17766519
 ] 

Michael Smith commented on IMPALA-12449:
----------------------------------------

Joe and I are reviewing the change at https://gerrit.cloudera.org/c/19954/.

> SSE2NEON is slow compared to native NEON code
> ---------------------------------------------
>
>                 Key: IMPALA-12449
>                 URL: https://issues.apache.org/jira/browse/IMPALA-12449
>             Project: IMPALA
>          Issue Type: Bug
>    Affects Versions: Impala 4.3.0
>            Reporter: Sebastian Pop
>            Assignee: Sebastian Pop
>            Priority: Critical
>             Fix For: Impala 4.3.0
>
>
> The port to arm64 uses https://github.com/DLTcollab/sse2neon, a tool that
> automatically converts x86_64 intrinsics to arm64 NEON.
> Because some x86_64 vector instructions have no one-to-one translation to
> NEON, sse2neon falls back to a sequential emulation of the missing
> instructions.
> To avoid those slow paths, we need to port every code path that uses
> x86_64 intrinsics to native NEON instructions.
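
As an illustration of the kind of rewrite described above, here is a minimal sketch, not taken from the Impala change under review. It uses an operation such as _mm_movemask_epi8, which is a single instruction on x86_64 but has no single-instruction NEON equivalent. The helper name FindByte16 is hypothetical, and the code assumes GCC or Clang (for the __builtin_ctz family) and a little-endian target. The arm64 branch avoids building an x86-style mask and uses a narrowing shift that yields 4 mask bits per input byte.

```cpp
#include <cstddef>
#include <cstdint>

#if defined(__aarch64__)
#include <arm_neon.h>
#elif defined(__SSE2__)
#include <emmintrin.h>
#endif

// Hypothetical helper: returns the index of the first byte in a 16-byte
// block that equals `needle`, or 16 if no byte matches.
inline int FindByte16(const uint8_t* block, uint8_t needle) {
#if defined(__aarch64__)
  // Native NEON path. Each matching byte becomes 0xFF, others 0x00.
  uint8x16_t eq = vceqq_u8(vld1q_u8(block), vdupq_n_u8(needle));
  // Shift each 16-bit lane right by 4 and narrow to 8 bits. The result
  // packs one nibble per input byte into a 64-bit value.
  uint8x8_t narrowed = vshrn_n_u16(vreinterpretq_u16_u8(eq), 4);
  uint64_t mask = vget_lane_u64(vreinterpret_u64_u8(narrowed), 0);
  // Four mask bits per byte, so divide the bit position by 4.
  return mask == 0 ? 16 : __builtin_ctzll(mask) >> 2;
#elif defined(__SSE2__)
  // x86_64 path: one mask bit per byte via a single movemask.
  __m128i data = _mm_loadu_si128(reinterpret_cast<const __m128i*>(block));
  __m128i eq = _mm_cmpeq_epi8(data, _mm_set1_epi8(static_cast<char>(needle)));
  int mask = _mm_movemask_epi8(eq);
  return mask == 0 ? 16 : __builtin_ctz(static_cast<unsigned>(mask));
#else
  // Portable scalar fallback for other targets.
  for (int i = 0; i < 16; ++i) {
    if (block[i] == needle) return i;
  }
  return 16;
#endif
}
```

Both vector branches return the same index for the same input. Only the mask layout differs, which is why a hand-written NEON path can avoid reproducing the exact x86 mask.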



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
