Pulkitg64 commented on PR #15549:
URL: https://github.com/apache/lucene/pull/15549#issuecomment-3719887908

   Here is the difference in profiler output between the float16 and float32 
benchmark runs with no quantization. The comparison below clearly shows that 
the additional latency in the float16 run comes from reading the float16 
vectors.
   
   <img width="2419" height="877" alt="Screenshot 2026-01-07 at 12 11 15 PM" 
src="https://github.com/user-attachments/assets/a5511fd0-8ab9-4d0c-86f9-5836eeb4e5b2" />
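
   As a rough illustration of what reading float16 vectors involves compared 
with float32, here is a minimal sketch. It is not the code from this PR: the 
class and method names are hypothetical, and the manual bit conversion only 
stands in for whatever the real reader does. Whether this widening step is the 
hotspot seen in the profile is an assumption.

```java
import java.nio.ByteBuffer;

// Hypothetical sketch, not Lucene code: contrasts reading float32 values
// directly with reading float16 values that must be widened one by one.
public class Float16ReadSketch {

    // Widen one IEEE 754 binary16 value (stored in a short) to a float.
    static float halfToFloat(short h) {
        int bits = h & 0xFFFF;
        int sign = (bits & 0x8000) << 16;
        int exp = (bits >>> 10) & 0x1F;
        int mant = bits & 0x3FF;
        if (exp == 0x1F) {
            // Infinity or NaN: keep the payload, use the float32 max exponent.
            return Float.intBitsToFloat(sign | 0x7F800000 | (mant << 13));
        }
        if (exp == 0) {
            if (mant == 0) {
                return Float.intBitsToFloat(sign); // signed zero
            }
            // Subnormal half: value is mant * 2^-24.
            float v = mant * (1.0f / 16777216.0f);
            return sign != 0 ? -v : v;
        }
        // Normal number: rebias the exponent (127 - 15 = 112), widen the mantissa.
        return Float.intBitsToFloat(sign | ((exp + 112) << 23) | (mant << 13));
    }

    // float32 path: each element is read as-is.
    static float[] readFloat32(ByteBuffer buf, int dim) {
        float[] out = new float[dim];
        for (int i = 0; i < dim; i++) {
            out[i] = buf.getFloat();
        }
        return out;
    }

    // float16 path: half the bytes, but every element needs a conversion.
    static float[] readFloat16(ByteBuffer buf, int dim) {
        float[] out = new float[dim];
        for (int i = 0; i < dim; i++) {
            out[i] = halfToFloat(buf.getShort());
        }
        return out;
    }
}
```

   The float16 path reads half as many bytes but does extra work per element, 
which is one way a read path like this could show up as additional time in a 
profile.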
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


