Pulkitg64 commented on PR #15549:
URL: https://github.com/apache/lucene/pull/15549#issuecomment-3770848545

   Sorry, I pasted the wrong profiler output above for the float32 
implementation. This is the correct profiler output for float32:
   
   ```
   39.85%        2457          jdk.incubator.vector.FloatVector#lanewiseTemplate() [Inlined code]
   5.97%         368           jdk.incubator.vector.FloatVector#reduceLanesTemplate() [Inlined code]
   5.43%         335           org.apache.lucene.util.TernaryLongHeap#downHeap() [Inlined code]
   4.61%         284           org.apache.lucene.codecs.lucene90.compressing.Lucene90CompressingStoredFieldsReader$BlockState#doReset() [JIT compiled code]
   3.99%         246           sun.nio.ch.UnixFileDispatcherImpl#force0() [Native code]
   2.74%         169           jdk.internal.misc.ScopedMemoryAccess#loadFromMemorySegmentScopedInternal() [Inlined code]
   1.43%         88            org.apache.lucene.document.StoredField#<init>() [Inlined code]
   1.41%         87            org.apache.lucene.codecs.lucene99.Lucene99HnswVectorsReader#search() [JIT compiled code]
   1.35%         83            java.util.TimSort#binarySort() [JIT compiled code]
   1.12%         69            jdk.incubator.vector.FloatVector#fromArray0Template() [Inlined code]
   1.10%         68            org.apache.lucene.internal.vectorization.Lucene99MemorySegmentFloatVectorScorer#bulkScore() [JIT compiled code]
   0.96%         59            org.apache.lucene.codecs.lucene90.LZ4WithPresetDictCompressionMode$LZ4WithPresetDictDecompressor#decompress() [JIT compiled code]
   0.91%         56            sun.nio.ch.UnixFileDispatcherImpl#read0() [Native code]
   0.86%         53            org.apache.lucene.internal.vectorization.MemorySegmentBulkVectorOps$DotProduct#dotProductBulkImpl() [Interpreted code]
   0.79%         49            org.apache.lucene.util.TernaryLongHeap#upHeap() [Inlined code]
   0.73%         45            org.apache.lucene.util.TernaryLongHeap#updateTop() [JIT compiled code]
   0.68%         42            org.apache.lucene.codecs.lucene90.compressing.StoredFieldsInts#readInts8() [Inlined code]
   0.65%         40            jdk.internal.foreign.MemorySessionImpl#checkValidStateRaw() [Inlined code]
   0.63%         39            sun.nio.fs.UnixNativeDispatcher#open0() [Native code]
   0.62%         38            org.apache.lucene.util.TernaryLongHeap#insertWithOverflow() [Inlined code]
   0.57%         35            java.util.HashMap#resize() [JIT compiled code]
   0.52%         32            org.apache.lucene.util.hnsw.HnswGraphSearcher#searchLevel() [JIT compiled code]
   0.45%         28            org.apache.lucene.util.VectorUtil#normalizeToUnitInterval() [Inlined code]
   0.44%         27            java.lang.Integer#valueOf() [Inlined code]
   0.42%         26            org.apache.lucene.search.TaskExecutor#invokeAll() [JIT compiled code]
   0.42%         26            jdk.jfr.internal.JVM#emitEvent() [Native code]
   0.42%         26            java.util.zip.Inflater#inflateBytesBytes() [Native code]
   0.42%         26            org.apache.lucene.util.hnsw.RandomVectorScorer$AbstractRandomVectorScorer#ordToDoc() [Inlined code]
   0.42%         26            java.util.TimSort#mergeLo() [JIT compiled code]
   0.41%         25            org.apache.lucene.util.hnsw.NeighborQueue#encode() [Inlined code]
   ```
   
   
   And this is the profiler output for float16:
   
   ```
   66.03%        217295        jdk.internal.vm.vector.VectorSupport$VectorPayload#getPayload() [Inlined code]
   10.08%        33172         jdk.incubator.vector.Float16Vector#tOpTemplate() [Inlined code]
   7.90%         25995         jdk.incubator.vector.Float16#valueOf() [Inlined code]
   6.16%         20265         jdk.incubator.vector.Float16Vector#lanewiseTemplate() [Inlined code]
   5.07%         16691         jdk.incubator.vector.Float16#lambda$fma$0() [Inlined code]
   1.54%         5084          jdk.incubator.vector.Float16Vector256#vectorFactory() [Inlined code]
   0.40%         1323          jdk.incubator.vector.Float16#shortBitsToFloat16() [Inlined code]
   0.35%         1160          jdk.incubator.vector.Float16Vector#fromArray0Template() [Inlined code]
   0.30%         988           jdk.internal.vm.vector.VectorSupport#ternaryOp() [JIT compiled code]
   0.23%         757           jdk.jfr.internal.JVM#emitEvent() [Native code]
   0.21%         683           jdk.internal.vm.vector.VectorSupport$VectorPayload#<init>() [Inlined code]
   0.16%         523           jdk.incubator.vector.Float16Vector$$Lambda.0x000000003811dec0#apply() [Inlined code]
   0.15%         506           jdk.incubator.vector.Float16Vector#bOpTemplate() [Inlined code]
   0.10%         340           jdk.incubator.vector.Float16Vector256#vec() [Inlined code]
   0.10%         333           org.apache.lucene.util.TernaryLongHeap#downHeap() [Inlined code]
   0.09%         302           org.apache.lucene.codecs.lucene90.compressing.Lucene90CompressingStoredFieldsReader$BlockState#doReset() [JIT compiled code]
   0.09%         289           jdk.internal.vm.vector.VectorSupport#maybeRebox() [Inlined code]
   0.09%         288           jdk.internal.foreign.MemorySessionImpl#checkValidStateRaw() [Inlined code]
   0.07%         231           sun.nio.ch.UnixFileDispatcherImpl#force0() [Native code]
   0.05%         171           jdk.incubator.vector.Float16Vector#lambda$reductionOperations$1() [Inlined code]
   0.03%         112           jdk.incubator.vector.Float16Vector#rOpTemplate() [Inlined code]
   0.03%         98            sun.nio.fs.UnixNativeDispatcher#open0() [Native code]
   0.02%         82            java.util.TimSort#binarySort() [JIT compiled code]
   0.02%         79            jdk.internal.misc.ScopedMemoryAccess#loadFromMemorySegmentScopedInternal() [Inlined code]
   0.02%         77            jdk.internal.vm.vector.VectorSupport#binaryOp() [JIT compiled code]
   0.02%         73            sun.nio.ch.UnixFileDispatcherImpl#read0() [Native code]
   0.02%         60            org.apache.lucene.codecs.lucene90.compressing.StoredFieldsInts#readInts8() [Inlined code]
   0.02%         59            org.apache.lucene.codecs.lucene99.Lucene99HnswVectorsReader#search() [JIT compiled code]
   0.02%         56            org.apache.lucene.util.TernaryLongHeap#upHeap() [Inlined code]
   0.02%         52            org.apache.lucene.util.hnsw.RandomVectorScorer#bulkScore() [Inlined code]
   ```
   
   

