Re: [x265] [PATCH] copy_cnt: enable avx2 version of asm code

2014-09-11 Thread Deepthi Nandakumar
Would be better to combine this asm enable with the corresponding asm patch itself. I have pushed copy_cnt8, and enabled only that for now. On Wed, Sep 10, 2014 at 3:28 PM, prav...@multicorewareinc.com wrote: # HG changeset patch # User Praveen Tiwari # Date 1410343073 -19800 # Node ID

Re: [x265] [PATCH] copy_cnt: enable avx2 version of asm code

2014-09-11 Thread Praveen Tiwari
You can push 16x16 and 32x32 as well; they perform well but need a bit more improvement. I will be sending an improvement patch soon. Regards, Praveen Tiwari On Thu, Sep 11, 2014 at 11:29 AM, Deepthi Nandakumar deep...@multicorewareinc.com wrote: Would be better to combine this asm

Re: [x265] [PATCH] copy_cnt: enable avx2 version of asm code

2014-09-11 Thread Deepthi Nandakumar
Ok, thanks. On Thu, Sep 11, 2014 at 11:47 AM, Praveen Tiwari prav...@multicorewareinc.com wrote: You can push 16x16 and 32x32 as well; they perform well but need a bit more improvement. I will be sending an improvement patch soon. Regards, Praveen Tiwari On Thu, Sep 11, 2014 at

[x265] [PATCH] removed copy_cnt_4 avx2 asm code: SSE version is equally fast

2014-09-11 Thread praveen
# HG changeset patch # User Praveen Tiwari # Date 1410433904 -19800 # Node ID 5740ec22db67267bfca97fbba07ef9239802d2b0 # Parent 012f315d3eda8044f5a49865e15ba2943fbab094 removed copy_cnt_4 avx2 asm code: SSE version is equally fast diff -r 012f315d3eda -r 5740ec22db67

[x265] [PATCH] search: measure RDO of intra modes within 25% of least cost [CHANGES OUTPUTS]

2014-09-11 Thread Steve Borho
# HG changeset patch # User Steve Borho st...@borho.org # Date 1410345355 -7200 # Wed Sep 10 12:35:55 2014 +0200 # Node ID 2fb709fbf2d85caae68db9dd6574ba3e6f52d99f # Parent 012f315d3eda8044f5a49865e15ba2943fbab094 search: measure RDO of intra modes within 25% of least cost [CHANGES OUTPUTS]

Re: [x265] [PATCH] removed copy_cnt_4 avx2 asm code: SSE version is equally fast

2014-09-11 Thread Praveen Tiwari
Ignore it; I need to correct the commit message. Regards, Praveen Tiwari On Thu, Sep 11, 2014 at 4:41 PM, prav...@multicorewareinc.com wrote: # HG changeset patch # User Praveen Tiwari # Date 1410433904 -19800 # Node ID 5740ec22db67267bfca97fbba07ef9239802d2b0 # Parent

[x265] [PATCH] copy_cnt_4 avx2 asm code: nit, same speedup by sse version

2014-09-11 Thread praveen
# HG changeset patch # User Praveen Tiwari # Date 1410437024 -19800 # Node ID 9241634204a12babf8a2a90dc4f776646a9ddfb3 # Parent c7d9506fe681e29b6315bf42ac898b46adb9b6a5 copy_cnt_4 avx2 asm code: nit, same speedup by sse version diff -r c7d9506fe681 -r 9241634204a1

[x265] [PATCH] copy_cnt_16: avx2 asm code as per new interface, improved 514.32 cycles - 313.66 cycles

2014-09-11 Thread praveen
# HG changeset patch # User Praveen Tiwari # Date 1410442704 -19800 # Node ID df74723eb9a3861f6bba7f33d09a37efe53932a4 # Parent 9241634204a12babf8a2a90dc4f776646a9ddfb3 copy_cnt_16: avx2 asm code as per new interface, improved 514.32 cycles - 313.66 cycles diff -r 9241634204a1 -r df74723eb9a3

[x265] [PATCH 3 of 4] store analysis information in buffers

2014-09-11 Thread sagar
# HG changeset patch # User Sagar Kotecha sa...@multicorewareinc.com # Date 1410443605 -19800 # Thu Sep 11 19:23:25 2014 +0530 # Node ID 089f8764b2ebe7f6eb5f29133cdc62e799e74587 # Parent 707d9fe762d987803aef15760cb4a6ff01965c32 store analysis information in buffers diff -r 707d9fe762d9 -r

[x265] [PATCH 1 of 4] add analysis data structures and param options

2014-09-11 Thread sagar
# HG changeset patch # User Sagar Kotecha sa...@multicorewareinc.com # Date 1410443320 -19800 # Thu Sep 11 19:18:40 2014 +0530 # Node ID 5524e76b14e4b532925cd094da039fe8fa0d2edc # Parent 012f315d3eda8044f5a49865e15ba2943fbab094 add analysis data structures and param options diff -r

[x265] [PATCH 2 of 4] api: introduce x265_alloc_analysis_data and x265_free_analysis_data methods to allocate and free analysis buffers

2014-09-11 Thread sagar
# HG changeset patch # User Sagar Kotecha sa...@multicorewareinc.com # Date 1410443497 -19800 # Thu Sep 11 19:21:37 2014 +0530 # Node ID 707d9fe762d987803aef15760cb4a6ff01965c32 # Parent 5524e76b14e4b532925cd094da039fe8fa0d2edc api: introduce x265_alloc_analysis_data and

[x265] [PATCH] copy_cnt_32: avx2 asm code as per new interface, improved 1521.17 cycles - 934.46 cycles

2014-09-11 Thread praveen
# HG changeset patch # User Praveen Tiwari # Date 141007 -19800 # Node ID 05162453203f955413aec5153a85ccdda1a3f519 # Parent df74723eb9a3861f6bba7f33d09a37efe53932a4 copy_cnt_32: avx2 asm code as per new interface, improved 1521.17 cycles - 934.46 cycles diff -r df74723eb9a3 -r 05162453203f

Re: [x265] [PATCH 1 of 4] add analysis data structures and param options

2014-09-11 Thread Steve Borho
On 09/11, sa...@multicorewareinc.com wrote: # HG changeset patch # User Sagar Kotecha sa...@multicorewareinc.com # Date 1410443320 -19800 # Thu Sep 11 19:18:40 2014 +0530 # Node ID 5524e76b14e4b532925cd094da039fe8fa0d2edc # Parent 012f315d3eda8044f5a49865e15ba2943fbab094 add analysis

Re: [x265] [PATCH 4 of 4] cli: add cli options analysis-mode and analysis-file

2014-09-11 Thread Steve Borho
On 09/11, sa...@multicorewareinc.com wrote: # HG changeset patch # User Sagar Kotecha sa...@multicorewareinc.com # Date 1410443668 -19800 # Thu Sep 11 19:24:28 2014 +0530 # Node ID 2429bf7aa08dbcded7979833b7c01614deb076fd # Parent 089f8764b2ebe7f6eb5f29133cdc62e799e74587 cli: add cli

[x265] [PATCH] search: measure RDO of intra modes within 25% of least cost [CHANGES OUTPUTS]

2014-09-11 Thread Steve Borho
# HG changeset patch # User Steve Borho st...@borho.org # Date 1410345355 -7200 # Wed Sep 10 12:35:55 2014 +0200 # Node ID b35ff208d7f8f85ca4bb0c65c4a562c0e9c24100 # Parent 9637c6ae8294d8f74981d01a1c1e216196376e69 search: measure RDO of intra modes within 25% of least cost [CHANGES OUTPUTS]

Re: [x265] [PATCH] copy_cnt_16: avx2 asm code as per new interface, improved 514.32 cycles - 313.66 cycles

2014-09-11 Thread chen
At 2014-09-11 21:38:32,prav...@multicorewareinc.com wrote: # HG changeset patch # User Praveen Tiwari # Date 1410442704 -19800 # Node ID df74723eb9a3861f6bba7f33d09a37efe53932a4 # Parent 9241634204a12babf8a2a90dc4f776646a9ddfb3 copy_cnt_16: avx2 asm code as per new interface, improved 514.32

[x265] [PATCH] Resolved gcc compiler error of mismatched type

2014-09-11 Thread dtyx265
# HG changeset patch # User David T Yuen dtyx...@gmail.com # Date 1410481540 25200 # Node ID 37b199156dfc27874205d6d7fadb71a00e3257d9 # Parent 7e29b10982d2eb7fd79f581d6f04184522ba Resolved gcc compiler error of mismatched type diff -r 7e29b10982d2 -r 37b199156dfc source/x265.cpp ---