[x265] [PATCH 2 of 2] sao: merge saoCuOrgE3 asm with encoder along with sign asm code integration

2015-01-07 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1420621053 -19800 # Wed Jan 07 14:27:33 2015 +0530 # Node ID 09499a02ddb41b438e4e6de09ddb92b926c6c8e0 # Parent 9ec89f245be8ca4468362cb095172dbc92bd5140 sao: merge saoCuOrgE3 asm with encoder along with sign asm code integration diff -r

[x265] [PATCH 1 of 2] asm: saoCuOrgE3 asm code

2015-01-07 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1420620491 -19800 # Wed Jan 07 14:18:11 2015 +0530 # Node ID 9ec89f245be8ca4468362cb095172dbc92bd5140 # Parent 6cc757f662ed982a2f64122eba8e557d8ef0abba asm: saoCuOrgE3 asm code diff -r 6cc757f662ed -r 9ec89f245be8 source/common/loopfilter.cpp

[x265] [PATCH 0 of 2 ] asm: saoCuOrgE3 asm code

2015-01-07 Thread nabajit
___ x265-devel mailing list x265-devel@videolan.org https://mailman.videolan.org/listinfo/x265-devel

[x265] [PATCH] asm: saoCuOrgE1 asm code

2015-01-07 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1420618463 -19800 # Wed Jan 07 13:44:23 2015 +0530 # Node ID 6cc757f662ed982a2f64122eba8e557d8ef0abba # Parent 357ec738fb0ccaa678ab548629666b118f9f938f asm: saoCuOrgE1 asm code diff -r 357ec738fb0c -r 6cc757f662ed source/common/loopfilter.cpp

[x265] [PATCH] loopfilter: use x265_clip for common clipping operations

2015-01-07 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1420633919 -19800 # Wed Jan 07 18:01:59 2015 +0530 # Node ID ffbe1a8ec09f25208ae6e0506b04589f2299bc73 # Parent ff32d97fe59ce9d8dc04d785c605f44d18dcdcee loopfilter: use x265_clip for common clipping operations diff -r ff32d97fe59c -r

Re: [x265] ASM crash in r6682 on E5200: Probably SSE4 vs. SSSE3 instructions

2014-04-11 Thread Nabajit Deka
Hi, It will be really helpful if you can provide the call stack log. On Fri, Apr 11, 2014 at 11:39 AM, Mario *LigH* Rohkrämer cont...@ligh.dewrote: See also: http://forum.doom9.org/showthread.php?p=1677018#post1677018 ff. Probably some needle in the hay task for Min Chen, the ASM expert?

Re: [x265] ASM crash in r6682 on E5200: Probably SSE4 vs. SSSE3 instructions

2014-04-11 Thread Nabajit Deka
We dont have a system (having support upto SSSE3) to reproduce the issue at our end . So the call stack will be helpful. On Fri, Apr 11, 2014 at 1:49 PM, Nabajit Deka naba...@multicorewareinc.comwrote: Hi, It will be really helpful if you can provide the call stack log. On Fri, Apr 11

[x265] [PATCH] asm: fix build error caused by usage of 64-bit dependent register in Win32 versions

2014-04-01 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1396347344 -19800 # Tue Apr 01 15:45:44 2014 +0530 # Node ID dd189fd26f47dbff79e3f92a5afe25e7c4b6 # Parent 7ce180ca05b373e042d672f103f721d11cf4af7a asm: fix build error caused by usage of 64-bit dependent register in Win32 versions diff

[x265] [PATCH] test bench : Modify chroma_p2s test function to handle csp

2014-03-03 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1393847862 -19800 # Mon Mar 03 17:27:42 2014 +0530 # Node ID 5e6e06b8ec118904ad28a2d703dc9ad7956b4d44 # Parent 6662df480e39c83ab138d831f883d11fc5b052c5 test bench : Modify chroma_p2s test function to handle csp. diff -r 6662df480e39 -r

[x265] [PATCH] test bench : fix for test bench failure, caused by redundant malloc

2014-03-02 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1393826302 -19800 # Mon Mar 03 11:28:22 2014 +0530 # Node ID 6662df480e39c83ab138d831f883d11fc5b052c5 # Parent 288a83d7e28999798859eba6b2f38c952cac7547 test bench : fix for test bench failure, caused by redundant malloc. diff -r 288a83d7e289

[x265] [PATCH] asm: 10bpp code for vertical luma interpolation filters

2014-02-27 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1393501657 -19800 # Thu Feb 27 17:17:37 2014 +0530 # Branch stable # Node ID 452b23cd9f4d7f5785dbebeb02ecd97ad7192a9d # Parent 0a6dd816d2e2b5135e4c6479b5b734c318daf1aa asm: 10bpp code for vertical luma interpolation filters. diff -r

[x265] [PATCH 1 of 2] asm : Add new file for 10bpp asm filter functions

2014-02-25 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1393328083 -19800 # Tue Feb 25 17:04:43 2014 +0530 # Node ID c9236d867a07b18d0e28bd39528a02bf03cf4eda # Parent a36a669d09e89332dd91817afdf139853ba3ad03 asm : Add new file for 10bpp asm filter functions. diff -r a36a669d09e8 -r c9236d867a07

[x265] [PATCH 2 of 2] Enable 10 bpp asm filter functions

2014-02-25 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1393328202 -19800 # Tue Feb 25 17:06:42 2014 +0530 # Node ID 41a3689f2a07fa86568e07aab75dd31dd59da4a8 # Parent c9236d867a07b18d0e28bd39528a02bf03cf4eda Enable 10 bpp asm filter functions diff -r c9236d867a07 -r 41a3689f2a07 source/common/x86

[x265] [PATCH] asm : asm routine for chroma_p2s for 4:4:4 color space format

2014-02-17 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1392641037 -19800 # Mon Feb 17 18:13:57 2014 +0530 # Node ID f5275ca8f2985bb0daf563738e6071b81967c2cd # Parent ce96cdb390fe26aee6effa731e51303c1d9056b0 asm : asm routine for chroma_p2s for 4:4:4 color space format diff -r ce96cdb390fe -r

[x265] [PATCH] testbench : test bench correction for chroma_p2s

2014-02-17 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1392641200 -19800 # Mon Feb 17 18:16:40 2014 +0530 # Node ID 33640c5d8abd33a8d165bb5f32dfab9d478b4c1b # Parent f5275ca8f2985bb0daf563738e6071b81967c2cd testbench : test bench correction for chroma_p2s diff -r f5275ca8f298 -r 33640c5d8abd

[x265] [PATCH] asm : Clean up and minor modifications in pixel_add_ps 16bpp asm functions(4xN)

2014-02-13 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1392291521 -19800 # Thu Feb 13 17:08:41 2014 +0530 # Node ID cce7e1f3d433113dbd3046df3d9ac7a8bb2333f5 # Parent 21832083908f96fa7c7f51f13457837fb0e8c2f9 asm : Clean up and minor modifications in pixel_add_ps 16bpp asm functions(4xN) diff -r

Re: [x265] [PATCH] asm : Clean up and minor modifications in pixel_add_ps 16bpp asm functions(4xN)

2014-02-13 Thread Nabajit Deka
Ok..I will modify the patch and resend.Thanks On Thu, Feb 13, 2014 at 6:26 PM, chen chenm...@163.com wrote: +paddwm2, m3 +CLIPWm2, m0, m1 + +movlps [r0], m2 +movhps [r0 + r1], m2 movh is faster than movlps on Intel CPU

[x265] [PATCH] asm : Clean up and minor modifications in pixel_add_ps 16bpp asm functions(4xN)

2014-02-13 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1392361357 -19800 # Fri Feb 14 12:32:37 2014 +0530 # Node ID 77c2c6bfafe98aef82658a25e21c88652f7e2e54 # Parent 0d033b5677da7c0b00582082c8b00feba3abb9fa asm : Clean up and minor modifications in pixel_add_ps 16bpp asm functions(4xN) diff -r

[x265] [PATCH] asm : Optimisations in blockcopy_sp asm routines(2x4, 2x8, 6x8)

2014-02-11 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1392112254 -19800 # Tue Feb 11 15:20:54 2014 +0530 # Node ID e6e9310bc545a84fd30533fc7739912c55179d17 # Parent 07b5d6b82f5fbcb78ecab12cb8abcf13c78fe552 asm : Optimisations in blockcopy_sp asm routines(2x4, 2x8, 6x8) diff -r 07b5d6b82f5f -r

Re: [x265] [PATCH] asm : Optimisations in blockcopy_sp asm routines(2x4, 2x8, 6x8)

2014-02-11 Thread Nabajit Deka
Yes, it gives small improvements On Tue, Feb 11, 2014 at 4:06 PM, chen chenm...@163.com wrote: similar code, only remove reduce MOV instruction. At 2014-02-11 17:51:09,naba...@multicorewareinc.com wrote: # HG changeset patch # User Nabajit Deka # Date 1392112254 -19800 # Tue Feb 11

[x265] [PATCH] testbench : fix for 16bpp test bench crash

2014-02-04 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1391517315 -19800 # Tue Feb 04 18:05:15 2014 +0530 # Node ID 5a288f3c74c10ef481d62c83ae9ef509439cfbf1 # Parent ff430d39d4280f01dbcd57cfbe3f6f45f4fbe6a1 testbench : fix for 16bpp test bench crash diff -r ff430d39d428 -r 5a288f3c74c1 source/test

[x265] [PATCH] testbench: Added stress test cases for chroma_pp, chroma_ps and chroma_hps filter functions

2014-02-03 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1391425643 -19800 # Mon Feb 03 16:37:23 2014 +0530 # Node ID 89b7060e631754de11577dbd1cab735d0df6df7e # Parent ae56333a326830d07ee2f7a25c2a5939154888bf testbench: Added stress test cases for chroma_pp, chroma_ps and chroma_hps filter functions

[x265] [PATCH] testbench : Fix for random test bench failure caused by pixeladd_ss

2014-01-30 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1391076769 -19800 # Thu Jan 30 15:42:49 2014 +0530 # Branch stable # Node ID 6f4a9d68e0b5bf1e17f8869287b7f8670e2a1095 # Parent 86743912a5b0459645e5aeccd1c35313e3f0af58 testbench : Fix for random test bench failure caused by pixeladd_ss diff -r

[x265] [PATCH] test bench : Added stress test case for luma_pp filter function

2014-01-30 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1391089526 -19800 # Thu Jan 30 19:15:26 2014 +0530 # Node ID d6fd8178649e5c4add3572931948053a975eff42 # Parent e879873ce926a4b58c111b8e9cdd5fb2692bcb54 test bench : Added stress test case for luma_pp filter function diff -r e879873ce926 -r

Re: [x265] [PATCH] asm : saturation bug fix for luma_vss asm routine

2014-01-30 Thread Nabajit Deka
Yes, you can skip these .I need to check these patches once more. On Thu, Jan 30, 2014 at 7:59 PM, Deepthi Nandakumar deep...@multicorewareinc.com wrote: This patch is pending, right Nabajit? I havent pushed the luma_vss /chroma_vss assembly patches or the testbench edits to luma_vss

[x265] [PATCH] asm : saturation bug fix for luma_vss asm routine

2014-01-28 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1390977132 -19800 # Wed Jan 29 12:02:12 2014 +0530 # Node ID a03f9fbd6af6d793af9054c85ee7d281fe447af8 # Parent 8552e8cc1a3c60ddcab85e7421229c9a86d4785f asm : saturation bug fix for luma_vss asm routine. diff -r 8552e8cc1a3c -r a03f9fbd6af6

[x265] [PATCH] asm : saturation bug fix for chroma_vss asm routine

2014-01-28 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1390980467 -19800 # Wed Jan 29 12:57:47 2014 +0530 # Node ID ba8c31037a655ae55e53cee753677f78d56df397 # Parent a03f9fbd6af6d793af9054c85ee7d281fe447af8 asm : saturation bug fix for chroma_vss asm routine. diff -r a03f9fbd6af6 -r ba8c31037a65

[x265] [PATCH] Corrected test bench function for luma_ss

2014-01-28 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1390980992 -19800 # Wed Jan 29 13:06:32 2014 +0530 # Node ID 4c1296020b6d0dcb83fe24eec2f82e155eb95e7c # Parent ba8c31037a655ae55e53cee753677f78d56df397 Corrected test bench function for luma_ss. diff -r ba8c31037a65 -r 4c1296020b6d source/test

[x265] [PATCH] Stress test case for luma_pp

2014-01-27 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1390824412 -19800 # Mon Jan 27 17:36:52 2014 +0530 # Node ID 00c0d3e09e3e2180df8675f0a715b2bb2830b7ef # Parent b59b1e579f78b4c29c0c1491e6198a63ba1d597f Stress test case for luma_pp. diff -r b59b1e579f78 -r 00c0d3e09e3e source/test

[x265] [PATCH] asm : Hook up chroma_hps with encoder

2013-12-11 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1386760292 -19800 # Wed Dec 11 16:41:32 2013 +0530 # Node ID 24c31d265d829d4c7a99049229e9bd87ca745500 # Parent 470737ecdb2e6993d651b9cfe7080341390f5a05 asm : Hook up chroma_hps with encoder. diff -r 470737ecdb2e -r 24c31d265d82 source/Lib

[x265] [PATCH] Add comment for luma_hps and chroma_hps test bench code

2013-12-10 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1386682285 -19800 # Tue Dec 10 19:01:25 2013 +0530 # Node ID 34694c6343dd8c065eb7222105fbf813f04b07a8 # Parent e4c13676c4b5a4702a1b70ca91af242a74f4c1a5 Add comment for luma_hps and chroma_hps test bench code. diff -r e4c13676c4b5 -r

[x265] [PATCH] Bug fix in luma_hps C primitive

2013-12-09 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1386657936 -19800 # Tue Dec 10 12:15:36 2013 +0530 # Node ID eba68a2eedb6a13ceea97fde83c39e13e29e7989 # Parent 285a4d8c42a07d4c3a285c657da609801391c4a2 Bug fix in luma_hps C primitive. diff -r 285a4d8c42a0 -r eba68a2eedb6 source/common

[x265] [PATCH 3 of 4] Test bench code for luma_hps and chroma_hps

2013-12-04 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1386160833 -19800 # Wed Dec 04 18:10:33 2013 +0530 # Node ID b0602bd77013c5c590d788259b2fdd9546374d4d # Parent 8b045312625b2ffe16dd555b6958647f105b906d Test bench code for luma_hps and chroma_hps diff -r 8b045312625b -r b0602bd77013 source

[x265] [PATCH 4 of 4] Function declarations for modified luma_hps and chroma_hps functions

2013-12-04 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1386161019 -19800 # Wed Dec 04 18:13:39 2013 +0530 # Node ID 51151520faa3cbb79af8c9534eeb289d60ac1b95 # Parent b0602bd77013c5c590d788259b2fdd9546374d4d Function declarations for modified luma_hps and chroma_hps functions. diff -r b0602bd77013

[x265] [PATCH 2 of 4] C primitive changes for luma_hps and chroma_hps

2013-12-04 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1386160675 -19800 # Wed Dec 04 18:07:55 2013 +0530 # Node ID 8b045312625b2ffe16dd555b6958647f105b906d # Parent 9440e424c637a46e15a96c03739d645e1dbf8b56 C primitive changes for luma_hps and chroma_hps. diff -r 9440e424c637 -r 8b045312625b

[x265] [PATCH 0 of 4 ] asm : Modifications for luma_hps and chroma_hps for doing extra rows.

2013-12-04 Thread nabajit
___ x265-devel mailing list x265-devel@videolan.org https://mailman.videolan.org/listinfo/x265-devel

[x265] [PATCH 1 of 4] asm : Modifications for luma_hps and chroma_hps(extra rows)

2013-12-04 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1386159854 -19800 # Wed Dec 04 17:54:14 2013 +0530 # Node ID 9440e424c637a46e15a96c03739d645e1dbf8b56 # Parent 9b062eb8124e9fb12bc16e32eab524ba080cf258 asm : Modifications for luma_hps and chroma_hps(extra rows) diff -r 9b062eb8124e -r

[x265] [PATCH 1 of 2] asm : Adding asm routine for idst4

2013-11-29 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1385728044 -19800 # Fri Nov 29 17:57:24 2013 +0530 # Node ID 189377dcf4a43a98f3a217d4db9866799068cb8d # Parent 833d78aaf71edddf774605fefb8912aea3aeced6 asm : Adding asm routine for idst4 diff -r 833d78aaf71e -r 189377dcf4a4 source/common/x86

[x265] [PATCH 0 of 2 ] asm: Adding asm routine for idst4

2013-11-29 Thread nabajit
___ x265-devel mailing list x265-devel@videolan.org https://mailman.videolan.org/listinfo/x265-devel

[x265] [PATCH 0 of 2 ] asm : Adding asm routine for dst4

2013-11-28 Thread nabajit
___ x265-devel mailing list x265-devel@videolan.org https://mailman.videolan.org/listinfo/x265-devel

[x265] [PATCH 1 of 2] asm : Adding asm routine for dst4

2013-11-28 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1385641262 -19800 # Thu Nov 28 17:51:02 2013 +0530 # Node ID cb54626347bc69690c2a6ee2983e57b76314e3e2 # Parent 2ba6c26c9febdc8c57d3014c0cf98d4897d3992d asm : Adding asm routine for dst4. diff -r 2ba6c26c9feb -r cb54626347bc source/common/x86

[x265] [PATCH 2 of 2] Enable dst4 asm

2013-11-28 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1385641369 -19800 # Thu Nov 28 17:52:49 2013 +0530 # Node ID cb674b5caed0fc4c07f16d74ea7d7cb0af946524 # Parent cb54626347bc69690c2a6ee2983e57b76314e3e2 Enable dst4 asm diff -r cb54626347bc -r cb674b5caed0 source/common/vec/dct-ssse3.cpp

[x265] [PATCH 0 of 2 ] asm : Adding asm routine for idct4

2013-11-27 Thread nabajit
___ x265-devel mailing list x265-devel@videolan.org https://mailman.videolan.org/listinfo/x265-devel

[x265] [PATCH 2 of 2] asm: Adding asm routine for idct4

2013-11-27 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1385557692 -19800 # Wed Nov 27 18:38:12 2013 +0530 # Branch stable # Node ID e4206a37c20f531312013d2a5879f6dbb58c05c5 # Parent 648c669afd7476f30e4f432d839b36fbb5390332 asm: Adding asm routine for idct4 diff -r 648c669afd74 -r e4206a37c20f

[x265] [PATCH] Enable the idct4 asm routine

2013-11-27 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1385558054 -19800 # Wed Nov 27 18:44:14 2013 +0530 # Branch stable # Node ID e0400709b4b18ea159179882b9578adbd415fb6c # Parent e4206a37c20f531312013d2a5879f6dbb58c05c5 Enable the idct4 asm routine. diff -r e4206a37c20f -r e0400709b4b1 source

[x265] [PATCH] asm: Correct number of xmm registers for weight_sp routine

2013-11-26 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1385467682 -19800 # Tue Nov 26 17:38:02 2013 +0530 # Node ID 40d314225757b9a6009c98f456bd64d15c169b8c # Parent 491fd3ee6fd11a52f50ba22b39b9e9596b8e7238 asm: Correct number of xmm registers for weight_sp routine. diff -r 491fd3ee6fd1 -r

[x265] [PATCH 2 of 3] Adding constant table used for dct4

2013-11-26 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1385474743 -19800 # Tue Nov 26 19:35:43 2013 +0530 # Node ID f9ebd23af598f6900a878aad5c9b01e7ea654bc9 # Parent cdae16d2ebf3da0df9f7ec6af758bc34f6b2de12 Adding constant table used for dct4 diff -r cdae16d2ebf3 -r f9ebd23af598 source/common/x86

[x265] [PATCH 3 of 3] Adding dct8.asm and dct8.h to CMakeLists

2013-11-26 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1385474836 -19800 # Tue Nov 26 19:37:16 2013 +0530 # Node ID 713e6b21099e37136a1778b8c24e251951f46fd2 # Parent f9ebd23af598f6900a878aad5c9b01e7ea654bc9 Adding dct8.asm and dct8.h to CMakeLists diff -r f9ebd23af598 -r 713e6b21099e source/common

[x265] [PATCH 1 of 3] asm: Adding asm and header files for dct asm primitives

2013-11-26 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1385474627 -19800 # Tue Nov 26 19:33:47 2013 +0530 # Node ID cdae16d2ebf3da0df9f7ec6af758bc34f6b2de12 # Parent 40d314225757b9a6009c98f456bd64d15c169b8c asm: Adding asm and header files for dct asm primitives. diff -r 40d314225757 -r

[x265] [PATCH 0 of 3 ] Adding asm routine , function declaration and function pointer initialization for weight_pp() function.

2013-11-25 Thread nabajit
___ x265-devel mailing list x265-devel@videolan.org https://mailman.videolan.org/listinfo/x265-devel

[x265] [PATCH 2 of 3] Test bench modifications for weight_pp() asm routine

2013-11-25 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1385374525 -19800 # Mon Nov 25 15:45:25 2013 +0530 # Node ID f7422dfb7eef017344b4d974dac641cb00f7f5b7 # Parent 365f90b3b78cd3c91d6f0985b0d467da4a91d95a Test bench modifications for weight_pp() asm routine. diff -r 365f90b3b78c -r f7422dfb7eef

[x265] [PATCH 0 of 3 ] Adding asm routine, function declaration and function pointer initialization for weight_sp() function.

2013-11-25 Thread nabajit
___ x265-devel mailing list x265-devel@videolan.org https://mailman.videolan.org/listinfo/x265-devel

[x265] [PATCH 1 of 3] asm : routine for weight_sp()

2013-11-25 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1385375693 -19800 # Mon Nov 25 16:04:53 2013 +0530 # Node ID 4a5ad44661863551a57ab5a2d38f9e91e4297b7c # Parent 92969306ae85ed2c506d53d709e02f3d98b895f7 asm : routine for weight_sp(). diff -r 92969306ae85 -r 4a5ad4466186 source/common/x86/pixel

[x265] [PATCH] Test bench modifications for weight_sp() asm routine

2013-11-25 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1385378388 -19800 # Mon Nov 25 16:49:48 2013 +0530 # Node ID d2d31d26493438d3b4ee22802bdab085460359a4 # Parent 4a5ad44661863551a57ab5a2d38f9e91e4297b7c Test bench modifications for weight_sp() asm routine diff -r 4a5ad4466186 -r d2d31d264934

[x265] [PATCH] asm : routine for weightUnidirPixel(), for input width in multiples of 16

2013-11-22 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1385116295 -19800 # Fri Nov 22 16:01:35 2013 +0530 # Node ID 18fd67c0d27291012b65bf9c48c675d09b5db1f3 # Parent 5009254d3d3ac92e90b1551444c5eb32ba2f8d31 asm : routine for weightUnidirPixel(), for input width in multiples of 16. diff -r

Re: [x265] [PATCH] asm : routine for weightUnidirPixel(), for input width in multiples of 16

2013-11-22 Thread Nabajit Deka
Please ignore this patch. On Fri, Nov 22, 2013 at 4:02 PM, naba...@multicorewareinc.com wrote: # HG changeset patch # User Nabajit Deka # Date 1385116295 -19800 # Fri Nov 22 16:01:35 2013 +0530 # Node ID 18fd67c0d27291012b65bf9c48c675d09b5db1f3 # Parent

[x265] [PATCH] asm : routine for weightUnidirPixel(), for input width in multiples of 16

2013-11-22 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1385116295 -19800 # Fri Nov 22 16:01:35 2013 +0530 # Node ID 31f6c6e8e965f06825a8b72e5dc42bfb5ce981ff # Parent 5009254d3d3ac92e90b1551444c5eb32ba2f8d31 asm : routine for weightUnidirPixel(), for input width in multiples of 16. diff -r

Re: [x265] [PATCH] asm : routine for weightUnidirPixel(), for input width in multiples of 16

2013-11-22 Thread Nabajit Deka
forget to include the asm-primitive.cpp change, or this this meant to be for review only? On Nov 22, 2013, at 4:49 AM, naba...@multicorewareinc.com wrote: # HG changeset patch # User Nabajit Deka # Date 1385116295 -19800 # Fri Nov 22 16:01:35 2013 +0530 # Node ID

[x265] [PATCH] Adding asm function declarations and initializations for chroma hps filter functions

2013-11-17 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384757804 -19800 # Mon Nov 18 12:26:44 2013 +0530 # Node ID ff7f550d87405fd5a3e2917249c8530a8ea8c624 # Parent e2895ce7bbeb2c3d845fee2578758d0012fa2cb4 Adding asm function declarations and initializations for chroma hps filter functions. diff

[x265] [PATCH 0 of 3 ] Adding C primitive and test bench code for chroma vss filter functions.

2013-11-15 Thread nabajit
___ x265-devel mailing list x265-devel@videolan.org https://mailman.videolan.org/listinfo/x265-devel

[x265] [PATCH 1 of 3] Adding function pointer type array declaration for chroma vss filter functions

2013-11-15 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384514598 -19800 # Fri Nov 15 16:53:18 2013 +0530 # Node ID 98bb7d2f07c89a241d8ba9c1d4f145e1feb62307 # Parent 19b8ab58a80d9a2a1a6ba6103be23c21a424520a Adding function pointer type array declaration for chroma vss filter functions diff -r

[x265] [PATCH 2 of 3] Adding C primitive for chroma vss filter functions

2013-11-15 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384514673 -19800 # Fri Nov 15 16:54:33 2013 +0530 # Node ID 583a306d9d644682c966b7e25a1ddd30c1aa2455 # Parent 98bb7d2f07c89a241d8ba9c1d4f145e1feb62307 Adding C primitive for chroma vss filter functions diff -r 98bb7d2f07c8 -r 583a306d9d64

[x265] [PATCH 3 of 3] Adding test bench code for chroma vss filter functions

2013-11-15 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384515039 -19800 # Fri Nov 15 17:00:39 2013 +0530 # Node ID 7d42727cd87856e593f294ececcac218110d388a # Parent 583a306d9d644682c966b7e25a1ddd30c1aa2455 Adding test bench code for chroma vss filter functions diff -r 583a306d9d64 -r 7d42727cd878

[x265] [PATCH 1 of 2] asm: routines for chroma vss filter functions for all block sizes

2013-11-15 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384515384 -19800 # Fri Nov 15 17:06:24 2013 +0530 # Node ID 9842bb0aab4c3b3a5b241d77cf6436d8bd7e717f # Parent 7d42727cd87856e593f294ececcac218110d388a asm: routines for chroma vss filter functions for all block sizes diff -r 7d42727cd878 -r

[x265] [PATCH 2 of 2] Adding asm function declarations and initializations for chroma vss filter functions

2013-11-15 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384515584 -19800 # Fri Nov 15 17:09:44 2013 +0530 # Node ID 07329593c7024c8fc20127ebe1144dab26b1008c # Parent 9842bb0aab4c3b3a5b241d77cf6436d8bd7e717f Adding asm function declarations and initializations for chroma vss filter functions diff

[x265] [PATCH 0 of 2 ] Adding asm routines , function declarations and function pointer initializations for chroma vss filter functions

2013-11-15 Thread nabajit
___ x265-devel mailing list x265-devel@videolan.org https://mailman.videolan.org/listinfo/x265-devel

[x265] [PATCH 2 of 2] Adding test bench code for luma vss filter functions

2013-11-15 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384517700 -19800 # Fri Nov 15 17:45:00 2013 +0530 # Node ID b918110fd337178a1cf3616989c65a1e0ed14776 # Parent df47be7d93bbec09aa70446cf369fde2f5f1933d Adding test bench code for luma vss filter functions. diff -r df47be7d93bb -r b918110fd337

[x265] [PATCH 1 of 2] asm: routines for luma vss filter functions for all block sizes

2013-11-15 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384517817 -19800 # Fri Nov 15 17:46:57 2013 +0530 # Node ID 351229c80f52d580d24853f64f79e42d47617f87 # Parent b918110fd337178a1cf3616989c65a1e0ed14776 asm: routines for luma vss filter functions for all block sizes. diff -r b918110fd337 -r

[x265] [PATCH 2 of 2] Adding asm function declarations and initializations for luma vss filter functions

2013-11-15 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384517887 -19800 # Fri Nov 15 17:48:07 2013 +0530 # Node ID b72e5604ca2e496d8c1e02bbff2b92c25718dc26 # Parent 351229c80f52d580d24853f64f79e42d47617f87 Adding asm function declarations and initializations for luma vss filter functions. diff

[x265] [PATCH] asm: routines for chroma vps filter functions for 2x4 and 2x8 block sizes

2013-11-14 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384426801 -19800 # Thu Nov 14 16:30:01 2013 +0530 # Node ID 373fa609f3309420e5d5a9b3227d41757d315ac5 # Parent 5683ee5b793cca5956f1e44e4e0bb3d6be70e942 asm: routines for chroma vps filter functions for 2x4 and 2x8 block sizes diff -r

[x265] [PATCH 3 of 3] asm: routines for chroma hps filter functions for 16xN, 24xN and 32xN

2013-11-13 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384333222 -19800 # Wed Nov 13 14:30:22 2013 +0530 # Node ID 491a491669a2058bf42445ace4df5cd8da1b204d # Parent 2561f9984fbfd7d05269d4b48b9bd350216f435c asm: routines for chroma hps filter functions for 16xN, 24xN and 32xN diff -r 2561f9984fbf

[x265] [PATCH] asm: routines for chroma vps filter functions for 6x8 and 12x16 block sizes

2013-11-13 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384339787 -19800 # Wed Nov 13 16:19:47 2013 +0530 # Node ID 31192cf36593bce97071d9f252ca9a2c14ca406d # Parent b5be1a9259e686aa8d0bc9351cb35477c0ab5b0e asm: routines for chroma vps filter functions for 6x8 and 12x16 block sizes. diff -r

[x265] [PATCH] Adding asm function declarations and initializations for chroma vps filter functions

2013-11-13 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384342092 -19800 # Wed Nov 13 16:58:12 2013 +0530 # Node ID 04150fe5cc7ab01728a3b4da8fd8ff95f1bf995f # Parent 919e6bec663a2d46e62b497dcfc2aaedcee49ed8 Adding asm function declarations and initializations for chroma vps filter functions. diff

[x265] [PATCH 1 of 3] asm: routines for chroma vsp filter functions for all block sizes

2013-11-12 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384253490 -19800 # Tue Nov 12 16:21:30 2013 +0530 # Node ID da706d553c882eff32b53969a425e69a17976c2e # Parent fd23a50d6336fc3fef6466c9a8f1baa0e3a2228b asm: routines for chroma vsp filter functions for all block sizes. diff -r fd23a50d6336 -r

Re: [x265] [PATCH 1 of 3] asm: routines for chroma vsp filter functions for all block sizes

2013-11-12 Thread Nabajit Deka
Thanks. I will modify this part in the next commit. On Tue, Nov 12, 2013 at 5:05 PM, chen chenm...@163.com wrote: +;--- +; void interp_4tap_vertical_sp_%1x%2(int16_t *src,

[x265] [PATCH 2 of 2] Adding test bench code for chroma hps filter functions

2013-11-12 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384257859 -19800 # Tue Nov 12 17:34:19 2013 +0530 # Node ID 968f6df6d50f70d2a4cf569a8c0426f65d927b00 # Parent 6c6026676f0f6d6af270ede92bdc66aa0749f0c9 Adding test bench code for chroma hps filter functions. diff -r 6c6026676f0f -r

[x265] [PATCH 0 of 2 ] Adding function pointer initializations and test bench code for chroma hps filter functions.

2013-11-12 Thread nabajit
___ x265-devel mailing list x265-devel@videolan.org https://mailman.videolan.org/listinfo/x265-devel

[x265] [PATCH 0 of 3 ] asm: routines for chroma hps filter functions for all block sizes.

2013-11-12 Thread nabajit
___ x265-devel mailing list x265-devel@videolan.org https://mailman.videolan.org/listinfo/x265-devel

[x265] [PATCH 2 of 3] asm: routines for chroma hps filter functions for 8xN block size

2013-11-12 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384268587 -19800 # Tue Nov 12 20:33:07 2013 +0530 # Node ID 676529ca9c45a14e54a1ad0976252b03fdcd9cd2 # Parent c9851effbce88c9a70f712fbfaf7e83616c5615f asm: routines for chroma hps filter functions for 8xN block size. diff -r c9851effbce8 -r

[x265] [PATCH 3 of 3] asm: routines for chroma hps filter functions for 16xN, 24xN and 32xN

2013-11-12 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384268787 -19800 # Tue Nov 12 20:36:27 2013 +0530 # Node ID 3587c6ea492c0e10403fb07b02c28b4e7fb571d9 # Parent 676529ca9c45a14e54a1ad0976252b03fdcd9cd2 asm: routines for chroma hps filter functions for 16xN, 24xN and 32xN. diff -r 676529ca9c45

[x265] [PATCH 1 of 3] asm: routines for chroma hps filter functions for 2xN, 4xN, 6x8 and 12x16 block sizes

2013-11-12 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384268074 -19800 # Tue Nov 12 20:24:34 2013 +0530 # Node ID c9851effbce88c9a70f712fbfaf7e83616c5615f # Parent 968f6df6d50f70d2a4cf569a8c0426f65d927b00 asm: routines for chroma hps filter functions for 2xN, 4xN, 6x8 and 12x16 block sizes

[x265] [PATCH] Adding asm function declarations and initializations for chroma hps filter functions

2013-11-12 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384269016 -19800 # Tue Nov 12 20:40:16 2013 +0530 # Node ID 286039cf5329a00db7a7788eaadeeaac13d53e48 # Parent 3587c6ea492c0e10403fb07b02c28b4e7fb571d9 Adding asm function declarations and initializations for chroma hps filter functions diff

[x265] [PATCH 1 of 2] Adding function pointer array and C primitive initializations for chroma vps filter functions

2013-11-12 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384269680 -19800 # Tue Nov 12 20:51:20 2013 +0530 # Node ID 8e582fd2d1ff49abdac24debc4c9ffb02c90a7b2 # Parent 286039cf5329a00db7a7788eaadeeaac13d53e48 Adding function pointer array and C primitive initializations for chroma vps filter

[x265] [PATCH 2 of 2] Adding test bench code for chroma vps filter functions

2013-11-12 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384269733 -19800 # Tue Nov 12 20:52:13 2013 +0530 # Node ID 77e94eb8f4dc7d13c195b26eb91737c7a2cc0a07 # Parent 8e582fd2d1ff49abdac24debc4c9ffb02c90a7b2 Adding test bench code for chroma vps filter functions. diff -r 8e582fd2d1ff -r

[x265] [PATCH 0 of 2 ] Adding C primitive initializations and test bench code for chroma vps filter functions.

2013-11-12 Thread nabajit
___ x265-devel mailing list x265-devel@videolan.org https://mailman.videolan.org/listinfo/x265-devel

Re: [x265] [PATCH 3 of 3] asm: routines for chroma hps filter functions for 16xN, 24xN and 32xN

2013-11-12 Thread Nabajit Deka
Function prototype is wrong here, dst should be int16_t *dst. On Tue, Nov 12, 2013 at 8:36 PM, naba...@multicorewareinc.com wrote: # HG changeset patch # User Nabajit Deka # Date 1384268787 -19800 # Tue Nov 12 20:36:27 2013 +0530 # Node ID 3587c6ea492c0e10403fb07b02c28b4e7fb571d9

Re: [x265] [PATCH 2 of 3] asm: routines for chroma hps filter functions for 8xN block size

2013-11-12 Thread Nabajit Deka
Function prototype is wrong here, dst should be int16_t *dst. On Tue, Nov 12, 2013 at 8:36 PM, naba...@multicorewareinc.com wrote: # HG changeset patch # User Nabajit Deka # Date 1384268587 -19800 # Tue Nov 12 20:33:07 2013 +0530 # Node ID 676529ca9c45a14e54a1ad0976252b03fdcd9cd2

[x265] [PATCH] asm: Replaced SSE4 instructions with SSE2 and general purpose instructions for chroma vsp filter functions

2013-11-12 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384325281 -19800 # Wed Nov 13 12:18:01 2013 +0530 # Node ID 017763dc543d091170082eccf7b42a0c47c453ff # Parent c4ca80d19105ccf1ba2ec14dd65915f2820a660d asm: Replaced SSE4 instructions with SSE2 and general purpose instructions for chroma vsp

[x265] [PATCH 2 of 3] Adding asm function declarations for luma vsp filter functions

2013-11-11 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384163071 -19800 # Mon Nov 11 15:14:31 2013 +0530 # Node ID 60d3c9c739261fd3c32555bc9619a581a3ebae38 # Parent 4a1e725c5743d7c137a6885b9cf9710ffe496030 Adding asm function declarations for luma vsp filter functions. diff -r 4a1e725c5743 -r

[x265] [PATCH 3 of 3] Adding function pointer initializations for luma vsp functions

2013-11-11 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384163146 -19800 # Mon Nov 11 15:15:46 2013 +0530 # Node ID 417a4fc8bc99ebbaa018fe1d867a78b9c64d2fa3 # Parent 60d3c9c739261fd3c32555bc9619a581a3ebae38 Adding function pointer initializations for luma vsp functions. diff -r 60d3c9c73926 -r

[x265] [PATCH 1 of 3] asm: routines for luma vsp filter functions for all block sizes

2013-11-11 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384162289 -19800 # Mon Nov 11 15:01:29 2013 +0530 # Node ID 4a1e725c5743d7c137a6885b9cf9710ffe496030 # Parent 7a4e14ca97b4297005053d8aa8cb4da0177691f6 asm: routines for luma vsp filter functions for all block sizes. diff -r 7a4e14ca97b4 -r

[x265] [PATCH 2 of 3] Adding C primitive for luma vsp filter functions

2013-11-10 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384148701 -19800 # Mon Nov 11 11:15:01 2013 +0530 # Node ID 81a8ce1f14e46f4c615e1c3494ae823e09fd131b # Parent 75d491a2a211460d1a9a1e3edcc8b07d10100e7c Adding C primitive for luma vsp filter functions. diff -r 75d491a2a211 -r 81a8ce1f14e4

[x265] [PATCH 1 of 3] Adding function pointer type array definition for luma vsp filter functions

2013-11-10 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1384148432 -19800 # Mon Nov 11 11:10:32 2013 +0530 # Node ID 75d491a2a211460d1a9a1e3edcc8b07d10100e7c # Parent 9d74638c3640679d09264b793afdf3ffc58a9107 Adding function pointer type array definition for luma vsp filter functions. diff -r

[x265] [PATCH] Bug fix for luma vpp asm routines.Also incorporated review comment changes

2013-11-07 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1383838838 -19800 # Thu Nov 07 21:10:38 2013 +0530 # Node ID a56c53581344df95e54f9cda919419f1d1ad0850 # Parent 85002898f5b4308547af6ce464bbdff5f360fa13 Bug fix for luma vpp asm routines.Also incorporated review comment changes. diff -r

[x265] [PATCH 2 of 2] Adding asm function declaration and function pointer initializations for luma hps functions

2013-11-06 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1383732979 -19800 # Wed Nov 06 15:46:19 2013 +0530 # Node ID 3e30ffe85fca96089e5923c04b3628cc747941e8 # Parent 96a46cf4a3b723d58eb8efffbc82acf8055b43f9 Adding asm function declaration and function pointer initializations for luma hps functions

[x265] [PATCH 1 of 2] asm: routines for luma hps filter functions for all block sizes

2013-11-06 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1383732753 -19800 # Wed Nov 06 15:42:33 2013 +0530 # Node ID 96a46cf4a3b723d58eb8efffbc82acf8055b43f9 # Parent bab35592e71ceac541bba5fa34eac9d657dcd7cf asm: routines for luma hps filter functions for all block sizes. diff -r bab35592e71c -r

[x265] [PATCH 0 of 2 ] Adding C primitive and unit test code for luma vps filter functions.

2013-11-05 Thread nabajit
___ x265-devel mailing list x265-devel@videolan.org https://mailman.videolan.org/listinfo/x265-devel

[x265] [PATCH 1 of 2] Adding function pointer array and C primitive for luma hps filter functions

2013-11-05 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1383720945 -19800 # Wed Nov 06 12:25:45 2013 +0530 # Node ID 4745dfc5490381a906c8a87d057a75626f260c41 # Parent 7cdcf1a03d93f8f007d9d29d29ab31513310 Adding function pointer array and C primitive for luma hps filter functions. diff -r

[x265] [PATCH 0 of 3 ] Adding C primitive and test bench code for luma vps functions.

2013-11-04 Thread nabajit
___ x265-devel mailing list x265-devel@videolan.org https://mailman.videolan.org/listinfo/x265-devel

[x265] [PATCH 1 of 3] Adding function pointer type array definition for luma vps filter functions

2013-11-04 Thread nabajit
# HG changeset patch # User Nabajit Deka # Date 1383635718 -19800 # Tue Nov 05 12:45:18 2013 +0530 # Node ID 88a476ed9ed2ccfa2964a15a9f5795b79b99a195 # Parent 686b5b50279715bcfd15af8603e52c59de7d1b40 Adding function pointer type array definition for luma vps filter functions. diff -r

  1   2   >