[FFmpeg-devel] [PATCH] Add prefetch for mips

2017-07-12 Thread kaustubh.raste
From: Kaustubh Raste Signed-off-by: Kaustubh Raste --- libavcodec/mips/Makefile|1 + libavcodec/mips/videodsp_init.c | 51 +++ libavcodec/videodsp.c |2 ++
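
For readers unfamiliar with what a prefetch routine does, here is a minimal portable sketch. It is not the code from this patch: the function name `prefetch_rows` is hypothetical, and it uses the GCC/Clang builtin `__builtin_prefetch` where a MIPS-specific version would presumably use a native prefetch instruction. The idea is to touch the start of each upcoming row so the cache can begin fetching it before the pixels are needed.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (assumption, not taken from the patch):
 * hint the cache about the first bytes of each of h rows. */
static void prefetch_rows(const uint8_t *buf, ptrdiff_t stride, int h)
{
    for (int i = 0; i < h; i++) {
        /* GCC/Clang builtin: address, rw (0 = read), locality (0..3).
         * It is only a hint and never modifies memory. */
        __builtin_prefetch(buf + (ptrdiff_t)i * stride, 0, 3);
    }
}
```

A prefetch is purely a performance hint, so the buffer contents are unchanged after the call.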

[FFmpeg-devel] [PATCH] Add prefetch for mips

2017-07-11 Thread kaustubh.raste
From: Kaustubh Raste Signed-off-by: Kaustubh Raste --- libavcodec/mips/Makefile|1 + libavcodec/mips/videodsp_mips.c | 42 +++ libavcodec/videodsp.c |2 ++

[FFmpeg-devel] [PATCH] libavutil/mips: Updated msa generic macros

2017-07-21 Thread kaustubh.raste
From: Kaustubh Raste Reduced msa load-store code. Removed inline asm of GP load-store for 64 bit. Updated variable names in GP load-store macros for naming consistency. Corrected macro descriptions. Signed-off-by: Kaustubh Raste ---

[FFmpeg-devel] [PATCH] libavcodec/mips: Improve avc dequant-idct luma dc msa function

2017-07-28 Thread kaustubh.raste
From: Kaustubh Raste Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264idct_msa.c | 66 +++- 1 file changed, 32 insertions(+), 34 deletions(-) diff --git a/libavcodec/mips/h264idct_msa.c

[FFmpeg-devel] [PATCH] libavcodec/mips: Optimize avc idct 4x4 for msa

2017-07-24 Thread kaustubh.raste
From: Kaustubh Raste Removed memset call and improved performance. Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264idct_msa.c | 104 +++ libavutil/mips/generic_macros_msa.h | 18 ++ 2 files

[FFmpeg-devel] [PATCH] libavcodec/mips: Improve avc idct8 msa function

2017-07-31 Thread kaustubh.raste
From: Kaustubh Raste Replace memset call with msa stores. Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264idct_msa.c |4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/libavcodec/mips/h264idct_msa.c

[FFmpeg-devel] [PATCH] avcodec/mips: Improve hevc sao band filter msa functions

2017-09-15 Thread kaustubh.raste
From: Kaustubh Raste Preload data in band filter 0-8 for better pipeline parallelization. Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevc_lpf_sao_msa.c | 174 ++- libavutil/mips/generic_macros_msa.h

[FFmpeg-devel] [PATCH] avcodec/mips: Improve avc lpf msa functions

2017-09-14 Thread kaustubh.raste
From: Kaustubh Raste Optimize luma intra case by reducing conditional cases. Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264dsp_msa.c | 428 + 1 file changed, 138 insertions(+), 290

[FFmpeg-devel] [PATCH] avcodec/mips: Improve avc mc copy msa functions

2017-09-15 Thread kaustubh.raste
From: Kaustubh Raste Remove loops and unroll as block sizes are known. Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264qpel_msa.c | 81 +--- 1 file changed, 75 insertions(+), 6 deletions(-) diff
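
The "remove loops and unroll" idea can be illustrated with a small scalar sketch. This is not the MSA code from the patch: `copy_8xh` and `copy_8x8` are hypothetical names, and plain 64-bit loads and stores stand in for vector operations. When the block size is fixed, the loop counter and branch disappear and all loads can be issued before the stores.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Generic version: the height is a runtime value, so a loop is needed. */
static void copy_8xh(uint8_t *dst, ptrdiff_t dst_stride,
                     const uint8_t *src, ptrdiff_t src_stride, int height)
{
    for (int y = 0; y < height; y++)
        memcpy(dst + y * dst_stride, src + y * src_stride, 8);
}

/* Specialized version (illustrative assumption): the block is known to be
 * 8x8, so the eight rows are written out explicitly, loads first. */
static void copy_8x8(uint8_t *dst, ptrdiff_t dst_stride,
                     const uint8_t *src, ptrdiff_t src_stride)
{
    uint64_t r0, r1, r2, r3, r4, r5, r6, r7;

    memcpy(&r0, src + 0 * src_stride, 8);
    memcpy(&r1, src + 1 * src_stride, 8);
    memcpy(&r2, src + 2 * src_stride, 8);
    memcpy(&r3, src + 3 * src_stride, 8);
    memcpy(&r4, src + 4 * src_stride, 8);
    memcpy(&r5, src + 5 * src_stride, 8);
    memcpy(&r6, src + 6 * src_stride, 8);
    memcpy(&r7, src + 7 * src_stride, 8);

    memcpy(dst + 0 * dst_stride, &r0, 8);
    memcpy(dst + 1 * dst_stride, &r1, 8);
    memcpy(dst + 2 * dst_stride, &r2, 8);
    memcpy(dst + 3 * dst_stride, &r3, 8);
    memcpy(dst + 4 * dst_stride, &r4, 8);
    memcpy(dst + 5 * dst_stride, &r5, 8);
    memcpy(dst + 6 * dst_stride, &r6, 8);
    memcpy(dst + 7 * dst_stride, &r7, 8);
}
```

Both functions produce the same output for an 8x8 block; the specialized one trades code size for the absence of loop overhead.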

[FFmpeg-devel] [PATCH] avcodec/mips: Unrolled loops avc intra msa functions

2017-09-21 Thread kaustubh.raste
From: Kaustubh Raste Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264pred_msa.c | 318 1 file changed, 158 insertions(+), 160 deletions(-) diff --git a/libavcodec/mips/h264pred_msa.c

[FFmpeg-devel] [PATCH] avcodec/mips: Improve avc chroma horiz mc msa functions

2017-09-21 Thread kaustubh.raste
From: Kaustubh Raste Replace generic with block size specific function. Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264chroma_msa.c | 231 ++ 1 file changed, 133 insertions(+), 98 deletions(-)

[FFmpeg-devel] [PATCH] avcodec/mips: preload data in hevc sao edge 90 degree filter msa functions

2017-09-21 Thread kaustubh.raste
From: Kaustubh Raste Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevc_lpf_sao_msa.c | 181 1 file changed, 122 insertions(+), 59 deletions(-) diff --git a/libavcodec/mips/hevc_lpf_sao_msa.c

[FFmpeg-devel] [PATCH] avcodec/mips: Remove generic func use in hevc non-uni copy mc msa functions

2017-09-21 Thread kaustubh.raste
From: Kaustubh Raste Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevcdsp_msa.c | 168 +++-- 1 file changed, 160 insertions(+), 8 deletions(-) diff --git a/libavcodec/mips/hevcdsp_msa.c

[FFmpeg-devel] [PATCH] avcodec/mips: Fixed rnd_val variable to 6 in hevc uni mc msa functions

2017-09-18 Thread kaustubh.raste
From: Kaustubh Raste Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevc_mc_uni_msa.c | 372 + 1 file changed, 133 insertions(+), 239 deletions(-) diff --git a/libavcodec/mips/hevc_mc_uni_msa.c

[FFmpeg-devel] [PATCH] avcodec/mips: preload data in hevc sao edge 0 degree filter msa functions

2017-09-18 Thread kaustubh.raste
From: Kaustubh Raste Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevc_lpf_sao_msa.c | 232 +--- 1 file changed, 138 insertions(+), 94 deletions(-) diff --git a/libavcodec/mips/hevc_lpf_sao_msa.c

[FFmpeg-devel] [PATCH] avcodec/mips: Unrolled loops and expanded functions in avc put mc 10 & 30 msa functions

2017-09-18 Thread kaustubh.raste
From: Kaustubh Raste Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264qpel_msa.c | 284 +++- 1 file changed, 278 insertions(+), 6 deletions(-) diff --git a/libavcodec/mips/h264qpel_msa.c

[FFmpeg-devel] [PATCH] avcodec/mips: Reduced conditional cases in avc inter lpf msa functions

2017-09-18 Thread kaustubh.raste
From: Kaustubh Raste Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264dsp_msa.c | 274 + 1 file changed, 110 insertions(+), 164 deletions(-) diff --git a/libavcodec/mips/h264dsp_msa.c

[FFmpeg-devel] [PATCH] avcodec/mips: Improve hevc uni-w copy mc msa functions

2017-09-21 Thread kaustubh.raste
From: Kaustubh Raste Load the specific destination bytes instead of MSA load and pack. Pack the data to half word before clipping. Use immediate unsigned saturation for clip to max saving one vector register. Signed-off-by: Kaustubh Raste
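
The register saving mentioned in this message can be modelled in scalar C. This is only a sketch under assumptions: `clip_with_max` and `clip_sat_u` are hypothetical names, and the second function imitates a saturate-unsigned operation whose bit width is an immediate operand (the patch presumably relies on an MSA instruction of that kind). A two-sided clip needs the maximum held as a separate value, while the saturating form encodes the bound in the operation itself.

```c
#include <stdint.h>

/* Clip to [0, max]: the upper bound is a separate operand, which in
 * vector code would occupy a register holding the constant. */
static int16_t clip_with_max(int16_t v, int16_t max)
{
    if (v < 0)
        v = 0;
    if (v > max)
        v = max;
    return v;
}

/* Scalar model (assumption) of "clamp negatives to zero, then saturate
 * the unsigned value to `bits` bits". The bound (1 << bits) - 1 comes
 * from the immediate, so no constant has to be kept around. */
static int16_t clip_sat_u(int16_t v, unsigned bits)
{
    uint16_t u = v < 0 ? 0 : (uint16_t)v;
    uint16_t limit = (uint16_t)((1u << bits) - 1u);

    return (int16_t)(u > limit ? limit : u);
}
```

For 8-bit pixels, `clip_sat_u(v, 8)` gives the same result as `clip_with_max(v, 255)` for every 16-bit input.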

[FFmpeg-devel] [PATCH] avcodec/mips: Cleanup unused functions

2017-10-05 Thread kaustubh.raste
From: Kaustubh Raste Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264qpel_msa.c | 746 libavcodec/mips/hevc_mc_uniw_msa.c | 67 libavcodec/mips/hevcdsp_msa.c | 50 --- 3 files

[FFmpeg-devel] [PATCH] avcodec/mips: Improve hevc lpf msa functions

2017-09-12 Thread kaustubh.raste
From: Kaustubh Raste Separate the filter processing into all-strong, all-weak and strong + weak cases. Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevc_lpf_sao_msa.c | 750 ++-- 1 file changed, 556

[FFmpeg-devel] [PATCH] avcodec/mips: Improve hevc idct msa functions

2017-09-12 Thread kaustubh.raste
From: Kaustubh Raste Align the buffers. Remove redundant constant array. Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevc_idct_msa.c | 255 ++- 1 file changed, 171 insertions(+), 84 deletions(-)

[FFmpeg-devel] [PATCH] avcodec/mips: Removed generic function call in avc intra msa functions

2017-09-25 Thread kaustubh.raste
From: Kaustubh Raste Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264pred_msa.c | 215 +--- 1 file changed, 92 insertions(+), 123 deletions(-) diff --git a/libavcodec/mips/h264pred_msa.c

[FFmpeg-devel] [PATCH] avcodec/mips: Improve avc chroma vert mc msa functions

2017-09-25 Thread kaustubh.raste
From: Kaustubh Raste Replace generic with block size specific function. Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264chroma_msa.c | 237 ++ 1 file changed, 112 insertions(+), 125 deletions(-)

[FFmpeg-devel] [PATCH] avcodec/mips: Improve avc put mc 20, 01 and 03 msa functions

2017-09-26 Thread kaustubh.raste
From: Kaustubh Raste Remove loops and unroll as block sizes are known. Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264qpel_msa.c | 441 +++- 1 file changed, 432 insertions(+), 9 deletions(-)

[FFmpeg-devel] [PATCH] avcodec/mips: Improve avc weighted mc msa functions

2017-09-25 Thread kaustubh.raste
From: Kaustubh Raste Replace generic with block size specific function. Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264dsp_msa.c | 423 ++- libavutil/mips/generic_macros_msa.h | 36 +++ 2

[FFmpeg-devel] [PATCH] avcodec/mips: preload data in hevc sao edge 45 degree filter msa functions

2017-09-25 Thread kaustubh.raste
From: Kaustubh Raste Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevc_lpf_sao_msa.c | 197 1 file changed, 135 insertions(+), 62 deletions(-) diff --git a/libavcodec/mips/hevc_lpf_sao_msa.c

[FFmpeg-devel] [PATCH] avcodec/mips: Improve vp9 idct msa functions

2017-09-04 Thread kaustubh.raste
From: Kaustubh Raste Removed memset calls. Signed-off-by: Kaustubh Raste --- libavcodec/mips/vp9_idct_msa.c | 118 1 file changed, 70 insertions(+), 48 deletions(-) diff --git

[FFmpeg-devel] [PATCH] avcodec/mips: Improve vp9 lpf msa functions

2017-09-04 Thread kaustubh.raste
From: Kaustubh Raste Updated VP9_LPF_FILTER4_4W macro to process on 8 bit data. Replaced VP9_LPF_FILTER4_8W with VP9_LPF_FILTER4_4W. Signed-off-by: Kaustubh Raste --- libavcodec/mips/vp9_lpf_msa.c | 94

[FFmpeg-devel] [PATCH] avcodec/mips: Improve vp9 mc msa functions

2017-09-04 Thread kaustubh.raste
From: Kaustubh Raste Load the specific destination bytes instead of MSA load and pack. Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264qpel_msa.c | 17 +- libavcodec/mips/vp9_mc_msa.c| 759

[FFmpeg-devel] [PATCH] avcodec/mips: Improve avc bi-weighted mc msa functions

2017-10-09 Thread kaustubh.raste
From: Kaustubh Raste Replace generic with block size specific function. Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264dsp_msa.c | 469 +++ libavutil/mips/generic_macros_msa.h |4 + 2 files

[FFmpeg-devel] [PATCH] avcodec/mips: preload data in hevc sao edge 135 degree filter msa functions

2017-10-09 Thread kaustubh.raste
From: Kaustubh Raste Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevc_lpf_sao_msa.c | 194 1 file changed, 132 insertions(+), 62 deletions(-) diff --git a/libavcodec/mips/hevc_lpf_sao_msa.c

[FFmpeg-devel] [PATCH] avcodec/mips: Improve avc put mc 21, 23 and 02 msa functions

2017-10-09 Thread kaustubh.raste
From: Kaustubh Raste Remove loops and unroll as block sizes are known. Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264qpel_msa.c | 1219 ++-- 1 file changed, 802 insertions(+), 417 deletions(-)

[FFmpeg-devel] [PATCH] avcodec/mips: Improve avc chroma hv mc msa functions

2017-10-09 Thread kaustubh.raste
From: Kaustubh Raste Replace generic with block size specific function. Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264chroma_msa.c | 309 -- 1 file changed, 166 insertions(+), 143 deletions(-)

[FFmpeg-devel] [PATCH] avcodec/mips: Improve hevc uni-w horiz mc msa functions

2017-10-09 Thread kaustubh.raste
From: Kaustubh Raste Load the specific destination bytes instead of MSA load and pack. Pack the data to half word before clipping. Use immediate unsigned saturation for clip to max saving one vector register. Signed-off-by: Kaustubh Raste

[FFmpeg-devel] [PATCH] avcodec/mips: Improve avc uni copy mc msa functions

2017-10-09 Thread kaustubh.raste
From: Kaustubh Raste Load the specific bytes instead of MSA load. Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevc_mc_uni_msa.c | 245 +++-- 1 file changed, 100 insertions(+), 145 deletions(-) diff

[FFmpeg-devel] [PATCH] avcodec/mips: Improve avc put mc 11, 31, 13 and 33 msa functions

2017-10-24 Thread kaustubh.raste
From: Kaustubh Raste Remove loops and unroll as block sizes are known. Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264qpel_msa.c | 400 1 file changed, 240 insertions(+), 160 deletions(-)

[FFmpeg-devel] [PATCH] avcodec/mips: Improve hevc bi weighted hv mc msa functions

2017-10-24 Thread kaustubh.raste
From: Kaustubh Raste Use immediate unsigned saturation for clip to max saving one vector register. Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevc_mc_biw_msa.c | 706 ++-

[FFmpeg-devel] [PATCH] avcodec/mips: Improve avc chroma copy and avg vert mc msa functions

2017-10-24 Thread kaustubh.raste
From: Kaustubh Raste Replace generic with block size specific function. Load the specific destination bytes instead of MSA load and pack. Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264chroma_msa.c | 627

[FFmpeg-devel] [PATCH] avcodec/mips: Improve avc avg mc 10, 30, 01 and 03 msa functions

2017-11-05 Thread kaustubh.raste
From: Kaustubh Raste Align the mask buffer to 64 bytes. Load the specific destination bytes instead of MSA load and pack. Remove unused macros and functions. Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264qpel_msa.c | 1269

[FFmpeg-devel] [PATCH] avcodec/mips: Improve hevc bi 4 tap hv mc msa functions

2017-11-05 Thread kaustubh.raste
From: Kaustubh Raste Use global mask buffer for appropriate mask load. Use immediate unsigned saturation for clip to max saving one vector register. Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevc_mc_bi_msa.c | 1140

[FFmpeg-devel] [PATCH] avcodec/mips: Improve hevc bi wgt 4 tap hv mc msa functions

2017-11-05 Thread kaustubh.raste
From: Kaustubh Raste Use global mask buffer for appropriate mask load. Use immediate unsigned saturation for clip to max saving one vector register. Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevc_mc_biw_msa.c | 1396

[FFmpeg-devel] [PATCH] avcodec/mips: Improve hevc non-uni hv mc msa functions

2017-11-06 Thread kaustubh.raste
From: Kaustubh Raste Use mask buffer. Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevcdsp_msa.c | 1478 - 1 file changed, 870 insertions(+), 608 deletions(-) diff --git

[FFmpeg-devel] [PATCH] avcodec/mips: Improve hevc uni 4 tap hv mc msa functions

2017-11-06 Thread kaustubh.raste
From: Kaustubh Raste Use global mask buffer for appropriate mask load. Remove unused macro and table. Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevc_mc_uni_msa.c | 1125 +++-- 1 file changed, 566

[FFmpeg-devel] [PATCH] avcodec/mips: Improve hevc uni weighted 4 tap vt mc msa functions

2017-11-06 Thread kaustubh.raste
From: Kaustubh Raste Use global mask buffer for appropriate mask load. Use immediate unsigned saturation for clip to max saving one vector register. Remove unused macro. Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevc_mc_uniw_msa.c

[FFmpeg-devel] [PATCH] avcodec/mips: Improve hevc non-uni hz and vt mc msa functions

2017-11-09 Thread kaustubh.raste
From: Kaustubh Raste Use mask buffer. Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevcdsp_msa.c | 541 - 1 file changed, 312 insertions(+), 229 deletions(-) diff --git

[FFmpeg-devel] [PATCH] avcodec/mips: cleanup unused macros

2017-11-08 Thread kaustubh.raste
From: Kaustubh Raste Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevc_macros_msa.h | 37 - 1 file changed, 37 deletions(-) diff --git a/libavcodec/mips/hevc_macros_msa.h

[FFmpeg-devel] [PATCH] avcodec/mips: Improve avc avg mc 20, 21 and 23 msa functions

2017-11-02 Thread kaustubh.raste
From: Kaustubh Raste Load the specific destination bytes instead of MSA load and pack. Remove unused macros and functions. Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264qpel_msa.c | 1274 ++-- 1

[FFmpeg-devel] [PATCH] avcodec/mips: Improve hevc bi 4 tap hz and vt mc msa functions

2017-11-02 Thread kaustubh.raste
From: Kaustubh Raste Use global mask buffer for appropriate mask load. Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevc_mc_bi_msa.c | 428 +++--- 1 file changed, 210 insertions(+), 218 deletions(-)

[FFmpeg-devel] [PATCH] avcodec/mips: Improve hevc bi wgt 4 tap hz and vt mc msa functions

2017-11-03 Thread kaustubh.raste
From: Kaustubh Raste Use global mask buffer for appropriate mask load. Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevc_mc_biw_msa.c | 587 - 1 file changed, 247 insertions(+), 340 deletions(-)

[FFmpeg-devel] [PATCH] avcodec/mips: Improve hevc uni 4 tap hz and vt mc msa functions

2017-11-03 Thread kaustubh.raste
From: Kaustubh Raste Use global mask buffer for appropriate mask load. Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevc_mc_uni_msa.c | 509 - 1 file changed, 274 insertions(+), 235 deletions(-)

[FFmpeg-devel] [PATCH] avcodec/mips: Improve hevc uni weighted 4 tap hz mc msa functions

2017-11-03 Thread kaustubh.raste
From: Kaustubh Raste Use global mask buffer for appropriate mask load. Use immediate unsigned saturation for clip to max saving one vector register. Remove unused macro. Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevc_mc_uniw_msa.c

[FFmpeg-devel] [PATCH] avcodec/mips: Improve hevc uni weighted hv mc msa functions

2017-10-31 Thread kaustubh.raste
From: Kaustubh Raste Use immediate unsigned saturation for clip to max saving one vector register. Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevc_macros_msa.h |9 + libavcodec/mips/hevc_mc_uniw_msa.c | 1598

[FFmpeg-devel] [PATCH] avcodec/mips: Improve avc avg mc 02, 12 and 32 msa functions

2017-11-01 Thread kaustubh.raste
From: Kaustubh Raste Remove loops and unroll as block sizes are known. Load the specific destination bytes instead of MSA load and pack. Remove unused macro and functions. Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264qpel_msa.c |

[FFmpeg-devel] [PATCH] avcodec/mips: Improve avc avg mc 22, 11, 31, 13 and 33 msa functions

2017-10-27 Thread kaustubh.raste
From: Kaustubh Raste Remove loops and unroll as block sizes are known. Load the specific destination bytes instead of MSA load and pack. Remove unused macro and functions. Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264qpel_msa.c |

[FFmpeg-devel] [PATCH] avcodec/mips: Improve avc chroma avg hv mc msa functions

2017-10-27 Thread kaustubh.raste
From: Kaustubh Raste Replace generic with block size specific function. Load the specific destination bytes instead of MSA load and pack. Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264chroma_msa.c | 438

[FFmpeg-devel] [PATCH] avcodec/mips: Improve hevc bi hz and hv mc msa functions

2017-10-27 Thread kaustubh.raste
From: Kaustubh Raste Align the mask buffer. Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevc_mc_bi_msa.c | 940 -- 1 file changed, 595 insertions(+), 345 deletions(-) diff --git

[FFmpeg-devel] [PATCH] avcodec/mips: Improve hevc uni vt and hv mc msa functions

2017-10-27 Thread kaustubh.raste
From: Kaustubh Raste Remove unused macro. Signed-off-by: Kaustubh Raste --- libavcodec/mips/hevc_mc_uni_msa.c | 744 + 1 file changed, 499 insertions(+), 245 deletions(-) diff --git