Hi Loic,
On 8-May-26 00:49, Loic Poulain wrote:
> Add a V4L2 mem-to-mem driver for the Qualcomm Offline Processing Engine
> (OPE). OPE is a memory-to-memory ISP block that converts raw Bayer
> frames to YUV, performing white balance, demosaic, chroma enhancement,
> color correction and downscaling.
>
> The hardware architecture consists of Fetch Engines and Write Engines,
> connected through intermediate pipeline modules for pix processing.
>
> The driver exposes three video nodes per pipeline instance:
> - ope_input: Bayer RAW input (V4L2_BUF_TYPE_VIDEO_OUTPUT_MPLANE)
> - ope_disp_output: YUV output (V4L2_BUF_TYPE_VIDEO_CAPTURE_MPLANE)
> - ope_params: ISP parameters (V4L2_BUF_TYPE_META_OUTPUT)
>
> Hardware features:
> - Stripe-based processing (up to 336 pixels wide per stripe)
> - White balance (CLC_WB)
> - Demosaic / Bayer-to-RGB (CLC_DEMO)
> - RGB-to-YUV conversion (CLC_CHROMA_ENHAN)
> - Color correction matrix (CLC_CC)
> - MN downscaler for chroma and luma planes
>
> Default configuration values are based on public standards such as BT.601.
>
> Processing Model:
> OPE processes frames in stripes of up to 336 pixels. Therefore, frames must
> be split into stripes for processing. Each stripe is configured after the
> previous one has been acquired (double buffered registers). To minimize
> inter-stripe latency, stripe configurations are generated ahead of time.
>
> Signed-off-by: Loic Poulain <[email protected]>
As discussed during our meeting today I have several remarks about
subdev try vs active state issies, propagating settings from one
pad to the next (or not), etc.
Here is a copy of my notes about this which I already shared with you
before our meeting (so nothing new for you here, this is FYI for
the list):
1. Any try operations on either /dev/video# nodes or on /dev/v4l-subdev# node
pads should not rely on active configuration. For subdevs a pristine default
config tied to the fd for the subdev is created by the media-core code
at open() time and this should be used as the sole state for all try
operations on a subdev's pads.
Generally speaking subdev's configuration, even active config should not
depend on the state of other subdevs, Instead making sure sink + source
fmts between subdevs line is done by calling media_pipeline_start() at
stream-on time.
This is why all pad operations get a sd_state passed, this can be used
to access the per fd try state info, and the passed in sd_state will
already be set to the try-state tied to the fd if this is a try
operation.
The sd_state has an array of v4l2_subdev_pad_config-s which contains:
struct v4l2_subdev_pad_config {
struct v4l2_mbus_framefmt format;
struct v4l2_rect crop;
struct v4l2_rect compose;
struct v4l2_fract interval;
};
instead of storing format, crop and compose info of different pads in
the ctx this should be stored in the sd_state.
The driver should provide a v4l2_subdev_internal_ops.init_state
callback to initialize the fresh try-state sd_state allocated
for each fd stemming from an opening of the /dev/v4l-subdev node.
And the driver should call v4l2_subdev_init_finalize() to also
get an active state allocated, which will then also be automatically
passed to subdev ops when the operation is to be applied to
the active state. It looks like a lot of the state which is currently
in ctx should use the sd_state pointer instead.
I expect the media-core core to pass / create a different active
state sd_state struct per subdev for each context once multi-context
support land and for now we can use the single active sd_state created
by calling v4l2_subdev_init_finalize().
2. Related to this I believe that atm you auto propagate set_fmt data
on the ope_input /dev/video# to ope_proc pad 0 fmt (or the current
ctx representation of this). This does not work for try calls, because
try sd_state is per fd. For the try-path I would expect userspace to
do a set_fmt call on ope_proc pad 0 with a format mirroring which it
plans to use when really applying the configuration.
This means that in try mode you should allow a set_fmt on proc subdev
pas0, which currently always gets turned into a get() IIRC.
Once you've moved the input fmt from the ctx to sd_state->pads[0].format
you will need to get the active sd_state on set_fmt on the ope_input
/dev/video# by calling v4l2_subdev_get_locked_active_state() on the
proc subdev, since the set_fmt on the ope_input /dev/video# does
not get passed a sd_state pointer.
3. Similar to the ope_input /dev/video# to ope_proc pad 0 propagation,
for the try stuff automatically propagating info from ope_proc pad 2
to ope_disp pad 0 will also not work. In this case it might be best to
just completely disable auto-propagation and rely on userspace to do
this itself even for the active state. Or you could do the
v4l2_subdev_get_locked_active_state() thing again.
You can check userspace has done this properly by calling
media_pipeline_start() at stream-on time, which ensures that enabled
source -> sink pad links agree on the format on both sides.
You should probably call media_pipeline_start() regardless.
4. The v3 driver limits supported capture formats for the ope_disp_output
node (ope_enum_fmt_vid_cap()) to those matching ctx->proc_mbus_code, but
this not how this supposed to work. For media-controller centric
drivers, userspace is supposed to be able to enumerate all supported output
formats, as well as output formats for a specific mbus-code.
This is controlled by the v4l2_fmtdesc.mbus_code field, if this is 0
then all formats should be returned and if it is set then only formats
which can be outputted for that mbus-code should be returned.
Note: I already have a patch fixing this.
No active driver state like ctx->proc_mbus_code should be queried,
setting a compatible mbus code on the sink pad which is the source
for the /dev/video# capture node is userspace's responsibility.
Also see:
https://www.kernel.org/doc/html/latest/userspace-api/media/v4l/vidioc-enum-fmt.html
"Regardless of the value of the mbus_code field, the enumerated image
formats shall not depend on the active configuration of the video
device or device pipeline."
Note as for the source-pad feeding into the sink-pad lining up fmt /
mbus-code wise, this is also userspace's responsibility and should be
checked by calling media_pipeline_start() at stream-on time.
5. There is some propagating of fmts / selections hapening in ways where
this is causing problems. Propagating of fmts of selections should
only happen within a single subdev (because of try_state being per
subdev) and then it should happen like this:
1. On setting input (OPE_PROC_PAD_SINK_IN) fmt, update OPE_PROC_PAD_SINK_IN
crop, bounding it to the new fmt.
2. On setting OPE_PROC_PAD_SINK_IN crop, including on the update from 1,
update OPE_PROC_PAD_SOURCE compose, bounding it to the crop rect.
3. On setting OPE_PROC_PAD_SOURCE compose, update OPE_PROC_PAD_SOURCE fmt
setting its width + height to the compose rect width + height
4. On setting OPE_PROC_PAD_SOURCE fmt, override passed in width + height
with OPE_PROC_PAD_SOURCE compose rect (maybe just use get()?)
6. There should be no propagation from /dev/video# nodes to pads, instead
a validate_link media-entity op should be used to check userspace has
properly made these line up on stream-start
Regards,
Hans
> ---
> drivers/media/platform/qcom/camss/Kconfig | 18 +
> drivers/media/platform/qcom/camss/Makefile | 3 +
> drivers/media/platform/qcom/camss/camss-isp-ope.c | 3025
> ++++++++++++++++++++
> .../media/platform/qcom/camss/camss-isp-pipeline.c | 11 +
> .../media/platform/qcom/camss/camss-isp-pipeline.h | 2 +
> 5 files changed, 3059 insertions(+)
>
> diff --git a/drivers/media/platform/qcom/camss/Kconfig
> b/drivers/media/platform/qcom/camss/Kconfig
> index
> d77482f3f5eadc65856806b9b237d65ea484f267..aae87e109b336a953a25a7b3bc8ada66b29135ce
> 100644
> --- a/drivers/media/platform/qcom/camss/Kconfig
> +++ b/drivers/media/platform/qcom/camss/Kconfig
> @@ -21,3 +21,21 @@ config VIDEO_QCOM_CAMSS_ISP
>
> This module is selected automatically by drivers that need it.
>
> +config VIDEO_QCOM_CAMSS_OPE
> + tristate "Qualcomm CAMSS Offline Processing Engine (OPE) driver"
> + depends on V4L_PLATFORM_DRIVERS
> + depends on VIDEO_DEV
> + depends on (ARCH_QCOM && IOMMU_DMA) || COMPILE_TEST
> + select MEDIA_CONTROLLER
> + select VIDEO_QCOM_CAMSS_ISP
> + select V4L2_ISP
> + select VIDEOBUF2_DMA_CONTIG
> + select VIDEOBUF2_VMALLOC
> + help
> + Enable support for the Qualcomm Offline Processing Engine (OPE).
> + OPE is a memory-to-memory ISP block that converts raw Bayer frames
> + to YUV, performing white balance, demosaic, chroma enhancement and
> + downscaling. Found on QCM2290 and related SoCs.
> +
> + To compile this driver as a module, choose M here: the module
> + will be called qcom-camss-ope.
> diff --git a/drivers/media/platform/qcom/camss/Makefile
> b/drivers/media/platform/qcom/camss/Makefile
> index
> fba6f34b8d9f70ea258f7ae1a293a8d58d866498..9d47e4f5fe6dc8d168d866813732655eb0c547a6
> 100644
> --- a/drivers/media/platform/qcom/camss/Makefile
> +++ b/drivers/media/platform/qcom/camss/Makefile
> @@ -37,3 +37,6 @@ qcom-camss-isp-objs := camss-isp-bufq.o \
>
> obj-$(CONFIG_VIDEO_QCOM_CAMSS_ISP) += qcom-camss-isp.o
>
> +qcom-camss-ope-objs := camss-isp-ope.o
> +
> +obj-$(CONFIG_VIDEO_QCOM_CAMSS_OPE) += qcom-camss-ope.o
> diff --git a/drivers/media/platform/qcom/camss/camss-isp-ope.c
> b/drivers/media/platform/qcom/camss/camss-isp-ope.c
> new file mode 100644
> index
> 0000000000000000000000000000000000000000..1febe3e7417f90dc59deca47a2c164b97116fec6
> --- /dev/null
> +++ b/drivers/media/platform/qcom/camss/camss-isp-ope.c
> @@ -0,0 +1,3025 @@
> +// SPDX-License-Identifier: GPL-2.0
> +/*
> + * camss-ope.c
> + *
> + * Qualcomm MSM Camera Subsystem - Offline Processing Engine
> + *
> + * Copyright (c) Qualcomm Technologies, Inc. and/or its subsidiaries.
> + */
> +
> +/*
> + * This driver provides driver implementation for the Qualcomm Offline
> + * Processing Engine (OPE). OPE is a memory-to-memory hardware block
> + * designed for image processing on a source frame. Typically, the input
> + * frame originates from the SoC CSI capture path, though not limited to.
> + *
> + * The hardware architecture consists of Fetch Engines and Write Engines,
> + * connected through intermediate pipeline modules:
> + * [FETCH ENGINES] => [Pipeline Modules] => [WRITE ENGINES]
> + *
> + * Current Configuration:
> + * Fetch Engine: One fetch engine is used for Bayer frame input.
> + * Write Engines: Two display write engines for Y and UV planes output.
> + *
> + * Only a subset of the pipeline modules are enabled:
> + * CLC_WB: White balance for channel gain configuration
> + * CLC_DEMO: Demosaic for Bayer to RGB conversion
> + * CLC_CC: Color Correct, coefficient based RGB correction
> + * CLC_CHROMA_ENHAN: for RGB to YUV conversion
> + * CLC_DOWNSCALE*: Downscaling for UV (YUV444 -> YUV422/YUV420) and YUV
> planes
> + *
> + * Default configuration values are based on public standards such as BT.601.
> + *
> + * Processing Model:
> + * OPE processes frames in stripes of up to 336 pixels. Therefore, frames
> must
> + * be split into stripes for processing. Each stripe is configured after the
> + * previous one has been acquired (double buffered registers). To minimize
> + * inter-stripe latency, the stripe configurations are generated ahead of
> time.
> + */
> +
> +#include <linux/bitfield.h>
> +#include <linux/clk.h>
> +#include <linux/completion.h>
> +#include <linux/delay.h>
> +#include <linux/dma-mapping.h>
> +#include <linux/interconnect.h>
> +#include <linux/interrupt.h>
> +#include <linux/io.h>
> +#include <linux/iopoll.h>
> +#include <linux/module.h>
> +#include <linux/of.h>
> +#include <linux/platform_device.h>
> +#include <linux/pm_clock.h>
> +#include <linux/pm_domain.h>
> +#include <linux/pm_opp.h>
> +#include <linux/pm_runtime.h>
> +#include <linux/slab.h>
> +#include <linux/units.h>
> +
> +#include <media/v4l2-device.h>
> +#include <media/media-device.h>
> +#include <media/v4l2-ctrls.h>
> +#include <media/v4l2-event.h>
> +#include <media/v4l2-fh.h>
> +#include <media/v4l2-ioctl.h>
> +#include "camss-isp-bufq.h"
> +#include "camss-isp-pipeline.h"
> +#include "camss-isp-sched.h"
> +#include <media/videobuf2-dma-contig.h>
> +#include <media/videobuf2-vmalloc.h>
> +
> +#include <uapi/linux/camss-config.h>
> +
> +#include "camss-isp-params.h"
> +
> +#define OPE_NAME "qcom-camss-ope"
> +
> +/* Format descriptor */
> +struct ope_fmt {
> + u32 fourcc;
> + unsigned int depth;
> + unsigned int align;
> + unsigned int num_planes;
> + u32 mbus_code;
> +};
> +
> +/* Per-queue format state */
> +struct ope_fmt_state {
> + const struct ope_fmt *fmt;
> + unsigned int width;
> + unsigned int height;
> + struct v4l2_rect crop;
> + unsigned int bytesperline;
> + unsigned int sizeimage;
> + enum v4l2_colorspace colorspace;
> + enum v4l2_xfer_func xfer_func;
> + enum v4l2_ycbcr_encoding ycbcr_enc;
> + enum v4l2_quantization quantization;
> + unsigned int sequence;
> + struct v4l2_fract timeperframe;
> +};
> +
> +/* -------- Register layout -------- */
> +
> +#define OPE_TOP_HW_VERSION 0x000
> +#define OPE_TOP_HW_VERSION_STEP GENMASK(15, 0)
> +#define OPE_TOP_HW_VERSION_REV GENMASK(27, 16)
> +#define OPE_TOP_HW_VERSION_GEN GENMASK(31, 28)
> +#define OPE_TOP_RESET_CMD 0x004
> +#define OPE_TOP_RESET_CMD_HW BIT(0)
> +#define OPE_TOP_RESET_CMD_SW BIT(1)
> +#define OPE_TOP_IRQ_STATUS 0x014
> +#define OPE_TOP_IRQ_MASK 0x018
> +#define OPE_TOP_IRQ_STATUS_RST_DONE BIT(0)
> +#define OPE_TOP_IRQ_STATUS_WE BIT(1)
> +#define OPE_TOP_IRQ_STATUS_FE BIT(2)
> +#define OPE_TOP_IRQ_STATUS_VIOL BIT(3)
> +#define OPE_TOP_IRQ_STATUS_IDLE BIT(4)
> +#define OPE_TOP_IRQ_CLEAR 0x01c
> +#define OPE_TOP_IRQ_CMD 0x024
> +#define OPE_TOP_IRQ_CMD_CLEAR BIT(0)
> +#define OPE_TOP_VIOLATION_STATUS 0x028
> +
> +/* Fetch engine */
> +#define OPE_BUS_RD_INPUT_IF_IRQ_MASK 0x00c
> +#define OPE_BUS_RD_INPUT_IF_IRQ_CLEAR 0x010
> +#define OPE_BUS_RD_INPUT_IF_IRQ_CMD 0x014
> +#define OPE_BUS_RD_INPUT_IF_IRQ_CMD_CLEAR BIT(0)
> +#define OPE_BUS_RD_INPUT_IF_IRQ_STATUS 0x018
> +#define OPE_BUS_RD_INPUT_IF_CMD 0x01c
> +#define OPE_BUS_RD_INPUT_IF_CMD_GO_CMD BIT(0)
> +#define OPE_BUS_RD_CLIENT_0_CORE_CFG 0x050
> +#define OPE_BUS_RD_CLIENT_0_CORE_CFG_EN BIT(0)
> +#define OPE_BUS_RD_CLIENT_0_CCIF_META_DATA 0x054
> +#define OPE_BUS_RD_CLIENT_0_CCIF_MD_PIX_PATTERN GENMASK(7, 2)
> +#define OPE_BUS_RD_CLIENT_0_ADDR_IMAGE 0x058
> +#define OPE_BUS_RD_CLIENT_0_RD_BUFFER_SIZE 0x05c
> +#define OPE_BUS_RD_CLIENT_0_RD_STRIDE 0x060
> +#define OPE_BUS_RD_CLIENT_0_UNPACK_CFG_0 0x064
> +
> +/* Write engines */
> +#define OPE_BUS_WR_INPUT_IF_IRQ_MASK_0 0x018
> +#define OPE_BUS_WR_INPUT_IF_IRQ_MASK_1 0x01c
> +#define OPE_BUS_WR_INPUT_IF_IRQ_CLEAR_0 0x020
> +#define OPE_BUS_WR_INPUT_IF_IRQ_STATUS_0 0x028
> +#define OPE_BUS_WR_INPUT_IF_IRQ_STATUS_0_RUP_DONE BIT(0)
> +#define OPE_BUS_WR_INPUT_IF_IRQ_STATUS_0_BUF_DONE BIT(8)
> +#define OPE_BUS_WR_INPUT_IF_IRQ_STATUS_0_CONS_VIOL BIT(28)
> +#define OPE_BUS_WR_INPUT_IF_IRQ_STATUS_0_VIOL BIT(30)
> +#define OPE_BUS_WR_INPUT_IF_IRQ_STATUS_0_IMG_SZ_VIOL BIT(31)
> +#define OPE_BUS_WR_INPUT_IF_IRQ_CMD 0x030
> +#define OPE_BUS_WR_INPUT_IF_IRQ_CMD_CLEAR BIT(0)
> +#define OPE_BUS_WR_VIOLATION_STATUS 0x064
> +#define OPE_BUS_WR_IMAGE_SIZE_VIOLATION_STATUS 0x070
> +#define OPE_BUS_WR_CLIENT_CFG(c) (0x200 + (c) *
> 0x100)
> +#define OPE_BUS_WR_CLIENT_CFG_EN BIT(0)
> +#define OPE_BUS_WR_CLIENT_CFG_AUTORECOVER BIT(4)
> +#define OPE_BUS_WR_CLIENT_ADDR_IMAGE(c) (0x204
> + (c) * 0x100)
> +#define OPE_BUS_WR_CLIENT_IMAGE_CFG_0(c) (0x20c + (c) *
> 0x100)
> +#define OPE_BUS_WR_CLIENT_IMAGE_CFG_1(c) (0x210 + (c) *
> 0x100)
> +#define OPE_BUS_WR_CLIENT_IMAGE_CFG_2(c) (0x214 + (c) *
> 0x100)
> +#define OPE_BUS_WR_CLIENT_PACKER_CFG(c) (0x218
> + (c) * 0x100)
> +#define OPE_BUS_WR_CLIENT_MAX 4
> +
> +/* Pipeline modules */
> +#define OPE_PP_CLC_WB_GAIN_MODULE_CFG (0x200
> + 0x60)
> +#define OPE_PP_CLC_WB_GAIN_MODULE_CFG_EN BIT(0)
> +#define OPE_PP_CLC_WB_GAIN_WB_CFG(ch) (0x200
> + 0x68 + 4 * (ch))
> +
> +#define OPE_PP_CLC_CC_BASE 0x400
> +#define OPE_PP_CLC_CC_MODULE_CFG
> (OPE_PP_CLC_CC_BASE + 0x60)
> +#define OPE_PP_CLC_CC_MODULE_CFG_EN BIT(0)
> +#define OPE_PP_CLC_CC_COEFF_A_CFG_0
> (OPE_PP_CLC_CC_BASE + 0x68)
> +#define OPE_PP_CLC_CC_COEFF_A_CFG_0_A0 GENMASK(11, 0)
> +#define OPE_PP_CLC_CC_COEFF_A_CFG_0_A1 GENMASK(27, 16)
> +#define OPE_PP_CLC_CC_COEFF_A_CFG_1
> (OPE_PP_CLC_CC_BASE + 0x6c)
> +#define OPE_PP_CLC_CC_COEFF_A_CFG_1_A2 GENMASK(11, 0)
> +#define OPE_PP_CLC_CC_COEFF_B_CFG_0
> (OPE_PP_CLC_CC_BASE + 0x70)
> +#define OPE_PP_CLC_CC_COEFF_B_CFG_0_B0 GENMASK(11, 0)
> +#define OPE_PP_CLC_CC_COEFF_B_CFG_0_B1 GENMASK(27, 16)
> +#define OPE_PP_CLC_CC_COEFF_B_CFG_1
> (OPE_PP_CLC_CC_BASE + 0x74)
> +#define OPE_PP_CLC_CC_COEFF_B_CFG_1_B2 GENMASK(11, 0)
> +#define OPE_PP_CLC_CC_COEFF_C_CFG_0
> (OPE_PP_CLC_CC_BASE + 0x78)
> +#define OPE_PP_CLC_CC_COEFF_C_CFG_0_C0 GENMASK(11, 0)
> +#define OPE_PP_CLC_CC_COEFF_C_CFG_0_C1 GENMASK(27, 16)
> +#define OPE_PP_CLC_CC_COEFF_C_CFG_1
> (OPE_PP_CLC_CC_BASE + 0x7c)
> +#define OPE_PP_CLC_CC_COEFF_C_CFG_1_C2 GENMASK(11, 0)
> +#define OPE_PP_CLC_CC_COEFF_K_CFG_0
> (OPE_PP_CLC_CC_BASE + 0x80)
> +#define OPE_PP_CLC_CC_COEFF_K_CFG_0_K0 GENMASK(12, 0)
> +#define OPE_PP_CLC_CC_COEFF_K_CFG_1
> (OPE_PP_CLC_CC_BASE + 0x84)
> +#define OPE_PP_CLC_CC_COEFF_K_CFG_1_K1 GENMASK(12, 0)
> +#define OPE_PP_CLC_CC_COEFF_K_CFG_2
> (OPE_PP_CLC_CC_BASE + 0x88)
> +#define OPE_PP_CLC_CC_COEFF_K_CFG_2_K2 GENMASK(12, 0)
> +#define OPE_PP_CLC_CC_COEFF_M_CFG
> (OPE_PP_CLC_CC_BASE + 0x8c)
> +#define OPE_PP_CLC_CC_COEFF_M_CFG_M GENMASK(11, 0)
> +
> +#define OPE_PP_CLC_DEMO_MODULE_CFG (0x800 + 0x60)
> +#define OPE_PP_CLC_DEMO_MODULE_CFG_EN BIT(0)
> +#define OPE_PP_CLC_DEMO_MODULE_CFG_DYN_G_CLAMP_EN BIT(4)
> +#define OPE_PP_CLC_DEMO_INTERP_COEFF_CFG (0x800 + 0x68)
> +#define OPE_PP_CLC_DEMO_INTERP_COEFF_CFG_LAMBDA_G GENMASK(15,
> 8)
> +#define OPE_PP_CLC_DEMO_INTERP_COEFF_CFG_LAMBDA_RB GENMASK(7, 0)
> +#define OPE_PP_CLC_DEMO_INTERP_CLASSIFIER_CFG_0 (0x800
> + 0x6c)
> +#define OPE_PP_CLC_DEMO_INTERP_CLASSIFIER_CFG_0_AK GENMASK(15,
> 0)
> +#define OPE_PP_CLC_DEMO_INTERP_CLASSIFIER_CFG_1 (0x800
> + 0x70)
> +#define OPE_PP_CLC_DEMO_INTERP_CLASSIFIER_CFG_1_WK GENMASK(15,
> 0)
> +
> +#define OPE_PP_CLC_DOWNSCALE_MN_DS_C_PRE_BASE 0x1c00
> +#define OPE_PP_CLC_DOWNSCALE_MN_DS_Y_DISP_BASE 0x3000
> +#define OPE_PP_CLC_DOWNSCALE_MN_DS_C_DISP_BASE 0x3200
> +#define OPE_PP_CLC_CROP_RND_CLAMP_Y_DISP_BASE 0x3400
> +#define OPE_PP_CLC_CROP_RND_CLAMP_C_DISP_BASE 0x3600
> +#define OPE_PP_CLC_CROP_RND_CLAMP_HW_STATUS(base) ((base) + 0x04)
> +#define OPE_PP_CLC_CROP_RND_CLAMP_HEIGHT_VIOL BIT(2)
> +#define OPE_PP_CLC_CROP_RND_CLAMP_WIDTH_VIOL BIT(1)
> +#define OPE_PP_CLC_CROP_RND_CLAMP_MODULE_CFG(base) ((base) + 0x60)
> +#define OPE_PP_CLC_CROP_RND_CLAMP_EN BIT(0)
> +#define OPE_PP_CLC_CROP_RND_CLAMP_CROP_EN BIT(9)
> +#define OPE_PP_CLC_CROP_RND_CLAMP_CROP_LINE_CFG(base) ((base)
> + 0x68)
> +#define OPE_PP_CLC_CROP_RND_CLAMP_CROP_PIXEL_CFG(base) ((base)
> + 0x6c)
> +#define OPE_PP_CLC_CROP_RND_CLAMP_CROP_FIRST GENMASK(29, 16)
> +#define OPE_PP_CLC_CROP_RND_CLAMP_CROP_LAST GENMASK(13, 0)
> +#define OPE_PP_CLC_DOWNSCALE_MN_CFG(ds) ((ds) + 0x60)
> +#define OPE_PP_CLC_DOWNSCALE_MN_CFG_EN BIT(0)
> +#define OPE_PP_CLC_DOWNSCALE_MN_DS_CFG(ds) ((ds) + 0x64)
> +#define OPE_PP_CLC_DOWNSCALE_MN_DS_CFG_H_SCALE_EN BIT(9)
> +#define OPE_PP_CLC_DOWNSCALE_MN_DS_CFG_V_SCALE_EN BIT(10)
> +#define OPE_PP_CLC_DOWNSCALE_MN_DS_IMAGE_SIZE_CFG(ds) ((ds) +
> 0x68)
> +#define OPE_PP_CLC_DOWNSCALE_MN_DS_MN_H_CFG(ds) ((ds) + 0x6c)
> +#define OPE_PP_CLC_DOWNSCALE_MN_DS_MN_V_CFG(ds) ((ds) + 0x74)
> +
> +#define OPE_PP_CLC_CHROMA_ENHAN_MODULE_CFG (0x1200 + 0x60)
> +#define OPE_PP_CLC_CHROMA_ENHAN_MODULE_CFG_EN BIT(0)
> +#define OPE_PP_CLC_CHROMA_ENHAN_LUMA_CFG_0 (0x1200 + 0x68)
> +#define OPE_PP_CLC_CHROMA_ENHAN_LUMA_CFG_0_V0 GENMASK(11, 0)
> +#define OPE_PP_CLC_CHROMA_ENHAN_LUMA_CFG_0_V1 GENMASK(27, 16)
> +#define OPE_PP_CLC_CHROMA_ENHAN_LUMA_CFG_1 (0x1200 + 0x6c)
> +#define OPE_PP_CLC_CHROMA_ENHAN_LUMA_CFG_1_K GENMASK(31, 23)
> +#define OPE_PP_CLC_CHROMA_ENHAN_LUMA_CFG_2 (0x1200 + 0x70)
> +#define OPE_PP_CLC_CHROMA_ENHAN_LUMA_CFG_2_V2 GENMASK(11, 0)
> +#define OPE_PP_CLC_CHROMA_ENHAN_COEFF_A_CFG (0x1200 + 0x74)
> +#define OPE_PP_CLC_CHROMA_ENHAN_COEFF_A_CFG_AP GENMASK(11, 0)
> +#define OPE_PP_CLC_CHROMA_ENHAN_COEFF_A_CFG_AM GENMASK(27, 16)
> +#define OPE_PP_CLC_CHROMA_ENHAN_COEFF_B_CFG (0x1200 + 0x78)
> +#define OPE_PP_CLC_CHROMA_ENHAN_COEFF_B_CFG_BP GENMASK(11, 0)
> +#define OPE_PP_CLC_CHROMA_ENHAN_COEFF_B_CFG_BM GENMASK(27, 16)
> +#define OPE_PP_CLC_CHROMA_ENHAN_COEFF_C_CFG (0x1200 + 0x7C)
> +#define OPE_PP_CLC_CHROMA_ENHAN_COEFF_C_CFG_CP GENMASK(11, 0)
> +#define OPE_PP_CLC_CHROMA_ENHAN_COEFF_C_CFG_CM GENMASK(27, 16)
> +#define OPE_PP_CLC_CHROMA_ENHAN_COEFF_D_CFG (0x1200 + 0x80)
> +#define OPE_PP_CLC_CHROMA_ENHAN_COEFF_D_CFG_DP GENMASK(11, 0)
> +#define OPE_PP_CLC_CHROMA_ENHAN_COEFF_D_CFG_DM GENMASK(27, 16)
> +#define OPE_PP_CLC_CHROMA_ENHAN_CHROMA_CFG_0 (0x1200 + 0x84)
> +#define OPE_PP_CLC_CHROMA_ENHAN_CHROMA_CFG_0_KCB GENMASK(31, 21)
> +#define OPE_PP_CLC_CHROMA_ENHAN_CHROMA_CFG_1 (0x1200 + 0x88)
> +#define OPE_PP_CLC_CHROMA_ENHAN_CHROMA_CFG_1_KCR GENMASK(31, 21)
> +
> +/* -------- OPE-specific constants -------- */
> +
> +#define OPE_STRIPE_MAX_W 336
> +#define OPE_STRIPE_MAX_H 8192
> +#define OPE_STRIPE_MIN_W 16
> +#define OPE_STRIPE_MIN_H OPE_STRIPE_MIN_W
> +#define OPE_MAX_STRIPE 16
> +#define OPE_ALIGN_H 1
> +#define OPE_ALIGN_W 1
> +#define OPE_MIN_W 24
> +#define OPE_MIN_H 16
> +#define OPE_MAX_W (OPE_STRIPE_MAX_W * OPE_MAX_STRIPE)
> +#define OPE_MAX_H OPE_STRIPE_MAX_H
> +#define OPE_RESET_TIMEOUT_MS 100
> +#define DEFAULT_FRAMERATE 60
> +
> +/* Downscaler fixed-point helpers */
> +#define Q21(v) (((uint64_t)(v)) << 21)
> +#define DS_Q21(n, d) ((uint32_t)(((uint64_t)(n) << 21) / (d)))
> +#define DS_RESOLUTION(in, out) \
> + (((out) * 128 <= (in)) ? 0x0 : \
> + ((out) * 16 <= (in)) ? 0x1 : \
> + ((out) * 8 <= (in)) ? 0x2 : 0x3)
> +#define DS_OUTPUT_PIX(in, phase_init, phase_step) \
> + ((Q21(in) - (phase_init)) / (phase_step))
> +#define OPE_WB(n, d) (((n) << 10) / (d))
> +
> +enum ope_downscaler {
> + OPE_DS_C_PRE,
> + OPE_DS_C_DISP,
> + OPE_DS_Y_DISP,
> + OPE_DS_MAX,
> +};
> +
> +static const u32 ope_ds_base[OPE_DS_MAX] = {
> + OPE_PP_CLC_DOWNSCALE_MN_DS_C_PRE_BASE,
> + OPE_PP_CLC_DOWNSCALE_MN_DS_C_DISP_BASE,
> + OPE_PP_CLC_DOWNSCALE_MN_DS_Y_DISP_BASE,
> +};
> +
> +enum ope_wr_client {
> + OPE_WR_CLIENT_VID_Y,
> + OPE_WR_CLIENT_VID_C,
> + OPE_WR_CLIENT_DISP_Y,
> + OPE_WR_CLIENT_DISP_C,
> + OPE_WR_CLIENT_MAX,
> +};
> +
> +enum ope_pixel_pattern {
> + OPE_PIXEL_PATTERN_RGRGRG,
> + OPE_PIXEL_PATTERN_GRGRGR,
> + OPE_PIXEL_PATTERN_BGBGBG,
> + OPE_PIXEL_PATTERN_GBGBGB,
> + OPE_PIXEL_PATTERN_YCBYCR,
> + OPE_PIXEL_PATTERN_YCRYCB,
> + OPE_PIXEL_PATTERN_CBYCRY,
> + OPE_PIXEL_PATTERN_CRYCBY,
> +};
> +
> +enum ope_stripe_location {
> + OPE_STRIPE_LOCATION_FULL,
> + OPE_STRIPE_LOCATION_LEFT,
> + OPE_STRIPE_LOCATION_RIGHT,
> + OPE_STRIPE_LOCATION_MIDDLE,
> +};
> +
> +enum ope_unpacker_format {
> + OPE_UNPACKER_FMT_PLAIN_8 = 1,
> + OPE_UNPACKER_FMT_PLAIN_16_10BPP = 2,
> + OPE_UNPACKER_FMT_MIPI_10 = 13,
> +};
> +
> +enum ope_packer_format {
> + OPE_PACKER_FMT_PLAIN_8 = 1,
> + OPE_PACKER_FMT_PLAIN_8_ODD_EVEN = 2,
> + OPE_PACKER_FMT_MIPI_10 = 12,
> +};
> +
> +struct ope_hw_fmt {
> + u32 fourcc;
> + enum ope_pixel_pattern pattern;
> + enum ope_unpacker_format unpacker;
> + enum ope_packer_format packer;
> +};
> +
> +static const struct ope_hw_fmt ope_hw_fmts[] = {
> + { V4L2_PIX_FMT_SBGGR10P, OPE_PIXEL_PATTERN_BGBGBG,
> + OPE_UNPACKER_FMT_MIPI_10, OPE_PACKER_FMT_MIPI_10 },
> + { V4L2_PIX_FMT_SGBRG10P, OPE_PIXEL_PATTERN_GBGBGB,
> + OPE_UNPACKER_FMT_MIPI_10, OPE_PACKER_FMT_MIPI_10 },
> + { V4L2_PIX_FMT_SGRBG10P, OPE_PIXEL_PATTERN_GRGRGR,
> + OPE_UNPACKER_FMT_MIPI_10, OPE_PACKER_FMT_MIPI_10 },
> + { V4L2_PIX_FMT_SRGGB10P, OPE_PIXEL_PATTERN_RGRGRG,
> + OPE_UNPACKER_FMT_MIPI_10, OPE_PACKER_FMT_MIPI_10 },
> + { V4L2_PIX_FMT_SRGGB8, OPE_PIXEL_PATTERN_RGRGRG,
> + OPE_UNPACKER_FMT_PLAIN_8, OPE_PACKER_FMT_PLAIN_8 },
> + { V4L2_PIX_FMT_SBGGR8, OPE_PIXEL_PATTERN_BGBGBG,
> + OPE_UNPACKER_FMT_PLAIN_8, OPE_PACKER_FMT_PLAIN_8 },
> + { V4L2_PIX_FMT_SGBRG8, OPE_PIXEL_PATTERN_GBGBGB,
> + OPE_UNPACKER_FMT_PLAIN_8, OPE_PACKER_FMT_PLAIN_8 },
> + { V4L2_PIX_FMT_SGRBG8, OPE_PIXEL_PATTERN_GRGRGR,
> + OPE_UNPACKER_FMT_PLAIN_8, OPE_PACKER_FMT_PLAIN_8 },
> + { V4L2_PIX_FMT_NV24, OPE_PIXEL_PATTERN_YCBYCR,
> + OPE_UNPACKER_FMT_PLAIN_8, OPE_PACKER_FMT_PLAIN_8 },
> + { V4L2_PIX_FMT_NV42, OPE_PIXEL_PATTERN_YCRYCB,
> + OPE_UNPACKER_FMT_PLAIN_8, OPE_PACKER_FMT_PLAIN_8_ODD_EVEN },
> + { V4L2_PIX_FMT_NV16, OPE_PIXEL_PATTERN_CBYCRY,
> + OPE_UNPACKER_FMT_PLAIN_8, OPE_PACKER_FMT_PLAIN_8 },
> + { V4L2_PIX_FMT_NV61, OPE_PIXEL_PATTERN_CBYCRY,
> + OPE_UNPACKER_FMT_PLAIN_8, OPE_PACKER_FMT_PLAIN_8_ODD_EVEN },
> + { V4L2_PIX_FMT_NV12, OPE_PIXEL_PATTERN_CBYCRY,
> + OPE_UNPACKER_FMT_PLAIN_8, OPE_PACKER_FMT_PLAIN_8 },
> + { V4L2_PIX_FMT_NV21, OPE_PIXEL_PATTERN_CBYCRY,
> + OPE_UNPACKER_FMT_PLAIN_8, OPE_PACKER_FMT_PLAIN_8_ODD_EVEN },
> + { V4L2_PIX_FMT_GREY, OPE_PIXEL_PATTERN_RGRGRG,
> + OPE_UNPACKER_FMT_PLAIN_8, OPE_PACKER_FMT_PLAIN_8 },
> +};
> +
> +static const struct ope_hw_fmt *ope_find_hw_fmt(u32 fourcc)
> +{
> + unsigned int i;
> +
> + for (i = 0; i < ARRAY_SIZE(ope_hw_fmts); i++)
> + if (ope_hw_fmts[i].fourcc == fourcc)
> + return &ope_hw_fmts[i];
> +
> + return NULL;
> +}
> +
> +static const struct ope_fmt ope_input_fmts[] = {
> + { V4L2_PIX_FMT_SBGGR10P, 10, 2, 1, MEDIA_BUS_FMT_SBGGR10_1X10 },
> + { V4L2_PIX_FMT_SGBRG10P, 10, 2, 1, MEDIA_BUS_FMT_SGBRG10_1X10 },
> + { V4L2_PIX_FMT_SGRBG10P, 10, 2, 1, MEDIA_BUS_FMT_SGRBG10_1X10 },
> + { V4L2_PIX_FMT_SRGGB10P, 10, 2, 1, MEDIA_BUS_FMT_SRGGB10_1X10 },
> + { V4L2_PIX_FMT_SRGGB8, 8, 0, 1, MEDIA_BUS_FMT_SRGGB8_1X8 },
> + { V4L2_PIX_FMT_SBGGR8, 8, 0, 1, MEDIA_BUS_FMT_SBGGR8_1X8 },
> + { V4L2_PIX_FMT_SGBRG8, 8, 0, 1, MEDIA_BUS_FMT_SGBRG8_1X8 },
> + { V4L2_PIX_FMT_SGRBG8, 8, 0, 1, MEDIA_BUS_FMT_SGRBG8_1X8 },
> +};
> +
> +static const struct ope_fmt ope_output_fmts[] = {
> + { V4L2_PIX_FMT_NV24, 24, 0, 1, MEDIA_BUS_FMT_YUV8_1X24 },
> + { V4L2_PIX_FMT_NV42, 24, 0, 1, MEDIA_BUS_FMT_YUV8_1X24 },
> + { V4L2_PIX_FMT_NV16, 16, 1, 1, MEDIA_BUS_FMT_YUYV8_2X8 },
> + { V4L2_PIX_FMT_NV61, 16, 1, 1, MEDIA_BUS_FMT_YUYV8_2X8 },
> + { V4L2_PIX_FMT_NV12, 12, 1, 1, MEDIA_BUS_FMT_YUYV8_1_5X8 },
> + { V4L2_PIX_FMT_NV21, 12, 1, 1, MEDIA_BUS_FMT_YUYV8_1_5X8 },
> + { V4L2_PIX_FMT_GREY, 8, 0, 1, MEDIA_BUS_FMT_Y8_1X8 },
> +};
> +
> +struct ope_dsc_config {
> + u32 input_width;
> + u32 input_height;
> + u32 output_width;
> + u32 output_height;
> + u32 phase_step_h;
> + u32 phase_step_v;
> + u32 crop_last_pixel;
> + u32 crop_last_line;
> +};
> +
> +struct ope_stripe {
> + struct {
> + dma_addr_t addr;
> + u32 width;
> + u32 height;
> + u32 stride;
> + enum ope_stripe_location location;
> + enum ope_pixel_pattern pattern;
> + enum ope_unpacker_format format;
> + } src;
> + struct {
> + dma_addr_t addr;
> + u32 width;
> + u32 height;
> + u32 stride;
> + u32 x_init;
> + enum ope_packer_format format;
> + bool enabled;
> + } dst[OPE_WR_CLIENT_MAX];
> + struct ope_dsc_config dsc[OPE_DS_MAX];
> +};
> +
> +/* OPE tuning parameter state */
> +
> +/* Demosaic (CLC_DEMO) coefficients — internal only, not user-configurable.
> */
> +#define CAMSS_PARAMS_INTERNAL (-1)
> +
> +struct ope_params_demo {
> + struct v4l2_isp_params_block_header header;
> + u8 lambda_rb;
> + u8 lambda_g;
> + u16 a_k;
> + u16 w_k;
> + u16 _pad;
> +};
> +
> +/* OPE tuning parameter state */
> +struct ope_config {
> + struct camss_params_wb_gain wb_gain;
> + struct camss_params_chroma_enhan chroma_enhan;
> + struct camss_params_color_correct color_correct;
> + struct ope_params_demo demo;
> +};
> +
> +enum ope_entity {
> + OPE_ENTITY_FRAME_IN,
> + OPE_ENTITY_PARAMS,
> + OPE_ENTITY_PROC,
> + OPE_ENTITY_DISP,
> + OPE_ENTITY_DISP_OUT,
> + OPE_ENTITY_COUNT
> +};
> +
> +enum ope_queue_idx {
> + OPE_QUEUE_FRAME_IN,
> + OPE_QUEUE_DISP_OUT,
> + OPE_QUEUE_PARAMS,
> + OPE_QUEUE_COUNT
> +};
> +
> +/* per-context state */
> +struct ope_ctx {
> + struct camss_isp_bufq *bufq;
> + struct camss_isp_job job;
> + struct ope_dev *ope;
> + struct mutex vbq_lock;
> + struct vb2_queue vqs[OPE_QUEUE_COUNT];
> +
> + unsigned int framerate;
> + struct ope_fmt_state fmt_in;
> + struct ope_fmt_state fmt_out;
> + u32 proc_mbus_code;
> + struct v4l2_rect disp_compose;
> +
> + struct list_head list;
> + bool started;
> +
> + struct ope_config config;
> + u8 current_stripe;
> + struct ope_stripe stripe[OPE_MAX_STRIPE];
> +};
> +
> +/* Per OPE device state */
> +struct ope_dev {
> + struct device *dev;
> + struct v4l2_device v4l2_dev;
> + struct media_device mdev;
> + struct camss_isp_pipeline *pipeline;
> + struct mutex mutex;
> + struct camss_isp_sched sched;
> +
> + struct icc_path *icc_data;
> + struct icc_path *icc_config;
> +
> + void __iomem *base;
> + void __iomem *base_rd;
> + void __iomem *base_wr;
> + void __iomem *base_pp;
> +
> + struct completion reset_complete;
> +
> + struct list_head ctx_list;
> + struct ope_ctx *shared_ctx;
> + unsigned int open_count;
> +
> + /* Currently active hardware context (set at job start) */
> + struct ope_ctx *hw_ctx;
> +};
> +
> +/* -------- Register accessors -------- */
> +
> +static inline u32 ope_read(struct ope_dev *ope, u32 reg)
> +{
> + return readl(ope->base + reg);
> +}
> +
> +static inline void ope_write(struct ope_dev *ope, u32 reg, u32 val)
> +{
> + writel(val, ope->base + reg);
> +}
> +
> +static inline void ope_write_wr(struct ope_dev *ope, u32 reg, u32 val)
> +{
> + writel_relaxed(val, ope->base_wr + reg);
> +}
> +
> +static inline u32 ope_read_wr(struct ope_dev *ope, u32 reg)
> +{
> + return readl_relaxed(ope->base_wr + reg);
> +}
> +
> +static inline void ope_write_rd(struct ope_dev *ope, u32 reg, u32 val)
> +{
> + writel_relaxed(val, ope->base_rd + reg);
> +}
> +
> +static inline void ope_write_pp(struct ope_dev *ope, u32 reg, u32 val)
> +{
> + writel_relaxed(val, ope->base_pp + reg);
> +}
> +
> +static inline void ope_start(struct ope_dev *ope)
> +{
> + wmb(); /* ensure all register writes are visible before GO_CMD */
> + ope_write_rd(ope, OPE_BUS_RD_INPUT_IF_CMD,
> OPE_BUS_RD_INPUT_IF_CMD_GO_CMD);
> +}
> +
> +/* -------- Stripe helpers -------- */
> +
> +static inline enum ope_stripe_location ope_stripe_location(unsigned int idx,
> unsigned int count)
> +{
> + if (count == 1)
> + return OPE_STRIPE_LOCATION_FULL;
> + if (idx == 0)
> + return OPE_STRIPE_LOCATION_LEFT;
> + if (idx == count - 1)
> + return OPE_STRIPE_LOCATION_RIGHT;
> +
> + return OPE_STRIPE_LOCATION_MIDDLE;
> +}
> +
> +static inline bool ope_stripe_is_last(const struct ope_stripe *s)
> +{
> + return s && (s->src.location == OPE_STRIPE_LOCATION_RIGHT ||
> + s->src.location == OPE_STRIPE_LOCATION_FULL);
> +}
> +
> +static inline struct ope_stripe *ope_current_stripe(struct ope_ctx *ctx)
> +{
> + if (ctx->current_stripe >= OPE_MAX_STRIPE)
> + return NULL;
> +
> + return &ctx->stripe[ctx->current_stripe];
> +}
> +
> +static inline unsigned int ope_stripe_index(struct ope_ctx *ctx,
> + const struct ope_stripe *stripe)
> +{
> + return stripe - &ctx->stripe[0];
> +}
> +
> +static inline struct ope_stripe *ope_prev_stripe(struct ope_ctx *ctx,
> + struct ope_stripe *stripe)
> +{
> + unsigned int idx = ope_stripe_index(ctx, stripe);
> +
> + return idx ? &ctx->stripe[idx - 1] : NULL;
> +}
> +
> +static void ope_gen_stripe_chroma_dsc(struct ope_ctx *ctx,
> + struct ope_stripe *stripe)
> +{
> + struct ope_dsc_config *dsc = &stripe->dsc[OPE_DS_C_PRE];
> + u32 dst_fourcc = ctx->fmt_out.fmt->fourcc;
> +
> + dsc->input_width = stripe->src.width;
> + dsc->input_height = stripe->src.height;
> +
> + switch (dst_fourcc) {
> + case V4L2_PIX_FMT_NV61:
> + case V4L2_PIX_FMT_NV16:
> + dsc->output_width = dsc->input_width / 2;
> + dsc->output_height = dsc->input_height;
> + break;
> + case V4L2_PIX_FMT_NV12:
> + case V4L2_PIX_FMT_NV21:
> + dsc->output_width = dsc->input_width / 2;
> + dsc->output_height = dsc->input_height / 2;
> + break;
> + default:
> + dsc->output_width = dsc->input_width;
> + dsc->output_height = dsc->input_height;
> + }
> +
> + dsc->phase_step_h = DS_Q21(dsc->input_width, dsc->output_width);
> + dsc->phase_step_v = DS_Q21(dsc->input_height, dsc->output_height);
> +}
> +
> +static void ope_gen_stripe_dsc(struct ope_ctx *ctx, struct ope_stripe
> *stripe,
> + u32 h_scale, u32 v_scale)
> +{
> + struct ope_dsc_config *dsc_c = &stripe->dsc[OPE_DS_C_DISP];
> + struct ope_dsc_config *dsc_y = &stripe->dsc[OPE_DS_Y_DISP];
> + unsigned int sw = stripe->src.width;
> + unsigned int sw_c = stripe->dsc[OPE_DS_C_PRE].output_width;
> +
> + dsc_c->phase_step_h = dsc_y->phase_step_h = h_scale;
> + dsc_c->phase_step_v = dsc_y->phase_step_v = v_scale;
> +
> + dsc_c->input_width = stripe->dsc[OPE_DS_C_PRE].output_width;
> + dsc_c->input_height = stripe->dsc[OPE_DS_C_PRE].output_height;
> + dsc_y->input_width = stripe->src.width;
> + dsc_y->input_height = stripe->src.height;
> +
> + /*
> + * WE width/height = DS_OUTPUT_PIX (floor). The scaler may deliver
> + * floor or floor+1 pixels/lines; CROP_RND_CLAMP clips the output to
> + * exactly floor so the WE always receives the expected count.
> + */
> + dsc_y->output_width = DS_OUTPUT_PIX(sw, 0, h_scale);
> + dsc_y->output_height = DS_OUTPUT_PIX(stripe->src.height, 0, v_scale);
> + dsc_c->output_width = DS_OUTPUT_PIX(sw_c, 0, h_scale);
> + dsc_c->output_height =
> DS_OUTPUT_PIX(stripe->dsc[OPE_DS_C_PRE].output_height, 0, v_scale);
> +
> + dsc_y->crop_last_pixel = dsc_y->output_width - 1;
> + dsc_y->crop_last_line = dsc_y->output_height - 1;
> + dsc_c->crop_last_pixel = dsc_c->output_width - 1;
> + dsc_c->crop_last_line = dsc_c->output_height - 1;
> +}
> +
> +static void ope_gen_stripe_yuv_dst(struct ope_ctx *ctx, struct ope_stripe
> *stripe, dma_addr_t dst)
> +{
> + const struct ope_fmt_state *fo = &ctx->fmt_out;
> + unsigned int img_h = fo->height;
> + unsigned int bpl = fo->bytesperline; /* Y-plane row stride in bytes */
> + const struct ope_hw_fmt *hw = ope_find_hw_fmt(fo->fmt->fourcc);
> + struct ope_stripe *prev = ope_prev_stripe(ctx, stripe);
> + const struct v4l2_rect *compose = &ctx->disp_compose;
> + dma_addr_t y_base = dst + compose->top * bpl;
> + dma_addr_t c_base;
> + u32 x_init = compose->left;
> + u32 c_x_init;
> +
> + switch (fo->fmt->fourcc) {
> + case V4L2_PIX_FMT_NV24:
> + case V4L2_PIX_FMT_NV42:
> + /* YUV444: C plane starts after Y plane; full-height, 2
> bytes/luma-col */
> + c_base = dst + bpl * img_h + compose->top * bpl * 2;
> + c_x_init = compose->left * 2;
> + break;
> + case V4L2_PIX_FMT_NV16:
> + case V4L2_PIX_FMT_NV61:
> + /* YUV422: C plane = Y plane size; full-height, 1 byte/luma-col
> */
> + c_base = dst + bpl * img_h + compose->top * bpl;
> + c_x_init = compose->left;
> + break;
> + default:
> + /* YUV420 (NV12/NV21): C plane = Y/2 height, 1 byte/luma-col */
> + c_base = dst + bpl * img_h + (compose->top / 2) * bpl;
> + c_x_init = compose->left;
> + break;
> + }
> +
> + stripe->dst[OPE_WR_CLIENT_DISP_Y].enabled = true;
> + stripe->dst[OPE_WR_CLIENT_DISP_C].enabled = true;
> +
> + /* Y plane */
> + if (prev)
> + x_init = prev->dst[OPE_WR_CLIENT_DISP_Y].x_init +
> + prev->dst[OPE_WR_CLIENT_DISP_Y].width;
> +
> + stripe->dst[OPE_WR_CLIENT_DISP_Y].addr = y_base;
> + stripe->dst[OPE_WR_CLIENT_DISP_Y].x_init = x_init;
> + stripe->dst[OPE_WR_CLIENT_DISP_Y].width =
> stripe->dsc[OPE_DS_Y_DISP].output_width;
> + stripe->dst[OPE_WR_CLIENT_DISP_Y].height =
> stripe->dsc[OPE_DS_Y_DISP].output_height;
> + stripe->dst[OPE_WR_CLIENT_DISP_Y].stride = bpl;
> + stripe->dst[OPE_WR_CLIENT_DISP_Y].format = OPE_PACKER_FMT_PLAIN_8;
> +
> + /* UV plane */
> + x_init = c_x_init;
> + if (prev)
> + x_init = prev->dst[OPE_WR_CLIENT_DISP_C].x_init +
> + prev->dst[OPE_WR_CLIENT_DISP_C].width;
> +
> + stripe->dst[OPE_WR_CLIENT_DISP_C].addr = c_base;
> + stripe->dst[OPE_WR_CLIENT_DISP_C].x_init = x_init;
> + stripe->dst[OPE_WR_CLIENT_DISP_C].format = hw ? hw->packer :
> OPE_PACKER_FMT_PLAIN_8;
> + stripe->dst[OPE_WR_CLIENT_DISP_C].width =
> stripe->dsc[OPE_DS_C_DISP].output_width * 2;
> + stripe->dst[OPE_WR_CLIENT_DISP_C].height =
> stripe->dsc[OPE_DS_C_DISP].output_height;
> +
> + switch (fo->fmt->fourcc) {
> + case V4L2_PIX_FMT_NV42:
> + case V4L2_PIX_FMT_NV24:
> + stripe->dst[OPE_WR_CLIENT_DISP_C].stride = bpl * 2;
> + break;
> + case V4L2_PIX_FMT_GREY:
> + stripe->dst[OPE_WR_CLIENT_DISP_C].enabled = false;
> + break;
> + default:
> + stripe->dst[OPE_WR_CLIENT_DISP_C].stride = bpl;
> + }
> +}
> +
> +static void ope_gen_stripes(struct ope_ctx *ctx, dma_addr_t src, dma_addr_t
> dst)
> +{
> + const struct ope_fmt_state *fi = &ctx->fmt_in;
> + const struct v4l2_rect *crop = &fi->crop;
> + const struct v4l2_rect *compose = &ctx->disp_compose;
> + const struct ope_hw_fmt *src_hw = ope_find_hw_fmt(fi->fmt->fourcc);
> + unsigned int num_stripes, width, x_out, x_out_c, i;
> + u32 h_scale, v_scale;
> +
> + /* Advance source pointer to the crop origin */
> + src += crop->top * fi->bytesperline + crop->left * fi->fmt->depth / 8;
> + width = crop->width;
> + num_stripes = DIV_ROUND_UP(crop->width, OPE_STRIPE_MAX_W);
> + h_scale = DS_Q21(crop->width, compose->width);
> + v_scale = DS_Q21(crop->height, compose->height);
> +
> + x_out = x_out_c = 0;
> +
> + for (i = 0; i < num_stripes; i++) {
> + struct ope_stripe *stripe = &ctx->stripe[i];
> + unsigned int sw;
> +
> + memset(stripe, 0, sizeof(*stripe));
> +
> + stripe->src.addr = src;
> + stripe->src.width = width;
> + stripe->src.height = crop->height;
> + stripe->src.stride = fi->bytesperline;
> + stripe->src.location = ope_stripe_location(i, num_stripes);
> + stripe->src.pattern = src_hw ? src_hw->pattern : 0;
> + stripe->src.format = src_hw ? src_hw->unpacker : 0;
> +
> + /* Ensure last stripe is wide enough */
> + if (width > OPE_STRIPE_MAX_W && width < OPE_STRIPE_MAX_W +
> OPE_STRIPE_MIN_W)
> + stripe->src.width -= OPE_STRIPE_MIN_W * 2;
> +
> + v4l_bound_align_image(&stripe->src.width,
> + OPE_STRIPE_MIN_W, OPE_STRIPE_MAX_W,
> + fi->fmt->align,
> + &stripe->src.height,
> + OPE_STRIPE_MIN_H, OPE_STRIPE_MAX_H,
> + OPE_ALIGN_H, 0);
> +
> + sw = stripe->src.width;
> +
> + width -= stripe->src.width;
> + src += stripe->src.width * fi->fmt->depth / 8;
> +
> + /*
> + * Last-stripe adjustment: grow the input width (up to the
> + * available remaining input) until the scaler delivers at
> + * least the remaining output pixels for both Y and C.
> + * CROP_RND_CLAMP then clips the scaler output to exactly
> + * the remaining pixels so the total sums to fo->width.
> + */
> + if (ope_stripe_is_last(stripe)) {
> + unsigned int rem_y = compose->width - x_out;
> + unsigned int rem_c = compose->width / 2 - x_out_c;
> + unsigned int s;
> +
> + for (s = OPE_STRIPE_MIN_W; s <= sw; s += 2) {
> + if (DS_OUTPUT_PIX(s, 0, h_scale) >= rem_y &&
> + DS_OUTPUT_PIX(s / 2, 0, h_scale) >= rem_c) {
> + stripe->src.width = s;
> + break;
> + }
> + }
> + }
> +
> + ope_gen_stripe_chroma_dsc(ctx, stripe);
> + ope_gen_stripe_dsc(ctx, stripe, h_scale, v_scale);
> + ope_gen_stripe_yuv_dst(ctx, stripe, dst);
> +
> + x_out += stripe->dsc[OPE_DS_Y_DISP].output_width;
> + x_out_c += stripe->dsc[OPE_DS_C_DISP].output_width;
> +
> + /* Width in bytes for the fetch engine */
> + stripe->src.width = stripe->src.width * fi->fmt->depth / 8;
> + }
> +}
> +
> +/* -------- Pipeline IQ module programming -------- */
> +
> +static bool ope_module_update(struct ope_ctx *ctx, u32 module_cfg_reg, u32
> enable_mask,
> + struct v4l2_isp_params_block_header *hdr, bool
> force)
> +{
> + bool enable = !(hdr->flags & V4L2_ISP_PARAMS_FL_BLOCK_DISABLE);
> + bool dirty = hdr->flags & CAMSS_ISP_PARAMS_FL_BLOCK_DIRTY;
> +
> + /* skip if neither forced (new context) nor dirty (updated params) */
> + if (!force && !dirty)
> + return false;
> +
> + hdr->flags &= ~CAMSS_ISP_PARAMS_FL_BLOCK_DIRTY;
> +
> + ope_write_pp(ctx->ope, module_cfg_reg, enable ? enable_mask : 0);
> +
> + /* If module is disabled, not need to update content */
> + if (!enable)
> + return false;
> +
> + dev_dbg(ctx->ope->dev, "IQ module (%u) update (flags:0x%x)\n",
> hdr->type, hdr->flags);
> +
> + return true;
> +}
> +
> +static void ope_prog_wb(struct ope_ctx *ctx, bool force)
> +{
> + struct camss_params_wb_gain *wb = &ctx->config.wb_gain;
> + struct ope_dev *ope = ctx->ope;
> +
> + if (!ope_module_update(ctx, OPE_PP_CLC_WB_GAIN_MODULE_CFG,
> + OPE_PP_CLC_WB_GAIN_MODULE_CFG_EN,
> + &wb->header, force))
> + return;
> +
> + ope_write_pp(ope, OPE_PP_CLC_WB_GAIN_WB_CFG(0), wb->g_gain);
> + ope_write_pp(ope, OPE_PP_CLC_WB_GAIN_WB_CFG(1), wb->b_gain);
> + ope_write_pp(ope, OPE_PP_CLC_WB_GAIN_WB_CFG(2), wb->r_gain);
> +}
> +
> +static void ope_prog_bayer2rgb(struct ope_ctx *ctx, bool force)
> +{
> + struct ope_params_demo *demo = &ctx->config.demo;
> + struct ope_dev *ope = ctx->ope;
> +
> + if (!ope_module_update(ctx, OPE_PP_CLC_DEMO_MODULE_CFG,
> + OPE_PP_CLC_DEMO_MODULE_CFG_EN |
> + OPE_PP_CLC_DEMO_MODULE_CFG_DYN_G_CLAMP_EN,
> + &demo->header, force))
> + return;
> +
> + ope_write_pp(ope, OPE_PP_CLC_DEMO_INTERP_COEFF_CFG,
> + FIELD_PREP(OPE_PP_CLC_DEMO_INTERP_COEFF_CFG_LAMBDA_G,
> demo->lambda_g) |
> + FIELD_PREP(OPE_PP_CLC_DEMO_INTERP_COEFF_CFG_LAMBDA_RB,
> demo->lambda_rb));
> + ope_write_pp(ope, OPE_PP_CLC_DEMO_INTERP_CLASSIFIER_CFG_0,
> + FIELD_PREP(OPE_PP_CLC_DEMO_INTERP_CLASSIFIER_CFG_0_AK,
> demo->a_k));
> + ope_write_pp(ope, OPE_PP_CLC_DEMO_INTERP_CLASSIFIER_CFG_1,
> + FIELD_PREP(OPE_PP_CLC_DEMO_INTERP_CLASSIFIER_CFG_1_WK,
> demo->w_k));
> +}
> +
> +static void ope_prog_rgb2yuv(struct ope_ctx *ctx, bool force)
> +{
> + struct camss_params_chroma_enhan *cc = &ctx->config.chroma_enhan;
> + struct ope_dev *ope = ctx->ope;
> +
> + if (!ope_module_update(ctx, OPE_PP_CLC_CHROMA_ENHAN_MODULE_CFG,
> + OPE_PP_CLC_CHROMA_ENHAN_MODULE_CFG_EN,
> + &cc->header, force))
> + return;
> +
> + ope_write_pp(ope, OPE_PP_CLC_CHROMA_ENHAN_LUMA_CFG_0,
> + FIELD_PREP(OPE_PP_CLC_CHROMA_ENHAN_LUMA_CFG_0_V0,
> cc->luma_v0) |
> + FIELD_PREP(OPE_PP_CLC_CHROMA_ENHAN_LUMA_CFG_0_V1,
> cc->luma_v1));
> + ope_write_pp(ope, OPE_PP_CLC_CHROMA_ENHAN_LUMA_CFG_2,
> + FIELD_PREP(OPE_PP_CLC_CHROMA_ENHAN_LUMA_CFG_2_V2,
> cc->luma_v2));
> + ope_write_pp(ope, OPE_PP_CLC_CHROMA_ENHAN_LUMA_CFG_1,
> + FIELD_PREP(OPE_PP_CLC_CHROMA_ENHAN_LUMA_CFG_1_K,
> cc->luma_k));
> + ope_write_pp(ope, OPE_PP_CLC_CHROMA_ENHAN_COEFF_A_CFG,
> + FIELD_PREP(OPE_PP_CLC_CHROMA_ENHAN_COEFF_A_CFG_AP,
> cc->coeff_ap) |
> + FIELD_PREP(OPE_PP_CLC_CHROMA_ENHAN_COEFF_A_CFG_AM,
> cc->coeff_am));
> + ope_write_pp(ope, OPE_PP_CLC_CHROMA_ENHAN_COEFF_B_CFG,
> + FIELD_PREP(OPE_PP_CLC_CHROMA_ENHAN_COEFF_B_CFG_BP,
> cc->coeff_dp) |
> + FIELD_PREP(OPE_PP_CLC_CHROMA_ENHAN_COEFF_B_CFG_BM,
> cc->coeff_dm));
> + ope_write_pp(ope, OPE_PP_CLC_CHROMA_ENHAN_COEFF_C_CFG,
> + FIELD_PREP(OPE_PP_CLC_CHROMA_ENHAN_COEFF_C_CFG_CP,
> cc->coeff_cp) |
> + FIELD_PREP(OPE_PP_CLC_CHROMA_ENHAN_COEFF_C_CFG_CM,
> cc->coeff_cm));
> + ope_write_pp(ope, OPE_PP_CLC_CHROMA_ENHAN_COEFF_D_CFG,
> + FIELD_PREP(OPE_PP_CLC_CHROMA_ENHAN_COEFF_D_CFG_DP,
> cc->coeff_dp) |
> + FIELD_PREP(OPE_PP_CLC_CHROMA_ENHAN_COEFF_D_CFG_DM,
> cc->coeff_dm));
> + ope_write_pp(ope, OPE_PP_CLC_CHROMA_ENHAN_CHROMA_CFG_0,
> + FIELD_PREP(OPE_PP_CLC_CHROMA_ENHAN_CHROMA_CFG_0_KCB,
> cc->kcb));
> + ope_write_pp(ope, OPE_PP_CLC_CHROMA_ENHAN_CHROMA_CFG_1,
> + FIELD_PREP(OPE_PP_CLC_CHROMA_ENHAN_CHROMA_CFG_1_KCR,
> cc->kcr));
> +}
> +
> +static void ope_prog_color_correct(struct ope_ctx *ctx, bool force)
> +{
> + struct camss_params_color_correct *cc = &ctx->config.color_correct;
> + struct ope_dev *ope = ctx->ope;
> +
> + if (!ope_module_update(ctx, OPE_PP_CLC_CC_MODULE_CFG,
> + OPE_PP_CLC_CC_MODULE_CFG_EN,
> + &cc->header, force))
> + return;
> +
> + ope_write_pp(ope, OPE_PP_CLC_CC_COEFF_A_CFG_0,
> + FIELD_PREP(OPE_PP_CLC_CC_COEFF_A_CFG_0_A0, cc->a[0]) |
> + FIELD_PREP(OPE_PP_CLC_CC_COEFF_A_CFG_0_A1, cc->a[1]));
> + ope_write_pp(ope, OPE_PP_CLC_CC_COEFF_A_CFG_1,
> + FIELD_PREP(OPE_PP_CLC_CC_COEFF_A_CFG_1_A2, cc->a[2]));
> + ope_write_pp(ope, OPE_PP_CLC_CC_COEFF_B_CFG_0,
> + FIELD_PREP(OPE_PP_CLC_CC_COEFF_B_CFG_0_B0, cc->b[0]) |
> + FIELD_PREP(OPE_PP_CLC_CC_COEFF_B_CFG_0_B1, cc->b[1]));
> + ope_write_pp(ope, OPE_PP_CLC_CC_COEFF_B_CFG_1,
> + FIELD_PREP(OPE_PP_CLC_CC_COEFF_B_CFG_1_B2, cc->b[2]));
> + ope_write_pp(ope, OPE_PP_CLC_CC_COEFF_C_CFG_0,
> + FIELD_PREP(OPE_PP_CLC_CC_COEFF_C_CFG_0_C0, cc->c[0]) |
> + FIELD_PREP(OPE_PP_CLC_CC_COEFF_C_CFG_0_C1, cc->c[1]));
> + ope_write_pp(ope, OPE_PP_CLC_CC_COEFF_C_CFG_1,
> + FIELD_PREP(OPE_PP_CLC_CC_COEFF_C_CFG_1_C2, cc->c[2]));
> + ope_write_pp(ope, OPE_PP_CLC_CC_COEFF_K_CFG_0,
> + FIELD_PREP(OPE_PP_CLC_CC_COEFF_K_CFG_0_K0, cc->k[0]));
> + ope_write_pp(ope, OPE_PP_CLC_CC_COEFF_K_CFG_1,
> + FIELD_PREP(OPE_PP_CLC_CC_COEFF_K_CFG_1_K1, cc->k[1]));
> + ope_write_pp(ope, OPE_PP_CLC_CC_COEFF_K_CFG_2,
> + FIELD_PREP(OPE_PP_CLC_CC_COEFF_K_CFG_2_K2, cc->k[2]));
> + ope_write_pp(ope, OPE_PP_CLC_CC_COEFF_M_CFG,
> + FIELD_PREP(OPE_PP_CLC_CC_COEFF_M_CFG_M, cc->m));
> +}
> +
> +static void ope_prog_crop_rnd_clamp(struct ope_dev *ope, const struct
> ope_stripe *stripe)
> +{
> + static const u32 crop_bases[] = {
> + OPE_PP_CLC_CROP_RND_CLAMP_Y_DISP_BASE,
> + OPE_PP_CLC_CROP_RND_CLAMP_C_DISP_BASE,
> + };
> + static const enum ope_downscaler ds_idx[] = {
> + OPE_DS_Y_DISP,
> + OPE_DS_C_DISP,
> + };
> + unsigned int i;
> +
> + for (i = 0; i < ARRAY_SIZE(crop_bases); i++) {
> + const struct ope_dsc_config *dsc = &stripe->dsc[ds_idx[i]];
> + u32 cbase = crop_bases[i];
> +
> + if (!dsc->output_width || !dsc->output_height) {
> + ope_write_pp(ope,
> OPE_PP_CLC_CROP_RND_CLAMP_MODULE_CFG(cbase), 0);
> + continue;
> + }
> +
> + ope_write_pp(ope,
> OPE_PP_CLC_CROP_RND_CLAMP_CROP_PIXEL_CFG(cbase),
> + FIELD_PREP(OPE_PP_CLC_CROP_RND_CLAMP_CROP_FIRST,
> 0) |
> + FIELD_PREP(OPE_PP_CLC_CROP_RND_CLAMP_CROP_LAST,
> + dsc->crop_last_pixel));
> + ope_write_pp(ope,
> OPE_PP_CLC_CROP_RND_CLAMP_CROP_LINE_CFG(cbase),
> + FIELD_PREP(OPE_PP_CLC_CROP_RND_CLAMP_CROP_FIRST,
> 0) |
> + FIELD_PREP(OPE_PP_CLC_CROP_RND_CLAMP_CROP_LAST,
> dsc->crop_last_line));
> + ope_write_pp(ope, OPE_PP_CLC_CROP_RND_CLAMP_MODULE_CFG(cbase),
> + OPE_PP_CLC_CROP_RND_CLAMP_EN |
> OPE_PP_CLC_CROP_RND_CLAMP_CROP_EN);
> + }
> +}
> +
> +static void ope_prog_stripe(struct ope_ctx *ctx, struct ope_stripe *stripe)
> +{
> + struct ope_dev *ope = ctx->ope;
> + int i;
> +
> + dev_dbg(ope->dev, "ctx=%p programming stripe %u\n",
> + ctx, (unsigned int)(stripe - ctx->stripe));
> +
> + /* Fetch Engine */
> + ope_write_rd(ope, OPE_BUS_RD_CLIENT_0_UNPACK_CFG_0, stripe->src.format);
> + ope_write_rd(ope, OPE_BUS_RD_CLIENT_0_RD_BUFFER_SIZE,
> + (stripe->src.width << 16) | stripe->src.height);
> + ope_write_rd(ope, OPE_BUS_RD_CLIENT_0_ADDR_IMAGE, stripe->src.addr);
> + ope_write_rd(ope, OPE_BUS_RD_CLIENT_0_RD_STRIDE, stripe->src.stride);
> + ope_write_rd(ope, OPE_BUS_RD_CLIENT_0_CCIF_META_DATA,
> + FIELD_PREP(OPE_BUS_RD_CLIENT_0_CCIF_MD_PIX_PATTERN,
> + stripe->src.pattern));
> + ope_write_rd(ope, OPE_BUS_RD_CLIENT_0_CORE_CFG,
> + OPE_BUS_RD_CLIENT_0_CORE_CFG_EN);
> +
> + /* Write Engines */
> + for (i = 0; i < OPE_WR_CLIENT_MAX; i++) {
> + if (!stripe->dst[i].enabled) {
> + ope_write_wr(ope, OPE_BUS_WR_CLIENT_CFG(i), 0);
> + continue;
> + }
> + ope_write_wr(ope, OPE_BUS_WR_CLIENT_ADDR_IMAGE(i),
> + stripe->dst[i].addr);
> + ope_write_wr(ope, OPE_BUS_WR_CLIENT_IMAGE_CFG_0(i),
> + (stripe->dst[i].height << 16) |
> stripe->dst[i].width);
> + ope_write_wr(ope, OPE_BUS_WR_CLIENT_IMAGE_CFG_1(i),
> + stripe->dst[i].x_init);
> + ope_write_wr(ope, OPE_BUS_WR_CLIENT_IMAGE_CFG_2(i),
> + stripe->dst[i].stride);
> + ope_write_wr(ope, OPE_BUS_WR_CLIENT_PACKER_CFG(i),
> + stripe->dst[i].format);
> + ope_write_wr(ope, OPE_BUS_WR_CLIENT_CFG(i),
> + OPE_BUS_WR_CLIENT_CFG_EN |
> + OPE_BUS_WR_CLIENT_CFG_AUTORECOVER);
> + }
> +
> + /* Downscalers */
> + for (i = 0; i < OPE_DS_MAX; i++) {
> + struct ope_dsc_config *dsc = &stripe->dsc[i];
> + u32 base = ope_ds_base[i];
> + u32 cfg = 0;
> +
> + if (dsc->input_width != dsc->output_width) {
> + dsc->phase_step_h |=
> + DS_RESOLUTION(dsc->input_width,
> + dsc->output_width) << 30;
> + cfg |= OPE_PP_CLC_DOWNSCALE_MN_DS_CFG_H_SCALE_EN;
> + }
> + if (dsc->input_height != dsc->output_height) {
> + dsc->phase_step_v |=
> + DS_RESOLUTION(dsc->input_height,
> + dsc->output_height) << 30;
> + cfg |= OPE_PP_CLC_DOWNSCALE_MN_DS_CFG_V_SCALE_EN;
> + }
> +
> + ope_write_pp(ope, OPE_PP_CLC_DOWNSCALE_MN_DS_CFG(base), cfg);
> + ope_write_pp(ope,
> OPE_PP_CLC_DOWNSCALE_MN_DS_IMAGE_SIZE_CFG(base),
> + ((dsc->input_width - 1) << 16) |
> (dsc->input_height - 1));
> + ope_write_pp(ope, OPE_PP_CLC_DOWNSCALE_MN_DS_MN_H_CFG(base),
> + dsc->phase_step_h);
> + ope_write_pp(ope, OPE_PP_CLC_DOWNSCALE_MN_DS_MN_V_CFG(base),
> + dsc->phase_step_v);
> + ope_write_pp(ope, OPE_PP_CLC_DOWNSCALE_MN_CFG(base),
> + cfg ? OPE_PP_CLC_DOWNSCALE_MN_CFG_EN : 0);
> +
> + dev_dbg(ope->dev,
> + "DS[%d] cfg=0x%x in=%ux%u out=%ux%u\n",
> + i, cfg,
> + dsc->input_width, dsc->input_height,
> + dsc->output_width, dsc->output_height);
> + }
> +
> + ope_prog_crop_rnd_clamp(ope, stripe);
> +}
> +
> +static void ope_params_apply_wb(void *priv, const union
> camss_isp_params_block *block)
> +{
> + struct ope_ctx *ctx = priv;
> +
> + ctx->config.wb_gain = block->wb_gain;
> + ctx->config.wb_gain.header.flags |= CAMSS_ISP_PARAMS_FL_BLOCK_DIRTY;
> +}
> +
> +static void ope_params_apply_chroma_enhan(void *priv, const union
> camss_isp_params_block *block)
> +{
> + struct ope_ctx *ctx = priv;
> +
> + ctx->config.chroma_enhan = block->chroma_enhan;
> + ctx->config.chroma_enhan.header.flags |=
> CAMSS_ISP_PARAMS_FL_BLOCK_DIRTY;
> +}
> +
> +static void ope_params_apply_color_correct(void *priv, const union
> camss_isp_params_block *block)
> +{
> + struct ope_ctx *ctx = priv;
> +
> + ctx->config.color_correct = block->color_correct;
> + ctx->config.color_correct.header.flags |=
> CAMSS_ISP_PARAMS_FL_BLOCK_DIRTY;
> +}
> +
> +static const struct v4l2_isp_params_block_type_info ope_params_type_info[] =
> {
> + [CAMSS_PARAMS_WB_GAIN] = { sizeof(struct camss_params_wb_gain) },
> + [CAMSS_PARAMS_CHROMA_ENHAN] = { sizeof(struct
> camss_params_chroma_enhan) },
> + [CAMSS_PARAMS_COLOR_CORRECT] = { sizeof(struct
> camss_params_color_correct) },
> +};
> +
> +static const camss_isp_params_handler_fn ope_params_handlers[] = {
> + [CAMSS_PARAMS_WB_GAIN] = ope_params_apply_wb,
> + [CAMSS_PARAMS_CHROMA_ENHAN] = ope_params_apply_chroma_enhan,
> + [CAMSS_PARAMS_COLOR_CORRECT] = ope_params_apply_color_correct,
> +};
> +
> +static void ope_apply_params(struct ope_ctx *ctx)
> +{
> + struct vb2_v4l2_buffer *vbuf;
> +
> + vbuf = camss_isp_bufq_next(ctx->bufq, OPE_QUEUE_PARAMS);
> + if (!vbuf)
> + return;
> +
> + camss_isp_params_apply(ctx->ope->dev, &vbuf->vb2_buf,
> + ope_params_type_info,
> + ope_params_handlers,
> + ARRAY_SIZE(ope_params_handlers),
> + ctx);
> +}
> +
> +/* Default/initial tuning parameters */
> +static const struct ope_config ope_default_config = {
> + .wb_gain = {
> + .header.type = CAMSS_PARAMS_WB_GAIN,
> + .header.flags = V4L2_ISP_PARAMS_FL_BLOCK_ENABLE |
> + CAMSS_ISP_PARAMS_FL_BLOCK_DIRTY,
> + .g_gain = OPE_WB(1, 1),
> + .b_gain = OPE_WB(3, 2),
> + .r_gain = OPE_WB(3, 2),
> + },
> + .demo = {
> + .header.type = CAMSS_PARAMS_INTERNAL,
> + .header.flags = V4L2_ISP_PARAMS_FL_BLOCK_ENABLE |
> + CAMSS_ISP_PARAMS_FL_BLOCK_DIRTY,
> + .lambda_rb = 0,
> + .lambda_g = 128,
> + .a_k = 128,
> + .w_k = 102,
> + },
> + .chroma_enhan = { /* RGB -> YUV values from BT.601 */
> + .header.type = CAMSS_PARAMS_CHROMA_ENHAN,
> + .header.flags = V4L2_ISP_PARAMS_FL_BLOCK_ENABLE |
> + CAMSS_ISP_PARAMS_FL_BLOCK_DIRTY,
> + .luma_v0 = 0x04d,
> + .luma_v1 = 0x096,
> + .luma_v2 = 0x01d,
> + .luma_k = 0,
> + .coeff_ap = 0x0e6,
> + .coeff_am = 0x0e6,
> + .coeff_cp = 0x0b3,
> + .coeff_cm = 0x0b3,
> + .coeff_dp = 0xfb3,
> + .coeff_dm = 0xfb3,
> + .kcb = 128,
> + .kcr = 128,
> + },
> + .color_correct = {
> + .header.type = CAMSS_PARAMS_COLOR_CORRECT,
> + .header.flags = V4L2_ISP_PARAMS_FL_BLOCK_DISABLE |
> + CAMSS_ISP_PARAMS_FL_BLOCK_DIRTY,
> + }
> +};
> +
> +/* -------- Job scheduling -------- */
> +static bool ope_job_ready(void *priv)
> +{
> + struct ope_ctx *ctx = priv;
> +
> + return ctx->started &&
> + camss_isp_bufq_num_ready(ctx->bufq, OPE_QUEUE_FRAME_IN) >= 1 &&
> + camss_isp_bufq_num_ready(ctx->bufq, OPE_QUEUE_DISP_OUT) >= 1;
> +}
> +
> +static void ope_job_finish(struct ope_ctx *ctx, enum vb2_buffer_state state)
> +{
> + struct vb2_v4l2_buffer *src, *dst, *params;
> + bool requeue = false;
> +
> + src = camss_isp_bufq_remove(ctx->bufq, OPE_QUEUE_FRAME_IN);
> + dst = camss_isp_bufq_remove(ctx->bufq, OPE_QUEUE_DISP_OUT);
> + params = camss_isp_bufq_remove(ctx->bufq, OPE_QUEUE_PARAMS);
> +
> + if (dst) {
> + dst->sequence = ctx->fmt_in.sequence++;
> + dst->vb2_buf.timestamp = src->vb2_buf.timestamp;
> + }
> +
> + if (dst)
> + camss_isp_buf_done(dst, state);
> + if (src)
> + camss_isp_buf_done(src, state);
> + if (params)
> + camss_isp_buf_done(params, state);
> +
> + if (ope_job_ready(ctx) && state == VB2_BUF_STATE_DONE)
> + requeue = true;
> +
> + dev_dbg(ctx->ope->dev, "done ctx=%p continue=%s\n", ctx, requeue ?
> "yes" : "no");
> +
> + ctx->ope->hw_ctx = NULL;
> + camss_isp_sched_job_finish(&ctx->ope->sched, &ctx->job, requeue);
> +}
> +
> +static dma_addr_t ope_buf_dma_addr(struct ope_ctx *ctx, unsigned int
> queue_idx)
> +{
> + struct vb2_v4l2_buffer *vbuf = camss_isp_bufq_next(ctx->bufq,
> queue_idx);
> +
> + if (!vbuf)
> + return 0;
> +
> + return vb2_dma_contig_plane_dma_addr(&vbuf->vb2_buf, 0);
> +}
> +
> +static void ope_run_job(void *priv, bool ctx_changed)
> +{
> + struct ope_ctx *ctx = priv;
> + struct ope_dev *ope = ctx->ope;
> + dma_addr_t src, dst;
> +
> + src = ope_buf_dma_addr(ctx, OPE_QUEUE_FRAME_IN);
> + dst = ope_buf_dma_addr(ctx, OPE_QUEUE_DISP_OUT);
> +
> + if (!src || !dst) {
> + dev_warn(ope->dev, "Job cannot run, missing buffer\n");
> + camss_isp_sched_job_finish(&ope->sched, &ctx->job, false);
> + return;
> + }
> +
> + dev_dbg(ope->dev, "start ctx=%p->%p src=%pad dst=%pad\n",
> + ope->hw_ctx, ctx, &src, &dst);
> +
> + ope_apply_params(ctx);
> + ope_gen_stripes(ctx, src, dst);
> +
> + ope_prog_wb(ctx, ctx_changed);
> + ope_prog_bayer2rgb(ctx, ctx_changed);
> + ope_prog_color_correct(ctx, ctx_changed);
> + ope_prog_rgb2yuv(ctx, ctx_changed);
> +
> + ctx->current_stripe = 0;
> + ope->hw_ctx = ctx;
> + ope_prog_stripe(ctx, &ctx->stripe[0]);
> +
> + ope_start(ope);
> +}
> +
> +static void ope_abort_job(void *priv)
> +{
> + struct ope_ctx *ctx = priv;
> +
> + dev_dbg(ctx->ope->dev, "job abort ctx=%p\n", ctx);
> + ope_write(ctx->ope, OPE_TOP_RESET_CMD, OPE_TOP_RESET_CMD_SW);
> +
> + /* Scheduler will wait for completion (on reset complete isr) */
> +}
> +
> +static void ope_try_schedule(struct ope_ctx *ctx)
> +{
> + camss_isp_sched_try_run(&ctx->ope->sched, &ctx->job);
> +}
> +
> +static const struct camss_isp_job_ops ope_job_ops = {
> + .ready = ope_job_ready,
> + .run = ope_run_job,
> + .abort = ope_abort_job,
> +};
> +
> +/* -------- Interrupt handlers -------- */
> +
> +static void ope_fe_irq(struct ope_dev *ope)
> +{
> + u32 status = readl_relaxed(ope->base_rd +
> OPE_BUS_RD_INPUT_IF_IRQ_STATUS);
> +
> + writel_relaxed(status, ope->base_rd + OPE_BUS_RD_INPUT_IF_IRQ_CLEAR);
> + writel_relaxed(OPE_BUS_RD_INPUT_IF_IRQ_CMD_CLEAR,
> + ope->base_rd + OPE_BUS_RD_INPUT_IF_IRQ_CMD);
> +}
> +
> +static void ope_we_irq(struct ope_dev *ope, struct ope_ctx *ctx)
> +{
> + u32 status;
> +
> + status = ope_read_wr(ope, OPE_BUS_WR_INPUT_IF_IRQ_STATUS_0);
> + ope_write_wr(ope, OPE_BUS_WR_INPUT_IF_IRQ_CLEAR_0, status);
> + ope_write_wr(ope, OPE_BUS_WR_INPUT_IF_IRQ_CMD,
> OPE_BUS_WR_INPUT_IF_IRQ_CMD_CLEAR);
> +
> + if (!ctx)
> + return;
> +
> + if (status & OPE_BUS_WR_INPUT_IF_IRQ_STATUS_0_CONS_VIOL) {
> + u32 viol = ope_read_wr(ope, OPE_BUS_WR_VIOLATION_STATUS);
> +
> + dev_err_ratelimited(ope->dev, "constraint violation
> (clients=0x%x)\n", viol);
> + ope_write(ctx->ope, OPE_TOP_RESET_CMD, OPE_TOP_RESET_CMD_SW);
> + }
> +
> + if (status & OPE_BUS_WR_INPUT_IF_IRQ_STATUS_0_IMG_SZ_VIOL) {
> + u32 viol = ope_read_wr(ope,
> OPE_BUS_WR_IMAGE_SIZE_VIOLATION_STATUS);
> + int i;
> +
> + for (i = 0; i < OPE_WR_CLIENT_MAX; i++) {
> + if (BIT(i) & viol)
> + dev_err_ratelimited(ope->dev, "WE%d: image size
> violation\n", i);
> + }
> + ope_write(ctx->ope, OPE_TOP_RESET_CMD, OPE_TOP_RESET_CMD_SW);
> + }
> +
> + if (status & OPE_BUS_WR_INPUT_IF_IRQ_STATUS_0_VIOL) {
> + u32 viol = ope_read_wr(ope, OPE_BUS_WR_VIOLATION_STATUS);
> +
> + dev_err_ratelimited(ope->dev, "fatal violation
> (status=0x%08x)\n", viol);
> + ope_write(ctx->ope, OPE_TOP_RESET_CMD, OPE_TOP_RESET_CMD_SW);
> + }
> +
> + if (status & OPE_BUS_WR_INPUT_IF_IRQ_STATUS_0_RUP_DONE) {
> + /* Register update done: program next stripe (double-buffered)
> */
> + struct ope_stripe *stripe = ope_current_stripe(ctx);
> +
> + if (stripe && !ope_stripe_is_last(stripe))
> + ope_prog_stripe(ctx, stripe + 1);
> + }
> +}
> +
> +
> +static void __ope_irq_init(struct ope_dev *ope)
> +{
> + ope_write(ope, OPE_TOP_IRQ_MASK,
> + OPE_TOP_IRQ_STATUS_RST_DONE |
> + OPE_TOP_IRQ_STATUS_WE |
> + OPE_TOP_IRQ_STATUS_VIOL |
> + OPE_TOP_IRQ_STATUS_IDLE);
> + ope_write_wr(ope, OPE_BUS_WR_INPUT_IF_IRQ_MASK_0,
> + OPE_BUS_WR_INPUT_IF_IRQ_STATUS_0_RUP_DONE |
> + OPE_BUS_WR_INPUT_IF_IRQ_STATUS_0_CONS_VIOL |
> + OPE_BUS_WR_INPUT_IF_IRQ_STATUS_0_VIOL |
> + OPE_BUS_WR_INPUT_IF_IRQ_STATUS_0_IMG_SZ_VIOL);
> +}
> +
> +static irqreturn_t ope_irq(int irq, void *dev_id)
> +{
> + struct ope_dev *ope = dev_id;
> + struct ope_ctx *ctx;
> + u32 status;
> +
> + /*
> + * hw_ctx is safe to read here without a lock: ope_run_job() sets it
> + * before calling ope_start(), so the hardware cannot raise an IRQ
> + * before hw_ctx is visible. ope_job_finish() clears it in IRQ context
> + * (i.e. here), so the clear is serialised with this read by the IRQ
> + * itself.
> + */
> + ctx = ope->hw_ctx;
> +
> + status = ope_read(ope, OPE_TOP_IRQ_STATUS);
> + ope_write(ope, OPE_TOP_IRQ_CLEAR, status);
> + ope_write(ope, OPE_TOP_IRQ_CMD, OPE_TOP_IRQ_CMD_CLEAR);
> +
> + if (status & OPE_TOP_IRQ_STATUS_RST_DONE) {
> + dev_dbg(ope->dev, "reset done ctx=%p\n", ctx);
> + if (ctx)
> + ope_job_finish(ctx, VB2_BUF_STATE_ERROR);
> + complete(&ope->reset_complete);
> + }
> +
> + if (status & OPE_TOP_IRQ_STATUS_VIOL)
> + dev_warn(ope->dev, "OPE violation: 0x%08x\n",
> + ope_read(ope, OPE_TOP_VIOLATION_STATUS));
> +
> + if (status & OPE_TOP_IRQ_STATUS_FE)
> + ope_fe_irq(ope);
> +
> + if (status & OPE_TOP_IRQ_STATUS_WE)
> + ope_we_irq(ope, ctx);
> +
> + if ((status & OPE_TOP_IRQ_STATUS_IDLE) && ctx) {
> + struct ope_stripe *stripe = ope_current_stripe(ctx);
> +
> + dev_dbg(ope->dev, "stripe %u done ctx=%p\n",
> ctx->current_stripe, ctx);
> +
> + if (ope_stripe_is_last(stripe)) {
> + ctx->current_stripe = 0;
> + ope_job_finish(ctx, VB2_BUF_STATE_DONE);
> + } else {
> + ctx->current_stripe++;
> + ope_start(ope);
> + }
> + }
> +
> + return IRQ_HANDLED;
> +}
> +
> +/* -------- vb2 queue private data -------- */
> +
> +struct ope_vq_priv {
> + struct ope_ctx *ctx;
> + unsigned int queue_idx;
> +};
> +
> +static inline struct ope_ctx *ope_ctx_from_vq(struct vb2_queue *q)
> +{
> + return ((struct ope_vq_priv *)vb2_get_drv_priv(q))->ctx;
> +}
> +
> +static inline unsigned int ope_idx_from_vq(struct vb2_queue *q)
> +{
> + return ((struct ope_vq_priv *)vb2_get_drv_priv(q))->queue_idx;
> +}
> +
> +/* -------- vb2 ops -------- */
> +
> +static int ope_queue_setup(struct vb2_queue *q, unsigned int *nbuffers,
> + unsigned int *nplanes, unsigned int sizes[],
> + struct device *alloc_devs[])
> +{
> + struct ope_ctx *ctx = ope_ctx_from_vq(q);
> + unsigned int idx = ope_idx_from_vq(q);
> + unsigned int size;
> +
> + if (idx == OPE_QUEUE_FRAME_IN)
> + size = ctx->fmt_in.sizeimage ? ctx->fmt_in.sizeimage :
> PAGE_SIZE;
> + else if (idx == OPE_QUEUE_DISP_OUT)
> + size = ctx->fmt_out.sizeimage ? ctx->fmt_out.sizeimage :
> PAGE_SIZE;
> + else
> + size = v4l2_isp_params_buffer_size(CAMSS_PARAMS_MAX_PAYLOAD);
> +
> + if (*nplanes) {
> + if (*nplanes != 1 || sizes[0] < size)
> + return -EINVAL;
> + } else {
> + *nplanes = 1;
> + sizes[0] = size;
> + }
> +
> + return 0;
> +}
> +
> +static int ope_buf_prepare(struct vb2_buffer *vb)
> +{
> + struct ope_ctx *ctx = ope_ctx_from_vq(vb->vb2_queue);
> + unsigned int idx = ope_idx_from_vq(vb->vb2_queue);
> + struct vb2_v4l2_buffer *vbuf = to_vb2_v4l2_buffer(vb);
> + unsigned int sizeimage;
> +
> + if (idx == OPE_QUEUE_FRAME_IN)
> + sizeimage = ctx->fmt_in.sizeimage;
> + else if (idx == OPE_QUEUE_DISP_OUT)
> + sizeimage = ctx->fmt_out.sizeimage;
> + else
> + sizeimage =
> v4l2_isp_params_buffer_size(CAMSS_PARAMS_MAX_PAYLOAD);
> +
> + if (V4L2_TYPE_IS_OUTPUT(vb->vb2_queue->type)) {
> + if (vb->vb2_queue->type == V4L2_BUF_TYPE_VIDEO_OUTPUT_MPLANE) {
> + if (vbuf->field == V4L2_FIELD_ANY)
> + vbuf->field = V4L2_FIELD_NONE;
> + if (vbuf->field != V4L2_FIELD_NONE)
> + return -EINVAL;
> + }
> + }
> +
> + if (vb2_plane_size(vb, 0) < sizeimage)
> + return -EINVAL;
> +
> + if (V4L2_TYPE_IS_CAPTURE(vb->vb2_queue->type))
> + vb2_set_plane_payload(vb, 0, sizeimage);
> +
> + return 0;
> +}
> +
> +static void ope_buf_queue(struct vb2_buffer *vb)
> +{
> + struct ope_ctx *ctx = ope_ctx_from_vq(vb->vb2_queue);
> + unsigned int idx = ope_idx_from_vq(vb->vb2_queue);
> + struct vb2_v4l2_buffer *vbuf = to_vb2_v4l2_buffer(vb);
> +
> + camss_isp_bufq_queue(ctx->bufq, idx, vbuf);
> + ope_try_schedule(ctx);
> +}
> +
> +/* -------- Power scaling -------- */
> +
> +static inline unsigned long ope_pixclk(const struct ope_fmt_state *fs,
> unsigned int fps)
> +{
> + return (unsigned long)fs->width * fs->height * fps;
> +}
> +
> +static inline unsigned int ope_load_avg(const struct ope_fmt_state *fs,
> unsigned int fps)
> +{
> + return mult_frac(ope_pixclk(fs, fps), fs->fmt->depth, 1000) / 8;
> +}
> +
> +static inline unsigned int ope_load_peak(const struct ope_fmt_state *fs,
> unsigned int fps)
> +{
> + return ope_load_avg(fs, fps) * 2;
> +}
> +
> +static inline unsigned int ope_load_config(const struct ope_fmt_state *fs,
> unsigned int fps)
> +{
> + unsigned int stripe_count = fs->width / OPE_STRIPE_MAX_W + 1;
> +
> + /* In worst case we have ~50 32-bit registers to write */
> + return mult_frac(stripe_count * 50 * 4, fps, 1000);
> +}
> +
> +static void ope_adjust_power(struct ope_dev *ope)
> +{
> + unsigned int loadavg = 0, loadpeak = 0, loadconfig = 0;
> + unsigned long pixclk = 0;
> + struct dev_pm_opp *opp;
> + struct ope_ctx *ctx;
> + int ret;
> +
> + list_for_each_entry(ctx, &ope->ctx_list, list) {
> + unsigned int fps;
> +
> + if (!ctx->started)
> + continue;
> +
> + fps = ctx->framerate ? ctx->framerate : DEFAULT_FRAMERATE;
> +
> + pixclk += ope_pixclk(&ctx->fmt_in, fps);
> + loadavg += ope_load_avg(&ctx->fmt_in, fps);
> + loadavg += ope_load_avg(&ctx->fmt_out, fps);
> + loadpeak += ope_load_peak(&ctx->fmt_in, fps);
> + loadpeak += ope_load_peak(&ctx->fmt_out, fps);
> + loadconfig += ope_load_config(&ctx->fmt_in, fps);
> + }
> +
> + /* 30% margin for overhead */
> + pixclk = mult_frac(pixclk, 13, 10);
> +
> + dev_dbg(ope->dev, "adjust power: clk=%luHz avg=%uKBps peak=%uKBps
> cfg=%uKBps\n",
> + pixclk, loadavg, loadpeak, loadconfig);
> +
> + opp = dev_pm_opp_find_freq_ceil(ope->dev, &pixclk);
> + if (IS_ERR(opp))
> + dev_warn(ope->dev, "Requested pixel clock %luHz exceeds
> hardware limit\n", pixclk);
> + else
> + dev_pm_opp_put(opp);
> +
> + ret = dev_pm_opp_set_rate(ope->dev, pixclk);
> + if (ret)
> + dev_warn(ope->dev, "Failed to set OPP rate: %d\n", ret);
> +
> + ret = icc_set_bw(ope->icc_data, loadavg, loadpeak);
> + if (ret)
> + dev_warn(ope->dev, "Failed to set data path BW: %d\n", ret);
> +
> + ret = icc_set_bw(ope->icc_config, loadconfig, loadconfig * 5);
> + if (ret)
> + dev_warn(ope->dev, "Failed to set config path BW: %d\n", ret);
> +}
> +
> +static int ope_start_streaming(struct vb2_queue *q, unsigned int count)
> +{
> + struct ope_ctx *ctx = ope_ctx_from_vq(q);
> + unsigned int idx = ope_idx_from_vq(q);
> + int ret;
> +
> + dev_dbg(ctx->ope->dev, "streaming start ctx=%p queue=%u\n", ctx, idx);
> +
> + ret = pm_runtime_resume_and_get(ctx->ope->dev);
> + if (ret) {
> + dev_err(ctx->ope->dev, "resume failed (%d)\n", ret);
> + return ret;
> + }
> +
> + if (idx == OPE_QUEUE_FRAME_IN) {
> + ctx->fmt_in.sequence = 0;
> + ctx->started = true;
> + ope_adjust_power(ctx->ope);
> + __ope_irq_init(ctx->ope);
> + }
> +
> + ope_try_schedule(ctx);
> +
> + return 0;
> +}
> +
> +static void ope_stop_streaming(struct vb2_queue *q)
> +{
> + struct ope_ctx *ctx = ope_ctx_from_vq(q);
> + unsigned int idx = ope_idx_from_vq(q);
> +
> + dev_dbg(ctx->ope->dev, "streaming stop ctx=%p queue=%u\n", ctx, idx);
> +
> + if (idx == OPE_QUEUE_FRAME_IN) {
> + ctx->started = false;
> + ope_adjust_power(ctx->ope);
> + }
> +
> + camss_isp_bufq_drain(ctx->bufq, idx, VB2_BUF_STATE_ERROR);
> +
> + pm_runtime_put(ctx->ope->dev);
> +}
> +
> +static const struct vb2_ops ope_vb2_ops = {
> + .queue_setup = ope_queue_setup,
> + .buf_prepare = ope_buf_prepare,
> + .buf_queue = ope_buf_queue,
> + .start_streaming = ope_start_streaming,
> + .stop_streaming = ope_stop_streaming,
> +};
> +
> +static int ope_init_vq(struct ope_ctx *ctx, unsigned int idx)
> +{
> + struct ope_dev *ope = ctx->ope;
> + struct ope_vq_priv *qpriv;
> + struct vb2_queue *q = &ctx->vqs[idx];
> +
> + qpriv = devm_kzalloc(ope->dev, sizeof(*qpriv), GFP_KERNEL);
> + if (!qpriv)
> + return -ENOMEM;
> +
> + qpriv->ctx = ctx;
> + qpriv->queue_idx = idx;
> + q->drv_priv = qpriv;
> + q->ops = &ope_vb2_ops;
> + q->lock = &ctx->vbq_lock;
> + q->dev = ope->dev;
> + q->buf_struct_size = sizeof(struct camss_isp_buf);
> + q->timestamp_flags = V4L2_BUF_FLAG_TIMESTAMP_COPY;
> +
> + if (idx == OPE_QUEUE_PARAMS) {
> + q->type = V4L2_BUF_TYPE_META_OUTPUT;
> + q->mem_ops = &vb2_vmalloc_memops;
> + q->io_modes = VB2_MMAP | VB2_USERPTR;
> + } else if (idx == OPE_QUEUE_DISP_OUT) {
> + q->type = V4L2_BUF_TYPE_VIDEO_CAPTURE_MPLANE;
> + q->mem_ops = &vb2_dma_contig_memops;
> + q->io_modes = VB2_MMAP | VB2_DMABUF;
> + } else {
> + q->type = V4L2_BUF_TYPE_VIDEO_OUTPUT_MPLANE;
> + q->mem_ops = &vb2_dma_contig_memops;
> + q->io_modes = VB2_MMAP | VB2_DMABUF;
> + }
> +
> + return vb2_queue_init(q);
> +}
> +
> +/* -------- Format helpers -------- */
> +
> +static const struct ope_fmt *ope_find_fmt(const struct ope_fmt *fmts,
> + unsigned int n, u32 fourcc)
> +{
> + unsigned int i;
> +
> + for (i = 0; i < n; i++)
> + if (fmts[i].fourcc == fourcc)
> + return &fmts[i];
> +
> + return NULL;
> +}
> +
> +static const struct ope_fmt *ope_fmt_try(struct ope_dev *ope, bool is_output,
> + struct v4l2_pix_format_mplane *pix)
> +{
> + const struct ope_fmt *fmts = is_output ? ope_output_fmts :
> ope_input_fmts;
> + unsigned int n = is_output ? ARRAY_SIZE(ope_output_fmts) :
> ARRAY_SIZE(ope_input_fmts);
> + unsigned int max_w = OPE_MAX_W, max_h = OPE_MAX_H;
> + const struct ope_fmt *fmt;
> + unsigned int bytesperline;
> +
> + fmt = ope_find_fmt(fmts, n, pix->pixelformat);
> + if (!fmt) {
> + fmt = &fmts[0];
> + pix->pixelformat = fmt->fourcc;
> + }
> +
> + v4l_bound_align_image(&pix->width, OPE_MIN_W, max_w, fmt->align,
> + &pix->height, OPE_MIN_H, max_h, 0, 0);
> +
> + pix->num_planes = 1;
> + pix->field = V4L2_FIELD_NONE;
> + if (!pix->colorspace)
> + pix->colorspace = is_output ? V4L2_COLORSPACE_SRGB :
> V4L2_COLORSPACE_RAW;
> + /*
> + * Output formats are semi-planar or grey-scale (Y plane only) always
> + * using 1 byte per Y value. pix->bytesperline stores the Y-plane bpl.
> + * Depth tracks the total storage size including the second combined
> + * Cb + Cr plane which directly follows the Y plane and is used for
> + * sizeimage calculations rather then for bytesperline.
> + */
> + if (is_output)
> + bytesperline = pix->width;
> + else
> + bytesperline = pix->width * fmt->depth / 8;
> +
> + if (pix->plane_fmt[0].bytesperline < bytesperline)
> + pix->plane_fmt[0].bytesperline = bytesperline;
> +
> +
> + if (is_output)
> + pix->plane_fmt[0].sizeimage =
> + (u64)pix->plane_fmt[0].bytesperline * pix->height *
> fmt->depth / 8;
> + else
> + pix->plane_fmt[0].sizeimage = pix->plane_fmt[0].bytesperline *
> pix->height;
> +
> + return fmt;
> +}
> +
> +/* -------- ioctl helpers/ops -------- */
> +
> +static inline unsigned int ope_queue_idx_from_file(struct file *file)
> +{
> + return (unsigned int)(uintptr_t)video_get_drvdata(video_devdata(file));
> +}
> +
> +static struct vb2_queue *ope_vq_from_file(struct file *file, struct ope_ctx
> *ctx)
> +{
> + unsigned int idx = ope_queue_idx_from_file(file);
> +
> + if (idx >= OPE_QUEUE_COUNT)
> + return NULL;
> +
> + return &ctx->vqs[idx];
> +}
> +
> +static struct ope_ctx *ope_ctx_from_file(struct file *file)
> +{
> + struct ope_dev *ope = container_of(video_devdata(file)->v4l2_dev,
> + struct ope_dev, v4l2_dev);
> +
> + return ope->shared_ctx;
> +}
> +
> +static int ope_querycap(struct file *file, void *priv, struct
> v4l2_capability *cap)
> +{
> + strscpy(cap->driver, OPE_NAME, sizeof(cap->driver));
> + strscpy(cap->card, "Qualcomm CAMSS OPE", sizeof(cap->card));
> +
> + return 0;
> +}
> +
> +static int ope_enum_fmt_vid_out(struct file *file, void *priv, struct
> v4l2_fmtdesc *f)
> +{
> + if (f->index >= ARRAY_SIZE(ope_input_fmts))
> + return -EINVAL;
> +
> + f->pixelformat = ope_input_fmts[f->index].fourcc;
> +
> + return 0;
> +}
> +
> +static int ope_g_fmt_vid_out(struct file *file, void *priv, struct
> v4l2_format *f)
> +{
> + struct ope_ctx *ctx = ope_ctx_from_file(file);
> + struct ope_fmt_state *fs = &ctx->fmt_in;
> + struct v4l2_pix_format_mplane *pix = &f->fmt.pix_mp;
> +
> + pix->pixelformat = fs->fmt->fourcc;
> + pix->width = fs->width;
> + pix->height = fs->height;
> + pix->num_planes = 1;
> + pix->field = V4L2_FIELD_NONE;
> + pix->colorspace = fs->colorspace;
> + pix->xfer_func = fs->xfer_func;
> + pix->ycbcr_enc = fs->ycbcr_enc;
> + pix->quantization = fs->quantization;
> + pix->plane_fmt[0].bytesperline = fs->bytesperline;
> + pix->plane_fmt[0].sizeimage = fs->sizeimage;
> +
> + return 0;
> +}
> +
> +static int ope_try_fmt_vid_out(struct file *file, void *priv, struct
> v4l2_format *f)
> +{
> + struct ope_ctx *ctx = ope_ctx_from_file(file);
> +
> + ope_fmt_try(ctx->ope, false, &f->fmt.pix_mp);
> +
> + return 0;
> +}
> +
> +static int ope_s_fmt_vid_out(struct file *file, void *priv, struct
> v4l2_format *f)
> +{
> + struct ope_ctx *ctx = ope_ctx_from_file(file);
> + const struct ope_fmt *fmt;
> +
> + if (vb2_is_busy(&ctx->vqs[OPE_QUEUE_FRAME_IN]))
> + return -EBUSY;
> +
> + fmt = ope_fmt_try(ctx->ope, false, &f->fmt.pix_mp);
> + ctx->fmt_in.fmt = fmt;
> + ctx->fmt_in.width = f->fmt.pix_mp.width;
> + ctx->fmt_in.height = f->fmt.pix_mp.height;
> + ctx->fmt_in.bytesperline = f->fmt.pix_mp.plane_fmt[0].bytesperline;
> + ctx->fmt_in.sizeimage = f->fmt.pix_mp.plane_fmt[0].sizeimage;
> + ctx->fmt_in.colorspace = f->fmt.pix_mp.colorspace;
> + ctx->fmt_in.xfer_func = f->fmt.pix_mp.xfer_func;
> + ctx->fmt_in.ycbcr_enc = f->fmt.pix_mp.ycbcr_enc;
> + ctx->fmt_in.quantization = f->fmt.pix_mp.quantization;
> +
> + /* Reset crop to full input frame */
> + ctx->fmt_in.crop.left = 0;
> + ctx->fmt_in.crop.top = 0;
> + ctx->fmt_in.crop.width = ctx->fmt_in.width;
> + ctx->fmt_in.crop.height = ctx->fmt_in.height;
> +
> + return 0;
> +}
> +
> +static int ope_enum_fmt_vid_cap(struct file *file, void *priv, struct
> v4l2_fmtdesc *f)
> +{
> + struct ope_ctx *ctx = ope_ctx_from_file(file);
> + unsigned int i, n = 0;
> +
> + for (i = 0; i < ARRAY_SIZE(ope_output_fmts); i++) {
> + if (ope_output_fmts[i].mbus_code != ctx->proc_mbus_code)
> + continue;
> + if (n++ == f->index) {
> + f->pixelformat = ope_output_fmts[i].fourcc;
> + return 0;
> + }
> + }
> +
> + return -EINVAL;
> +}
> +
> +static int ope_g_fmt_vid_cap(struct file *file, void *priv, struct
> v4l2_format *f)
> +{
> + struct ope_ctx *ctx = ope_ctx_from_file(file);
> + struct ope_fmt_state *fs = &ctx->fmt_out;
> + struct v4l2_pix_format_mplane *pix = &f->fmt.pix_mp;
> +
> + pix->pixelformat = fs->fmt->fourcc;
> + pix->width = fs->width;
> + pix->height = fs->height;
> + pix->num_planes = 1;
> + pix->field = V4L2_FIELD_NONE;
> + pix->colorspace = fs->colorspace;
> + pix->xfer_func = fs->xfer_func;
> + pix->ycbcr_enc = fs->ycbcr_enc;
> + pix->quantization = fs->quantization;
> + pix->plane_fmt[0].bytesperline = fs->bytesperline;
> + pix->plane_fmt[0].sizeimage = fs->sizeimage;
> +
> + return 0;
> +}
> +
> +static int ope_try_fmt_vid_cap(struct file *file, void *priv, struct
> v4l2_format *f)
> +{
> + struct ope_ctx *ctx = ope_ctx_from_file(file);
> +
> + ope_fmt_try(ctx->ope, true, &f->fmt.pix_mp);
> +
> + return 0;
> +}
> +
> +static int ope_s_fmt_vid_cap(struct file *file, void *priv, struct
> v4l2_format *f)
> +{
> + struct ope_ctx *ctx = ope_ctx_from_file(file);
> + const struct ope_fmt *fmt;
> +
> + if (vb2_is_busy(&ctx->vqs[OPE_QUEUE_DISP_OUT]))
> + return -EBUSY;
> +
> + fmt = ope_fmt_try(ctx->ope, true, &f->fmt.pix_mp);
> +
> + ctx->fmt_out.fmt = fmt;
> + ctx->fmt_out.width = f->fmt.pix_mp.width;
> + ctx->fmt_out.height = f->fmt.pix_mp.height;
> + ctx->fmt_out.bytesperline = f->fmt.pix_mp.plane_fmt[0].bytesperline;
> + ctx->fmt_out.sizeimage = f->fmt.pix_mp.plane_fmt[0].sizeimage;
> + ctx->fmt_out.colorspace = f->fmt.pix_mp.colorspace;
> + ctx->fmt_out.xfer_func = f->fmt.pix_mp.xfer_func;
> + ctx->fmt_out.ycbcr_enc = f->fmt.pix_mp.ycbcr_enc;
> + ctx->fmt_out.quantization = f->fmt.pix_mp.quantization;
> + /* Sync proc mbus code with selected pixel format */
> + ctx->proc_mbus_code = fmt->mbus_code;
> + /* Reset scaler output to full crop (no scaling) */
> + ctx->disp_compose.left = 0;
> + ctx->disp_compose.top = 0;
> + ctx->disp_compose.width = ctx->fmt_in.crop.width;
> + ctx->disp_compose.height = ctx->fmt_in.crop.height;
> +
> + return 0;
> +}
> +
> +/* Input crop */
> +static int ope_g_selection(struct file *file, void *priv,
> + struct v4l2_selection *s)
> +{
> + struct ope_ctx *ctx = ope_ctx_from_file(file);
> +
> + if (s->type != V4L2_BUF_TYPE_VIDEO_OUTPUT &&
> + s->type != V4L2_BUF_TYPE_VIDEO_OUTPUT_MPLANE)
> + return -EINVAL;
> +
> + switch (s->target) {
> + case V4L2_SEL_TGT_CROP:
> + s->r = ctx->fmt_in.crop;
> + return 0;
> + case V4L2_SEL_TGT_CROP_DEFAULT:
> + case V4L2_SEL_TGT_CROP_BOUNDS:
> + s->r.left = s->r.top = 0;
> + s->r.width = ctx->fmt_in.width;
> + s->r.height = ctx->fmt_in.height;
> + return 0;
> + }
> +
> + return -EINVAL;
> +}
> +
> +static int ope_s_selection(struct file *file, void *priv,
> + struct v4l2_selection *s)
> +{
> + struct ope_ctx *ctx = ope_ctx_from_file(file);
> +
> + if (s->type != V4L2_BUF_TYPE_VIDEO_OUTPUT && s->type !=
> V4L2_BUF_TYPE_VIDEO_OUTPUT_MPLANE)
> + return -EINVAL;
> + if (s->target != V4L2_SEL_TGT_CROP)
> + return -EINVAL;
> + if (vb2_is_busy(&ctx->vqs[OPE_QUEUE_FRAME_IN]))
> + return -EBUSY;
> +
> + s->r.left = clamp_t(int, s->r.left, 0, (int)ctx->fmt_in.width -
> OPE_MIN_W);
> + s->r.top = clamp_t(int, s->r.top, 0, (int)ctx->fmt_in.height -
> OPE_MIN_H);
> + s->r.width = clamp(s->r.width, (unsigned int)OPE_MIN_W,
> + ctx->fmt_in.width - (unsigned int)s->r.left);
> + s->r.height = clamp(s->r.height, (unsigned int)OPE_MIN_H,
> + ctx->fmt_in.height - (unsigned int)s->r.top);
> + ctx->fmt_in.crop = s->r;
> +
> + return 0;
> +}
> +
> +static int ope_g_fmt_meta(struct file *file, void *priv, struct v4l2_format
> *f)
> +{
> + f->fmt.meta.dataformat = V4L2_META_FMT_QCOM_ISP_PARAMS;
> + f->fmt.meta.buffersize =
> v4l2_isp_params_buffer_size(CAMSS_PARAMS_MAX_PAYLOAD);
> +
> + return 0;
> +}
> +
> +static int ope_enum_fmt_meta_out(struct file *file, void *priv, struct
> v4l2_fmtdesc *f)
> +{
> + if (f->index > 0)
> + return -EINVAL;
> +
> + f->pixelformat = V4L2_META_FMT_QCOM_ISP_PARAMS;
> +
> + return 0;
> +}
> +
> +static int ope_g_parm(struct file *file, void *priv, struct v4l2_streamparm
> *sp)
> +{
> + struct ope_ctx *ctx = ope_ctx_from_file(file);
> +
> + if (!V4L2_TYPE_IS_OUTPUT(sp->type))
> + return -EINVAL;
> +
> + sp->parm.output.capability = V4L2_CAP_TIMEPERFRAME;
> + sp->parm.output.timeperframe = ctx->fmt_in.timeperframe;
> +
> + return 0;
> +}
> +
> +static int ope_s_parm(struct file *file, void *priv, struct v4l2_streamparm
> *sp)
> +{
> + struct ope_ctx *ctx = ope_ctx_from_file(file);
> + struct v4l2_fract *tpf = &sp->parm.output.timeperframe;
> +
> + if (!V4L2_TYPE_IS_OUTPUT(sp->type))
> + return -EINVAL;
> +
> + if (vb2_is_busy(&ctx->vqs[OPE_QUEUE_FRAME_IN]))
> + return -EBUSY;
> +
> + if (!tpf->denominator)
> + tpf->denominator = 1;
> + if (!tpf->numerator)
> + tpf->numerator = 1;
> +
> + ctx->fmt_in.timeperframe = *tpf;
> + ctx->framerate = tpf->denominator / tpf->numerator;
> + sp->parm.output.capability = V4L2_CAP_TIMEPERFRAME;
> +
> + ope_adjust_power(ctx->ope);
> +
> + return 0;
> +}
> +
> +static int ope_enum_framesizes(struct file *file, void *priv, struct
> v4l2_frmsizeenum *fsize)
> +{
> + unsigned int idx = ope_queue_idx_from_file(file);
> + const struct ope_fmt *fmts;
> + unsigned int n;
> +
> + if (idx == OPE_QUEUE_FRAME_IN) {
> + fmts = ope_input_fmts;
> + n = ARRAY_SIZE(ope_input_fmts);
> + } else {
> + fmts = ope_output_fmts;
> + n = ARRAY_SIZE(ope_output_fmts);
> + }
> +
> + if (fsize->index > 0 || !ope_find_fmt(fmts, n, fsize->pixel_format))
> + return -EINVAL;
> +
> + fsize->type = V4L2_FRMSIZE_TYPE_STEPWISE;
> + fsize->stepwise.min_width = OPE_MIN_W;
> + fsize->stepwise.max_width = OPE_MAX_W;
> + fsize->stepwise.step_width = 1;
> + fsize->stepwise.min_height = OPE_MIN_H;
> + fsize->stepwise.max_height = OPE_MAX_H;
> + fsize->stepwise.step_height = 1;
> +
> + return 0;
> +}
> +
> +static int ope_reqbufs(struct file *file, void *priv, struct
> v4l2_requestbuffers *rb)
> +{
> + struct ope_ctx *ctx = ope_ctx_from_file(file);
> + struct vb2_queue *vq = ope_vq_from_file(file, ctx);
> + int ret;
> +
> + if (!vq)
> + return -EINVAL;
> +
> + if (vb2_queue_is_busy(vq, file))
> + return -EBUSY;
> +
> + ret = vb2_reqbufs(vq, rb);
> + if (!ret)
> + vq->owner = rb->count ? file->private_data : NULL;
> +
> + return ret;
> +}
> +
> +static int ope_querybuf(struct file *file, void *priv, struct v4l2_buffer
> *buf)
> +{
> + struct ope_ctx *ctx = ope_ctx_from_file(file);
> + struct vb2_queue *vq = ope_vq_from_file(file, ctx);
> +
> + return vq ? vb2_querybuf(vq, buf) : -EINVAL;
> +}
> +
> +static int ope_qbuf(struct file *file, void *priv, struct v4l2_buffer *buf)
> +{
> + struct ope_ctx *ctx = ope_ctx_from_file(file);
> + struct video_device *vdev = video_devdata(file);
> + struct vb2_queue *vq = ope_vq_from_file(file, ctx);
> +
> + if (!vq)
> + return -EINVAL;
> + if (vb2_queue_is_busy(vq, file))
> + return -EBUSY;
> +
> + return vb2_qbuf(vq, vdev->v4l2_dev->mdev, buf);
> +}
> +
> +static int ope_dqbuf(struct file *file, void *priv, struct v4l2_buffer *buf)
> +{
> + struct ope_ctx *ctx = ope_ctx_from_file(file);
> + struct vb2_queue *vq = ope_vq_from_file(file, ctx);
> +
> + return vq ? vb2_dqbuf(vq, buf, file->f_flags & O_NONBLOCK) : -EINVAL;
> +}
> +
> +static int ope_prepare_buf(struct file *file, void *priv, struct v4l2_buffer
> *buf)
> +{
> + struct ope_ctx *ctx = ope_ctx_from_file(file);
> + struct video_device *vdev = video_devdata(file);
> + struct vb2_queue *vq = ope_vq_from_file(file, ctx);
> +
> + if (!vq)
> + return -EINVAL;
> + if (vb2_queue_is_busy(vq, file))
> + return -EBUSY;
> +
> + return vb2_prepare_buf(vq, vdev->v4l2_dev->mdev, buf);
> +}
> +
> +static int ope_create_bufs(struct file *file, void *priv, struct
> v4l2_create_buffers *create)
> +{
> + struct ope_ctx *ctx = ope_ctx_from_file(file);
> + struct vb2_queue *vq = ope_vq_from_file(file, ctx);
> + int ret;
> +
> + if (!vq)
> + return -EINVAL;
> + if (vb2_queue_is_busy(vq, file))
> + return -EBUSY;
> + ret = vb2_create_bufs(vq, create);
> + if (!ret && create->count)
> + vq->owner = file->private_data;
> +
> + return ret;
> +}
> +
> +static int ope_expbuf(struct file *file, void *priv, struct
> v4l2_exportbuffer *eb)
> +{
> + struct ope_ctx *ctx = ope_ctx_from_file(file);
> + struct vb2_queue *vq = ope_vq_from_file(file, ctx);
> +
> + if (!vq)
> + return -EINVAL;
> + if (vb2_queue_is_busy(vq, file))
> + return -EBUSY;
> +
> + return vb2_expbuf(vq, eb);
> +}
> +
> +static int ope_streamon(struct file *file, void *priv, enum v4l2_buf_type
> type)
> +{
> + struct ope_ctx *ctx = ope_ctx_from_file(file);
> + struct vb2_queue *vq = ope_vq_from_file(file, ctx);
> +
> + if (!vq)
> + return -EINVAL;
> + if (vb2_queue_is_busy(vq, file))
> + return -EBUSY;
> +
> + return vb2_streamon(vq, type);
> +}
> +
> +static int ope_streamoff(struct file *file, void *priv, enum v4l2_buf_type
> type)
> +{
> + struct ope_ctx *ctx = ope_ctx_from_file(file);
> + struct vb2_queue *vq = ope_vq_from_file(file, ctx);
> +
> + if (!vq)
> + return -EINVAL;
> +
> + if (vb2_queue_is_busy(vq, file))
> + return -EBUSY;
> +
> + camss_isp_sched_cancel(&ctx->ope->sched, &ctx->job);
> +
> + return vb2_streamoff(vq, type);
> +}
> +
> +/* -------- proc subdev ops -------- */
> +
> +enum ope_proc_pad {
> + OPE_PROC_PAD_SINK_IN,
> + OPE_PROC_PAD_SINK_PAR,
> + OPE_PROC_PAD_SOURCE,
> + OPE_PROC_PADS_NUM,
> +};
> +
> +static int ope_proc_get_fmt(struct v4l2_subdev *sd, struct v4l2_subdev_state
> *state,
> + struct v4l2_subdev_format *fmt)
> +{
> + struct ope_ctx *ctx = container_of(sd->v4l2_dev, struct ope_dev,
> v4l2_dev)->shared_ctx;
> +
> + if (fmt->pad == OPE_PROC_PAD_SOURCE) {
> + /* Source carries the scaler output (compose) size */
> + fmt->format.width = ctx ? ctx->disp_compose.width : 640;
> + fmt->format.height = ctx ? ctx->disp_compose.height : 480;
> + fmt->format.code = ctx ? ctx->proc_mbus_code :
> MEDIA_BUS_FMT_YUYV8_1_5X8;
> + fmt->format.field = V4L2_FIELD_NONE;
> + fmt->format.colorspace = V4L2_COLORSPACE_SRGB;
> + } else if (fmt->pad == OPE_PROC_PAD_SINK_IN) {
> + fmt->format.width = ctx ? ctx->fmt_in.width : 640;
> + fmt->format.height = ctx ? ctx->fmt_in.height : 480;
> + fmt->format.code = ctx ? ctx->fmt_in.fmt->mbus_code :
> MEDIA_BUS_FMT_SRGGB8_1X8;
> + fmt->format.field = V4L2_FIELD_NONE;
> + fmt->format.colorspace = V4L2_COLORSPACE_RAW;
> + } else { /* Params sink (pad1): no image format */
> + fmt->format.width = 0;
> + fmt->format.height = 0;
> + fmt->format.code = MEDIA_BUS_FMT_FIXED;
> + fmt->format.field = V4L2_FIELD_NONE;
> + fmt->format.colorspace = V4L2_COLORSPACE_DEFAULT;
> + }
> +
> + return 0;
> +}
> +
> +static int ope_proc_set_fmt(struct v4l2_subdev *sd, struct v4l2_subdev_state
> *state,
> + struct v4l2_subdev_format *fmt)
> +{
> + struct ope_dev *ope = container_of(sd->v4l2_dev, struct ope_dev,
> v4l2_dev);
> + struct ope_ctx *ctx = ope->shared_ctx;
> +
> + if (!ctx && fmt->which == V4L2_SUBDEV_FORMAT_ACTIVE)
> + return -ENODEV;
> +
> + if (fmt->pad != OPE_PROC_PAD_SOURCE)
> + return ope_proc_get_fmt(sd, state, fmt);
> +
> + unsigned int i;
> +
> + /* Validate mbus code: must match one of the output formats */
> + for (i = 0; i < ARRAY_SIZE(ope_output_fmts); i++)
> + if (ope_output_fmts[i].mbus_code == fmt->format.code)
> + goto valid;
> +
> + fmt->format.code = MEDIA_BUS_FMT_YUYV8_1_5X8;
> +
> +valid:
> + /* Clamp to crop size: OPE can only downscale */
> + fmt->format.width = clamp(fmt->format.width, (unsigned int)OPE_MIN_W,
> + ctx->fmt_in.crop.width);
> + fmt->format.height = clamp(fmt->format.height, (unsigned int)OPE_MIN_H,
> + ctx->fmt_in.crop.height);
> + fmt->format.field = V4L2_FIELD_NONE;
> + fmt->format.colorspace = V4L2_COLORSPACE_SRGB;
> +
> + if (fmt->which == V4L2_SUBDEV_FORMAT_ACTIVE) {
> + ctx->proc_mbus_code = fmt->format.code;
> + ctx->disp_compose.width = fmt->format.width;
> + ctx->disp_compose.height = fmt->format.height;
> + /* Clip compose origin if it no longer fits */
> + ctx->disp_compose.left = clamp_t(int, ctx->disp_compose.left, 0,
> + (int)ctx->fmt_out.width -
> + (int)ctx->disp_compose.width);
> + ctx->disp_compose.top = clamp_t(int, ctx->disp_compose.top, 0,
> + (int)ctx->fmt_out.height -
> + (int)ctx->disp_compose.height);
> + }
> +
> + return 0;
> +}
> +
> +static int ope_proc_enum_mbus_code(struct v4l2_subdev *sd,
> + struct v4l2_subdev_state *state,
> + struct v4l2_subdev_mbus_code_enum *code)
> +{
> + unsigned int n = 0;
> +
> + if (code->pad != OPE_PROC_PAD_SOURCE)
> + return -EINVAL;
> +
> + /* Enumerate unique mbus codes from output format table */
> + for (unsigned int i = 0; i < ARRAY_SIZE(ope_output_fmts); i++) {
> + u32 mc = ope_output_fmts[i].mbus_code;
> + bool seen = false;
> +
> + for (unsigned int j = 0; j < i; j++) {
> + if (ope_output_fmts[j].mbus_code == mc) {
> + seen = true;
> + break;
> + }
> + }
> +
> + if (seen)
> + continue;
> +
> + if (n++ == code->index) {
> + code->code = mc;
> + return 0;
> + }
> + }
> +
> + return -EINVAL;
> +}
> +
> +static int ope_proc_init_state(struct v4l2_subdev *sd, struct
> v4l2_subdev_state *state)
> +{
> + unsigned int pad;
> +
> + for (pad = 0; pad < OPE_PROC_PADS_NUM; pad++) {
> + struct v4l2_subdev_format fmt = {
> + .which = V4L2_SUBDEV_FORMAT_TRY,
> + .pad = pad
> + };
> + int ret;
> +
> + ret = ope_proc_get_fmt(sd, state, &fmt);
> + if (ret)
> + return ret;
> +
> + *v4l2_subdev_state_get_format(state, pad) = fmt.format;
> + }
> +
> + return 0;
> +}
> +
> +static int ope_proc_get_selection(struct v4l2_subdev *sd,
> + struct v4l2_subdev_state *state,
> + struct v4l2_subdev_selection *sel)
> +{
> + struct ope_ctx *ctx = container_of(sd->v4l2_dev, struct ope_dev,
> v4l2_dev)->shared_ctx;
> +
> + if (sel->pad != OPE_PROC_PAD_SINK_IN)
> + goto source;
> +
> + switch (sel->target) {
> + case V4L2_SEL_TGT_CROP:
> + sel->r = ctx ? ctx->fmt_in.crop : (struct v4l2_rect){ 0, 0,
> 640, 480 };
> + return 0;
> + case V4L2_SEL_TGT_CROP_DEFAULT:
> + case V4L2_SEL_TGT_CROP_BOUNDS:
> + sel->r.left = sel->r.top = 0;
> + sel->r.width = ctx ? ctx->fmt_in.width : 640;
> + sel->r.height = ctx ? ctx->fmt_in.height : 480;
> + return 0;
> + }
> +
> +source:
> + if (sel->pad != OPE_PROC_PAD_SOURCE)
> + return -EINVAL;
> +
> + switch (sel->target) {
> + case V4L2_SEL_TGT_COMPOSE:
> + sel->r = ctx ? ctx->disp_compose : (struct v4l2_rect){ 0, 0,
> 640, 480 };
> + return 0;
> + case V4L2_SEL_TGT_COMPOSE_DEFAULT:
> + sel->r.left = sel->r.top = 0;
> + sel->r.width = ctx ? ctx->fmt_in.crop.width : 640;
> + sel->r.height = ctx ? ctx->fmt_in.crop.height : 480;
> + return 0;
> + case V4L2_SEL_TGT_COMPOSE_BOUNDS:
> + sel->r.left = sel->r.top = 0;
> + sel->r.width = ctx ? ctx->fmt_out.width : 640;
> + sel->r.height = ctx ? ctx->fmt_out.height : 480;
> + return 0;
> + }
> +
> + return -EINVAL;
> +}
> +
> +static int ope_proc_set_selection(struct v4l2_subdev *sd,
> + struct v4l2_subdev_state *state,
> + struct v4l2_subdev_selection *sel)
> +{
> + struct ope_dev *ope = container_of(sd->v4l2_dev, struct ope_dev,
> v4l2_dev);
> + struct ope_ctx *ctx = ope->shared_ctx;
> +
> + if (!ctx)
> + return -ENODEV;
> +
> + if (sel->pad == OPE_PROC_PAD_SINK_IN) {
> + if (sel->target != V4L2_SEL_TGT_CROP)
> + return -EINVAL;
> + sel->r.left = clamp_t(int, sel->r.left, 0,
> + (int)ctx->fmt_in.width - OPE_MIN_W);
> + sel->r.top = clamp_t(int, sel->r.top, 0,
> + (int)ctx->fmt_in.height - OPE_MIN_H);
> + sel->r.width = clamp(sel->r.width, (unsigned int)OPE_MIN_W,
> + ctx->fmt_in.width - (unsigned
> int)sel->r.left);
> + sel->r.height = clamp(sel->r.height, (unsigned int)OPE_MIN_H,
> + ctx->fmt_in.height - (unsigned
> int)sel->r.top);
> + if (sel->which == V4L2_SUBDEV_FORMAT_ACTIVE)
> + ctx->fmt_in.crop = sel->r;
> + return 0;
> + }
> +
> + if (sel->pad == OPE_PROC_PAD_SOURCE) {
> + if (sel->target != V4L2_SEL_TGT_COMPOSE)
> + return -EINVAL;
> + /* Scale size: clamp to crop (downscale only) */
> + sel->r.width = clamp(sel->r.width, (unsigned int)OPE_MIN_W,
> + ctx->fmt_in.crop.width);
> + sel->r.height = clamp(sel->r.height, (unsigned int)OPE_MIN_H,
> + ctx->fmt_in.crop.height);
> + /* Placement: clamp origin to output buffer bounds */
> + sel->r.left = clamp_t(int, sel->r.left, 0,
> + (int)ctx->fmt_out.width -
> (int)sel->r.width);
> + sel->r.top = clamp_t(int, sel->r.top, 0,
> + (int)ctx->fmt_out.height -
> (int)sel->r.height);
> + if (sel->which == V4L2_SUBDEV_FORMAT_ACTIVE)
> + ctx->disp_compose = sel->r;
> + return 0;
> + }
> +
> + return -EINVAL;
> +}
> +
> +static const struct v4l2_subdev_pad_ops ope_proc_pad_ops = {
> + .get_fmt = ope_proc_get_fmt,
> + .set_fmt = ope_proc_set_fmt,
> + .enum_mbus_code = ope_proc_enum_mbus_code,
> + .get_selection = ope_proc_get_selection,
> + .set_selection = ope_proc_set_selection,
> +};
> +
> +static const struct v4l2_subdev_internal_ops ope_proc_internal_ops = {
> + .init_state = ope_proc_init_state,
> +};
> +
> +static const struct v4l2_subdev_ops ope_proc_ops = {
> + .pad = &ope_proc_pad_ops,
> +};
> +
> +/* -------- disp subdev ops -------- */
> +
> +enum ope_disp_pad {
> + OPE_DISP_PAD_SINK,
> + OPE_DISP_PAD_SOURCE,
> + OPE_DISP_PADS_NUM,
> +};
> +
> +static int ope_disp_get_fmt(struct v4l2_subdev *sd,
> + struct v4l2_subdev_state *state,
> + struct v4l2_subdev_format *fmt)
> +{
> + struct ope_ctx *ctx = container_of(sd->v4l2_dev,
> + struct ope_dev,
> v4l2_dev)->shared_ctx;
> + u32 mbus = ctx ? ctx->proc_mbus_code : MEDIA_BUS_FMT_YUYV8_1_5X8;
> +
> + /* Both sink and source reflect the scaler output (compose) size */
> + fmt->format.width = ctx ? ctx->disp_compose.width : 640;
> + fmt->format.height = ctx ? ctx->disp_compose.height : 480;
> + fmt->format.code = mbus;
> + fmt->format.field = V4L2_FIELD_NONE;
> + fmt->format.colorspace = (fmt->pad == OPE_DISP_PAD_SOURCE && ctx)
> + ? ctx->fmt_out.colorspace :
> V4L2_COLORSPACE_SRGB;
> +
> + return 0;
> +}
> +
> +static int ope_disp_set_fmt(struct v4l2_subdev *sd,
> + struct v4l2_subdev_state *state,
> + struct v4l2_subdev_format *fmt)
> +{
> + struct ope_dev *ope = container_of(sd->v4l2_dev, struct ope_dev,
> v4l2_dev);
> + struct ope_ctx *ctx = ope->shared_ctx;
> +
> + if (!ctx)
> + return -ENODEV;
> +
> + /* Both pads are read-only: size is set via ope_proc source S_FMT */
> + return ope_disp_get_fmt(sd, state, fmt);
> +}
> +
> +
> +static int ope_disp_init_state(struct v4l2_subdev *sd,
> + struct v4l2_subdev_state *state)
> +{
> + unsigned int pad;
> +
> + for (pad = 0; pad < OPE_DISP_PADS_NUM; pad++) {
> + struct v4l2_subdev_format fmt = {
> + .which = V4L2_SUBDEV_FORMAT_TRY,
> + fmt.pad = pad
> + };
> + int ret;
> +
> + ret = ope_disp_get_fmt(sd, state, &fmt);
> + if (ret)
> + return ret;
> +
> + *v4l2_subdev_state_get_format(state, pad) = fmt.format;
> + }
> +
> + return 0;
> +}
> +
> +static const struct v4l2_subdev_pad_ops ope_disp_pad_ops = {
> + .get_fmt = ope_disp_get_fmt,
> + .set_fmt = ope_disp_set_fmt,
> +};
> +
> +static const struct v4l2_subdev_internal_ops ope_disp_internal_ops = {
> + .init_state = ope_disp_init_state,
> +};
> +
> +static const struct v4l2_subdev_ops ope_disp_ops = {
> + .pad = &ope_disp_pad_ops,
> +};
> +
> +/* -------- disp-output video ops -------- */
> +
> +static int ope_disp_output_link_validate(struct media_link *link)
> +{
> + struct video_device *vdev =
> media_entity_to_video_device(link->sink->entity);
> + struct v4l2_subdev *sd =
> media_entity_to_v4l2_subdev(link->source->entity);
> + struct ope_ctx *ctx = video_get_drvdata(vdev);
> + struct v4l2_subdev_format sd_fmt = {
> + .which = V4L2_SUBDEV_FORMAT_ACTIVE,
> + .pad = link->source->index,
> + };
> + int ret;
> +
> + if (!ctx)
> + return 0;
> +
> + ret = v4l2_subdev_call(sd, pad, get_fmt, NULL, &sd_fmt);
> + if (ret)
> + return ret;
> +
> + if (ctx->fmt_out.fmt->mbus_code != sd_fmt.format.code) {
> + dev_dbg(ctx->ope->dev, "link validate: mbus 0x%04x incompatible
> with %.4s\n",
> + sd_fmt.format.code, (char *)&ctx->fmt_out.fmt->fourcc);
> + return -EPIPE;
> + }
> +
> + return 0;
> +}
> +
> +static const struct media_entity_operations ope_disp_output_entity_ops = {
> + .link_validate = ope_disp_output_link_validate,
> +};
> +
> +static const struct v4l2_ioctl_ops ope_video_ioctl_ops = {
> + .vidioc_querycap = ope_querycap,
> + .vidioc_enum_fmt_vid_out = ope_enum_fmt_vid_out,
> + .vidioc_g_fmt_vid_out_mplane = ope_g_fmt_vid_out,
> + .vidioc_try_fmt_vid_out_mplane = ope_try_fmt_vid_out,
> + .vidioc_s_fmt_vid_out_mplane = ope_s_fmt_vid_out,
> + .vidioc_enum_fmt_vid_cap = ope_enum_fmt_vid_cap,
> + .vidioc_g_fmt_vid_cap_mplane = ope_g_fmt_vid_cap,
> + .vidioc_try_fmt_vid_cap_mplane = ope_try_fmt_vid_cap,
> + .vidioc_s_fmt_vid_cap_mplane = ope_s_fmt_vid_cap,
> + .vidioc_g_selection = ope_g_selection,
> + .vidioc_s_selection = ope_s_selection,
> + .vidioc_enum_framesizes = ope_enum_framesizes,
> + .vidioc_g_parm = ope_g_parm,
> + .vidioc_s_parm = ope_s_parm,
> + .vidioc_reqbufs = ope_reqbufs,
> + .vidioc_querybuf = ope_querybuf,
> + .vidioc_qbuf = ope_qbuf,
> + .vidioc_dqbuf = ope_dqbuf,
> + .vidioc_prepare_buf = ope_prepare_buf,
> + .vidioc_create_bufs = ope_create_bufs,
> + .vidioc_expbuf = ope_expbuf,
> + .vidioc_streamon = ope_streamon,
> + .vidioc_streamoff = ope_streamoff,
> + .vidioc_subscribe_event = v4l2_ctrl_subscribe_event,
> + .vidioc_unsubscribe_event = v4l2_event_unsubscribe,
> +};
> +
> +static const struct v4l2_ioctl_ops ope_meta_ioctl_ops = {
> + .vidioc_querycap = ope_querycap,
> + .vidioc_enum_fmt_meta_out = ope_enum_fmt_meta_out,
> + .vidioc_g_fmt_meta_out = ope_g_fmt_meta,
> + .vidioc_s_fmt_meta_out = ope_g_fmt_meta,
> + .vidioc_try_fmt_meta_out = ope_g_fmt_meta,
> + .vidioc_reqbufs = ope_reqbufs,
> + .vidioc_querybuf = ope_querybuf,
> + .vidioc_qbuf = ope_qbuf,
> + .vidioc_dqbuf = ope_dqbuf,
> + .vidioc_prepare_buf = ope_prepare_buf,
> + .vidioc_create_bufs = ope_create_bufs,
> + .vidioc_expbuf = ope_expbuf,
> + .vidioc_streamon = ope_streamon,
> + .vidioc_streamoff = ope_streamoff,
> +};
> +
> +/* -------- File/Context ops -------- */
> +
> +static struct ope_ctx *ope_ctx_create(struct ope_dev *ope)
> +{
> + struct ope_ctx *ctx;
> + unsigned int i;
> + int ret;
> +
> + ctx = kvzalloc(sizeof(*ctx), GFP_KERNEL);
> + if (!ctx)
> + return ERR_PTR(-ENOMEM);
> +
> + ctx->ope = ope;
> +
> + mutex_init(&ctx->vbq_lock);
> +
> + ctx->bufq = camss_isp_bufq_init(OPE_QUEUE_COUNT);
> + if (IS_ERR(ctx->bufq)) {
> + ret = PTR_ERR(ctx->bufq);
> + kvfree(ctx);
> + return ERR_PTR(ret);
> + }
> +
> + camss_isp_job_init(&ctx->job, &ope_job_ops, ctx);
> + ctx->config = ope_default_config;
> +
> + /* Default input format */
> + ctx->fmt_in.fmt = &ope_input_fmts[0];
> + ctx->fmt_in.width = OPE_MIN_W;
> + ctx->fmt_in.height = OPE_MIN_H;
> + ctx->fmt_in.bytesperline = OPE_MIN_W * ope_input_fmts[0].depth / 8;
> + ctx->fmt_in.sizeimage = ctx->fmt_in.bytesperline * OPE_MIN_H;
> + ctx->fmt_in.colorspace = V4L2_COLORSPACE_RAW;
> + ctx->fmt_in.crop = (struct v4l2_rect){ 0, 0, OPE_MIN_W,
> OPE_MIN_H };
> + ctx->fmt_in.timeperframe.numerator = 1;
> + ctx->fmt_in.timeperframe.denominator = DEFAULT_FRAMERATE;
> +
> + /* Default output format */
> + ctx->fmt_out.fmt = &ope_output_fmts[0];
> + ctx->fmt_out.width = OPE_MIN_W;
> + ctx->fmt_out.height = OPE_MIN_H;
> + ctx->fmt_out.bytesperline = OPE_MIN_W;
> + ctx->fmt_out.sizeimage = (u64)ope_output_fmts[0].depth * OPE_MIN_W *
> OPE_MIN_H / 8;
> + ctx->fmt_out.colorspace = V4L2_COLORSPACE_SRGB;
> + ctx->disp_compose = (struct v4l2_rect){ 0, 0, OPE_MIN_W,
> OPE_MIN_H };
> + ctx->proc_mbus_code = ope_output_fmts[0].mbus_code;
> +
> + for (i = 0; i < OPE_QUEUE_COUNT; i++) {
> + ret = ope_init_vq(ctx, i);
> + if (ret) {
> + while (i--)
> + vb2_queue_release(&ctx->vqs[i]);
> + camss_isp_bufq_release(ctx->bufq);
> + kvfree(ctx);
> + return ERR_PTR(ret);
> + }
> + }
> +
> + INIT_LIST_HEAD(&ctx->list);
> + list_add(&ctx->list, &ope->ctx_list);
> + ope->shared_ctx = ctx;
> +
> + return ctx;
> +}
> +
> +static void ope_ctx_destroy(struct ope_ctx *ctx)
> +{
> + struct ope_dev *ope = ctx->ope;
> + unsigned int i;
> +
> + list_del(&ctx->list);
> + camss_isp_sched_cancel(&ope->sched, &ctx->job);
> + for (i = 0; i < OPE_QUEUE_COUNT; i++)
> + vb2_queue_release(&ctx->vqs[i]);
> + camss_isp_bufq_release(ctx->bufq);
> + mutex_destroy(&ctx->vbq_lock);
> + kvfree(ctx);
> +}
> +
> +static int ope_open(struct file *file)
> +{
> + struct video_device *vdev = video_devdata(file);
> + struct ope_dev *ope = container_of(vdev->v4l2_dev, struct ope_dev,
> v4l2_dev);
> + struct ope_ctx *ctx;
> + struct v4l2_fh *fh;
> + int ret = 0;
> +
> + fh = kzalloc(sizeof(*fh), GFP_KERNEL);
> + if (!fh)
> + return -ENOMEM;
> +
> + if (mutex_lock_interruptible(&ope->mutex)) {
> + kfree(fh);
> + return -ERESTARTSYS;
> + }
> +
> + /*
> + * For now, only a single shared context is supported,
> + * until media multi-context support is available.
> + */
> + if (!ope->shared_ctx) {
> + ctx = ope_ctx_create(ope);
> + if (IS_ERR(ctx)) {
> + ret = PTR_ERR(ctx);
> + goto unlock;
> + }
> + } else {
> + ctx = ope->shared_ctx;
> + }
> +
> + v4l2_fh_init(fh, vdev);
> + v4l2_fh_add(fh, file);
> + ope->open_count++;
> +
> +unlock:
> + if (ret)
> + kfree(fh);
> + mutex_unlock(&ope->mutex);
> + return ret;
> +}
> +
> +static int ope_release(struct file *file)
> +{
> + struct v4l2_fh *fh = file_to_v4l2_fh(file);
> + struct ope_ctx *ctx = ope_ctx_from_file(file);
> + struct ope_dev *ope = ctx->ope;
> +
> + v4l2_fh_del(fh, file);
> + v4l2_fh_exit(fh);
> + kfree(fh);
> +
> + mutex_lock(&ope->mutex);
> +
> + if (--ope->open_count == 0) {
> + ope->shared_ctx = NULL;
> + mutex_unlock(&ope->mutex);
> + ope_ctx_destroy(ctx);
> + } else {
> + mutex_unlock(&ope->mutex);
> + }
> +
> + return 0;
> +}
> +
> +static __poll_t ope_poll(struct file *file, poll_table *wait)
> +{
> + unsigned int idx = ope_queue_idx_from_file(file);
> + struct ope_ctx *ctx = ope_ctx_from_file(file);
> + struct v4l2_fh *fh = file_to_v4l2_fh(file);
> + struct vb2_queue *vq;
> + unsigned long flags;
> + __poll_t rc = 0;
> +
> + if (idx >= OPE_QUEUE_COUNT)
> + return EPOLLERR;
> +
> + vq = &ctx->vqs[idx];
> +
> + poll_wait(file, &vq->done_wq, wait);
> + poll_wait(file, &fh->wait, wait);
> +
> + spin_lock_irqsave(&vq->done_lock, flags);
> + if (!list_empty(&vq->done_list)) {
> + if (V4L2_TYPE_IS_OUTPUT(vq->type))
> + rc |= EPOLLOUT | EPOLLWRNORM;
> + else
> + rc |= EPOLLIN | EPOLLRDNORM;
> + }
> + spin_unlock_irqrestore(&vq->done_lock, flags);
> +
> + if (v4l2_event_pending(fh))
> + rc |= EPOLLPRI;
> +
> + return rc;
> +}
> +
> +static int ope_mmap(struct file *file, struct vm_area_struct *vma)
> +{
> + unsigned int idx = ope_queue_idx_from_file(file);
> + struct ope_ctx *ctx = ope_ctx_from_file(file);
> +
> + if (idx >= OPE_QUEUE_COUNT)
> + return -EINVAL;
> +
> + return vb2_mmap(&ctx->vqs[idx], vma);
> +}
> +
> +static const struct v4l2_file_operations ope_fops = {
> + .owner = THIS_MODULE,
> + .open = ope_open,
> + .release = ope_release,
> + .poll = ope_poll,
> + .unlocked_ioctl = video_ioctl2,
> + .mmap = ope_mmap,
> +};
> +
> +/* Pipeline descriptor */
> +static const struct camss_isp_entity_desc ope_entity_descs[] = {
> + [OPE_ENTITY_FRAME_IN] = {
> + .name = "ope_input",
> + .obj_type = MEDIA_ENTITY_TYPE_VIDEO_DEVICE,
> + .function = MEDIA_ENT_F_IO_V4L,
> + .vdev.caps = V4L2_CAP_VIDEO_OUTPUT_MPLANE | V4L2_CAP_STREAMING,
> + .vdev.drvdata = (void *)(uintptr_t)OPE_QUEUE_FRAME_IN,
> + .vdev.fops = &ope_fops,
> + .vdev.ioctl_ops = &ope_video_ioctl_ops,
> + .pads = (const struct camss_isp_pad_desc[]) {
> + { MEDIA_PAD_FL_SOURCE, OPE_ENTITY_PROC,
> OPE_PROC_PAD_SINK_IN, 0 },
> + { }
> + },
> + },
> + [OPE_ENTITY_DISP_OUT] = {
> + .name = "ope_disp_output",
> + .obj_type = MEDIA_ENTITY_TYPE_VIDEO_DEVICE,
> + .function = MEDIA_ENT_F_IO_V4L,
> + .vdev.caps = V4L2_CAP_VIDEO_CAPTURE_MPLANE | V4L2_CAP_STREAMING,
> + .vdev.drvdata = (void *)(uintptr_t)OPE_QUEUE_DISP_OUT,
> + .vdev.fops = &ope_fops,
> + .vdev.ioctl_ops = &ope_video_ioctl_ops,
> + .vdev.entity_ops = &ope_disp_output_entity_ops,
> + .pads = (const struct camss_isp_pad_desc[]) {
> + { MEDIA_PAD_FL_SINK, OPE_ENTITY_DISP,
> OPE_DISP_PAD_SOURCE, 0 },
> + { }
> + },
> + },
> + [OPE_ENTITY_PARAMS] = {
> + .name = "ope_params",
> + .obj_type = MEDIA_ENTITY_TYPE_VIDEO_DEVICE,
> + .function = MEDIA_ENT_F_IO_V4L,
> + .vdev.caps = V4L2_CAP_META_OUTPUT | V4L2_CAP_STREAMING,
> + .vdev.drvdata = (void *)(uintptr_t)OPE_QUEUE_PARAMS,
> + .vdev.fops = &ope_fops,
> + .vdev.ioctl_ops = &ope_meta_ioctl_ops,
> + .pads = (const struct camss_isp_pad_desc[]) {
> + { MEDIA_PAD_FL_SOURCE, OPE_ENTITY_PROC,
> OPE_PROC_PAD_SINK_PAR, 0 },
> + { }
> + },
> + },
> + [OPE_ENTITY_PROC] = {
> + .name = "ope_proc",
> + .obj_type = MEDIA_ENTITY_TYPE_V4L2_SUBDEV,
> + .function = MEDIA_ENT_F_PROC_VIDEO_ISP,
> + .subdev.ops = &ope_proc_ops,
> + .subdev.internal_ops = &ope_proc_internal_ops,
> + .pads = (const struct camss_isp_pad_desc[]) {
> + { MEDIA_PAD_FL_SINK, OPE_ENTITY_FRAME_IN, 0, 0 },
> + { MEDIA_PAD_FL_SINK, OPE_ENTITY_PARAMS, 0, 0 },
> + { MEDIA_PAD_FL_SOURCE, OPE_ENTITY_DISP,
> OPE_DISP_PAD_SINK, MEDIA_LNK_FL_ENABLED },
> + { }
> + },
> + },
> + [OPE_ENTITY_DISP] = {
> + .name = "ope_disp",
> + .obj_type = MEDIA_ENTITY_TYPE_V4L2_SUBDEV,
> + .function = MEDIA_ENT_F_PROC_VIDEO_SCALER,
> + .subdev.ops = &ope_disp_ops,
> + .subdev.internal_ops = &ope_disp_internal_ops,
> + .pads = (const struct camss_isp_pad_desc[]) {
> + { MEDIA_PAD_FL_SINK, OPE_ENTITY_PROC,
> OPE_PROC_PAD_SOURCE, 0 },
> + { MEDIA_PAD_FL_SOURCE, OPE_ENTITY_DISP_OUT, 0, 0 },
> + { }
> + },
> + },
> +};
> +
> +static int ope_v4l2_init(struct ope_dev *ope)
> +{
> + int ret;
> +
> + mutex_init(&ope->mutex);
> + INIT_LIST_HEAD(&ope->ctx_list);
> + camss_isp_sched_init(&ope->sched);
> +
> + ope->mdev.dev = ope->dev;
> + strscpy(ope->mdev.model, OPE_NAME, sizeof(ope->mdev.model));
> + media_device_init(&ope->mdev);
> + ope->v4l2_dev.mdev = &ope->mdev;
> +
> + ret = v4l2_device_register(ope->dev, &ope->v4l2_dev);
> + if (ret)
> + goto err_mdev_cleanup;
> +
> + ret = media_device_register(&ope->mdev);
> + if (ret)
> + goto err_v4l2;
> +
> + ope->pipeline = camss_isp_pipeline_alloc(OPE_ENTITY_COUNT);
> + if (IS_ERR(ope->pipeline)) {
> + ret = PTR_ERR(ope->pipeline);
> + goto err_media;
> + }
> +
> + ret = camss_isp_pipeline_register(ope->pipeline, &ope->v4l2_dev,
> + ope_entity_descs,
> ARRAY_SIZE(ope_entity_descs));
> + if (ret) {
> + camss_isp_pipeline_free(ope->pipeline);
> + ope->pipeline = NULL;
> + goto err_media;
> + }
> +
> + ope->pipeline->drv_priv = ope;
> + return 0;
> +
> +err_media:
> + media_device_unregister(&ope->mdev);
> +err_v4l2:
> + v4l2_device_unregister(&ope->v4l2_dev);
> +err_mdev_cleanup:
> + media_device_cleanup(&ope->mdev);
> + return ret;
> +}
> +
> +static void ope_v4l2_cleanup(struct ope_dev *ope)
> +{
> + if (ope->pipeline) {
> + camss_isp_pipeline_unregister(ope->pipeline);
> + camss_isp_pipeline_free(ope->pipeline);
> + ope->pipeline = NULL;
> + }
> + media_device_unregister(&ope->mdev);
> + v4l2_device_unregister(&ope->v4l2_dev);
> + media_device_cleanup(&ope->mdev);
> + camss_isp_sched_destroy(&ope->sched);
> +}
> +
> +static int ope_soft_reset(struct ope_dev *ope)
> +{
> + u32 version;
> + int ret;
> +
> + ret = pm_runtime_resume_and_get(ope->dev);
> + if (ret)
> + return dev_err_probe(ope->dev, ret, "resume failed\n");
> +
> + version = ope_read(ope, OPE_TOP_HW_VERSION);
> + dev_dbg(ope->dev, "HW version %u.%u.%u\n",
> + (u32)FIELD_GET(OPE_TOP_HW_VERSION_GEN, version),
> + (u32)FIELD_GET(OPE_TOP_HW_VERSION_REV, version),
> + (u32)FIELD_GET(OPE_TOP_HW_VERSION_STEP, version));
> +
> + reinit_completion(&ope->reset_complete);
> + ope_write(ope, OPE_TOP_RESET_CMD, OPE_TOP_RESET_CMD_SW);
> +
> + if (!wait_for_completion_timeout(&ope->reset_complete,
> +
> msecs_to_jiffies(OPE_RESET_TIMEOUT_MS))) {
> + dev_err(ope->dev, "Reset timeout\n");
> + pm_runtime_put(ope->dev);
> + return -ETIMEDOUT;
> + }
> +
> + pm_runtime_put(ope->dev);
> +
> + return 0;
> +}
> +
> +static int ope_init_power(struct ope_dev *ope)
> +{
> + struct dev_pm_domain_list *pmdomains;
> + struct device *dev = ope->dev;
> + int ret;
> +
> + ope->icc_data = devm_of_icc_get(dev, "data");
> + if (IS_ERR(ope->icc_data))
> + return dev_err_probe(dev, PTR_ERR(ope->icc_data),
> + "failed to get interconnect data path\n");
> +
> + ope->icc_config = devm_of_icc_get(dev, "config");
> + if (IS_ERR(ope->icc_config))
> + return dev_err_probe(dev, PTR_ERR(ope->icc_config),
> + "failed to get interconnect config
> path\n");
> +
> + devm_pm_domain_attach_list(dev, NULL, &pmdomains);
> +
> + ret = devm_pm_opp_set_clkname(dev, "core");
> + if (ret)
> + return ret;
> +
> + ret = devm_pm_opp_of_add_table(dev);
> + if (ret && ret != -ENODEV)
> + return dev_err_probe(dev, ret, "invalid OPP table\n");
> +
> + ret = devm_pm_runtime_enable(dev);
> + if (ret)
> + return ret;
> +
> + ret = devm_pm_clk_create(dev);
> + if (ret)
> + return ret;
> +
> + ret = of_pm_clk_add_clks(dev);
> + if (ret < 0)
> + return ret;
> +
> + return 0;
> +}
> +
> +static int ope_init_mmio(struct ope_dev *ope)
> +{
> + struct platform_device *pdev = to_platform_device(ope->dev);
> +
> + ope->base = devm_platform_ioremap_resource_byname(pdev, "top");
> + if (IS_ERR(ope->base))
> + return PTR_ERR(ope->base);
> +
> + ope->base_rd = devm_platform_ioremap_resource_byname(pdev, "bus_read");
> + if (IS_ERR(ope->base_rd))
> + return PTR_ERR(ope->base_rd);
> +
> + ope->base_wr = devm_platform_ioremap_resource_byname(pdev, "bus_write");
> + if (IS_ERR(ope->base_wr))
> + return PTR_ERR(ope->base_wr);
> +
> + ope->base_pp = devm_platform_ioremap_resource_byname(pdev, "pipeline");
> + if (IS_ERR(ope->base_pp))
> + return PTR_ERR(ope->base_pp);
> +
> + return 0;
> +}
> +
> +static int ope_probe(struct platform_device *pdev)
> +{
> + struct device *dev = &pdev->dev;
> + struct ope_dev *ope;
> + int ret, irq;
> +
> + ope = devm_kzalloc(dev, sizeof(*ope), GFP_KERNEL);
> + if (!ope)
> + return -ENOMEM;
> +
> + ope->dev = dev;
> + init_completion(&ope->reset_complete);
> +
> + ret = dma_set_mask(dev, DMA_BIT_MASK(32));
> + if (ret)
> + return dev_err_probe(dev, ret, "Failed to set DMA mask\n");
> +
> + ret = ope_init_power(ope);
> + if (ret)
> + return dev_err_probe(dev, ret, "Power init failed\n");
> +
> + ret = ope_init_mmio(ope);
> + if (ret)
> + return dev_err_probe(dev, ret, "MMIO init failed\n");
> +
> + irq = platform_get_irq(pdev, 0);
> + if (irq < 0)
> + return dev_err_probe(dev, irq, "Unable to get IRQ\n");
> +
> + ret = devm_request_irq(dev, irq, ope_irq, 0, "camss-ope", ope);
> + if (ret < 0)
> + return dev_err_probe(dev, ret, "Requesting IRQ failed\n");
> +
> + ret = ope_soft_reset(ope);
> + if (ret)
> + return ret;
> +
> + ret = ope_v4l2_init(ope);
> + if (ret)
> + return dev_err_probe(dev, ret, "V4L2 init failed\n");
> +
> + platform_set_drvdata(pdev, ope);
> +
> + return 0;
> +}
> +
> +static void ope_remove(struct platform_device *pdev)
> +{
> + struct ope_dev *ope = platform_get_drvdata(pdev);
> +
> + ope_v4l2_cleanup(ope);
> +}
> +
> +static const struct of_device_id ope_dt_ids[] = {
> + { .compatible = "qcom,qcm2290-camss-ope" },
> + { },
> +};
> +MODULE_DEVICE_TABLE(of, ope_dt_ids);
> +
> +static const struct dev_pm_ops ope_pm_ops = {
> + SET_RUNTIME_PM_OPS(pm_clk_suspend, pm_clk_resume, NULL)
> +};
> +
> +static struct platform_driver ope_driver = {
> + .probe = ope_probe,
> + .remove = ope_remove,
> + .driver = {
> + .name = OPE_NAME,
> + .of_match_table = ope_dt_ids,
> + .pm = &ope_pm_ops,
> + },
> +};
> +
> +module_platform_driver(ope_driver);
> +
> +MODULE_DESCRIPTION("CAMSS Offline Processing Engine");
> +MODULE_AUTHOR("Loic Poulain <[email protected]>");
> +MODULE_LICENSE("GPL");
> +/* Downscaler fixed-point helpers */
> diff --git a/drivers/media/platform/qcom/camss/camss-isp-pipeline.c
> b/drivers/media/platform/qcom/camss/camss-isp-pipeline.c
> index
> 8e44bedb0a41e3cf4fc7e3a138c1f48854f5efc8..625e6a65137fa980fb1a8c2b8a8132623f5b7b62
> 100644
> --- a/drivers/media/platform/qcom/camss/camss-isp-pipeline.c
> +++ b/drivers/media/platform/qcom/camss/camss-isp-pipeline.c
> @@ -194,6 +194,8 @@ static int isp_register_vdev(struct
> camss_isp_pipeline_entity *slot,
> vdev->fops = desc->vdev.fops;
> if (desc->vdev.ioctl_ops)
> vdev->ioctl_ops = desc->vdev.ioctl_ops;
> + if (desc->vdev.entity_ops)
> + vdev->entity.ops = desc->vdev.entity_ops;
>
> vdev->entity.obj_type = MEDIA_ENTITY_TYPE_VIDEO_DEVICE;
> vdev->entity.function = desc->function ? desc->function :
> MEDIA_ENT_F_IO_V4L;
> @@ -222,6 +224,10 @@ static int isp_register_subdev(struct
> camss_isp_pipeline_entity *slot,
> strscpy(sd->name, desc->name, sizeof(sd->name));
> sd->entity.function = desc->function ?
> desc->function : MEDIA_ENT_F_V4L2_SUBDEV_UNKNOWN;
> + /* Create a /dev/v4l-subdevN node so userspace can query pad formats */
> + sd->flags |= V4L2_SUBDEV_FL_HAS_DEVNODE;
> + if (desc->subdev.internal_ops)
> + sd->internal_ops = desc->subdev.internal_ops;
>
> ret = media_entity_pads_init(&sd->entity, slot->num_pads, slot->pads);
> if (ret)
> @@ -349,6 +355,11 @@ int camss_isp_pipeline_register(struct
> camss_isp_pipeline *pipeline,
> }
> }
>
> + /* Create /dev/v4l-subdevN nodes for all registered subdevs */
> + ret = v4l2_device_register_subdev_nodes(v4l2_dev);
> + if (ret)
> + goto err_unregister;
> +
> return 0;
>
> err_unregister:
> diff --git a/drivers/media/platform/qcom/camss/camss-isp-pipeline.h
> b/drivers/media/platform/qcom/camss/camss-isp-pipeline.h
> index
> 5dfa32dcafc0a944ca2c160fb5846a2c73214acc..06654948e1eceb7e35aedcabcd8ac3bd92280018
> 100644
> --- a/drivers/media/platform/qcom/camss/camss-isp-pipeline.h
> +++ b/drivers/media/platform/qcom/camss/camss-isp-pipeline.h
> @@ -76,10 +76,12 @@ struct camss_isp_entity_desc {
> void *drvdata;
> const struct v4l2_file_operations *fops;
> const struct v4l2_ioctl_ops *ioctl_ops;
> + const struct media_entity_operations *entity_ops;
> } vdev;
> /* MEDIA_ENTITY_TYPE_V4L2_SUBDEV */
> struct {
> const struct v4l2_subdev_ops *ops;
> + const struct v4l2_subdev_internal_ops *internal_ops;
> } subdev;
> };
> };
>