wuyii8941 opened a new issue, #19570:
URL: https://github.com/apache/tvm/issues/19570
## Expected behavior
`Resize` with `coordinate_transformation_mode=half_pixel` (or
`pytorch_half_pixel`, `asymmetric`) and non-integer scale factors (e.g., 2.5x)
should compute source coordinates as:
```
src = (dst + 0.5) / scale - 0.5
```
and produce results matching ONNX Runtime.
## Actual behavior
TVM computes incorrect source coordinates, leading to wrong pixel mapping.
For a 3×3 input with scale=2.5, the max absolute difference vs ORT is **4.0**
(nearest mode) and **0.63** (linear mode).
## Reproduction
```python
import numpy as np
import onnx
from onnx import helper, TensorProto, numpy_helper
import onnxruntime as ort
import tvm
from tvm import relax
from tvm.relax.frontend.onnx import from_onnx
x = np.arange(9, dtype=np.float32).reshape(1, 1, 3, 3)
X = helper.make_tensor_value_info("X", TensorProto.FLOAT, [1, 1, 3, 3])
Y = helper.make_tensor_value_info("Y", TensorProto.FLOAT, None)
roi = numpy_helper.from_array(np.array([], dtype=np.float32), "roi")
sc = numpy_helper.from_array(np.array([1, 1, 2.5, 2.5], dtype=np.float32),
"scales")
node = helper.make_node("Resize", ["X", "roi", "scales"], ["Y"],
mode="nearest",
coordinate_transformation_mode="half_pixel",
nearest_mode="floor")
graph = helper.make_graph([node], "test", [X], [Y], initializer=[roi, sc])
model = helper.make_model(graph, opset_imports=[helper.make_opsetid("", 18)])
model = onnx.shape_inference.infer_shapes(model)
# ORT
sess = ort.InferenceSession(model.SerializeToString())
ort_out = sess.run(None, {"X": x})[0]
# TVM
mod = from_onnx(model)
exe = tvm.relax.build(
tvm.ir.transform.Sequential([relax.transform.LegalizeOps()])(mod),
target="llvm")
vm = tvm.relax.VirtualMachine(exe, device=tvm.cpu())
tvm_out = vm["main"](tvm.runtime.tensor(x, device=tvm.cpu())).numpy()
print("Max diff:", np.abs(ort_out - tvm_out).max()) # 4.0
```
## Affected configurations
| mode | coordinate_transformation_mode | max_diff vs ORT |
|------|-------------------------------|-----------------|
| nearest | half_pixel | 4.0 |
| nearest | pytorch_half_pixel | 4.0 |
| linear | half_pixel | 0.63 |
| linear | asymmetric | 0.46 |
| linear | pytorch_half_pixel | 0.63 |
Trigger condition: non-integer scale factor (e.g., 2.5) on odd spatial dims
(e.g., 3×3). Integer scales (2x, 3x) appear correct.
## Root cause
The coordinate transformation for `half_pixel` mode applies `floor((dst +
0.5) / scale - 0.5)` but the implementation appears to use a different rounding
or offset, causing the source index to be off by one pixel for certain output
positions.
## Environment
- TVM: 0.24.dev0, commit 0b0afd8dd (2026-04-24)
- Python: 3.11
- OS: Linux
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]