The GitHub Actions job "Comment commands" on texera.git/main has succeeded.
Run started by GitHub user mengw15 (triggered by mengw15).

Head commit for run:
62883b886837fc1d949e8c29b26b6e8153458d34 / Matthew B. <[email protected]>
fix: use BINARY type for RAW_BYTES URL fetcher schema (#5133)

### What changes were proposed in this PR?
`URLFetcherOpDesc.sourceSchema()` advertised `AttributeType.ANY` for
`RAW_BYTES` decoding, even though the executor already emits a concrete
`byte[]`. This change returns `AttributeType.BINARY` instead, matching
the runtime payload and unblocking Iceberg materialization (which
rejects `ANY`). The existing `URLFetcherOpDescSpec` test that pinned the
old behavior is flipped to assert `BINARY`.

Test json: 

[url-fetcher-raw-bytes-test.json](https://github.com/user-attachments/files/28089609/url-fetcher-raw-bytes-test.json)

  ### Any related issues, documentation, or discussions?
  Closes: #5074
  ### How was this PR tested?
Updated `URLFetcherOpDescSpec` covers the schema; `URLFetcherOpExecSpec`
already pins the runtime field as `Array[Byte]`, so static schema and
runtime type now agree. CI Scala test job is expected to pass.
  ### Was this PR authored or co-authored using generative AI tooling?
  Co-authored with Claude Opus 4.7 in compliance with ASF

Report URL: https://github.com/apache/texera/actions/runs/26212163335

With regards,
GitHub Actions via GitBox

Reply via email to