areusch opened a new pull request #8072:
URL: https://github.com/apache/tvm/pull/8072
This PR introduces a `style` key to the Model Library Format metadata with
two initial values:
- `full-model`
- `operator`
Implementations that use Model Library Format will now need to check `style`
before reading other files in the archive. `full-model` indicates the
previously-used format.
`operator` is introduced to allow exporting TVM libraries that contain only
operator functions with no model-level information (e.g. executor
configuration, model-wide memory planning, etc). The goal of `operator` style
is to allow exporting fragments of models (e.g. individual TVM operators) for
use with the TVM RPC Server. After the Project API refactor lands, TVM
auto-tuning will produce MLF in `operator` style, and those MLF archives will
be given to project generators with the ultimate goal of flashing and timing
those operators on-device.
MLF archives with `operator` style contain:
- `codegen` directory, organized as in `full-model`
- `metadata.json` of the same format as `full-model` with different values.
- `src/tir-<device_type>.txt`, containing pretty-printed TIR
Notably, the `memory` key in `metadata.json` contains shape information for
each operator function parameter. The shape information has names correlated
with those used in the TIR sources in `src/tir-*.txt`.
@leandron @manupa-arm @giuseros @Mousius @gromero @mehrdadh @stoa
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]