areusch opened a new pull request #8072:
URL: https://github.com/apache/tvm/pull/8072


   This PR introduces a `style` key to the Model Library Format metadata with 
two initial values: 
    - `full-model`
    - `operator`
   
   Implementations that use Model Library Format will now need to check `style` 
before reading other files in the archive. `full-model` indicates the 
previously-used format.
   
   `operator` is introduced to allow exporting TVM libraries that contain only 
operator functions with no model-level information (e.g. executor 
configuration, model-wide memory planning, etc). The goal of `operator` style 
is to allow exporting fragments of models (e.g. individual TVM operators) for 
use with the TVM RPC Server. After the Project API refactor lands, TVM 
auto-tuning will produce MLF in `operator` style, and those MLF archives will 
be given to project generators with the ultimate goal of flashing and timing 
those operators on-device.
   
    MLF archives with `operator` style contain:
    - `codegen` directory, organized as in `full-model`
    - `metadata.json` of the same format as `full-model` with different values.
    - `src/tir-<device_type>.txt`, containing pretty-printed TIR
   
   Notably, the `memory` key in `metadata.json` contains shape information for 
each operator function parameter. The shape information has names correlated 
with those used in the TIR sources in `src/tir-*.txt`.
   
   @leandron @manupa-arm @giuseros @Mousius @gromero @mehrdadh @stoa


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to