Re: [RFC PATCH 0/5] aarch64: Support for user-defined aarch64 tuning parameters in JSON

David Malcolm Wed, 07 May 2025 06:42:44 -0700

On Tue, 2025-05-06 at 14:00 +0530, soum...@nvidia.com wrote:
> From: Soumya AR <soum...@nvidia.com>
> 
> Hi,
> 
> This RFC and subsequent patch series introduces support for printing
> and parsing
> of aarch64 tuning parameters in the form of JSON.
> 
> It is important to note that this mechanism is specifically intended
> for power
> users to experiment with tuning parameters. This proposal does not
> suggest the
> use of JSON tuning files in production. Additionally, the JSON format
> should not
> be considered stable and may change as GCC evolves.
> 
> [1] Introduction
> 
> Currently, the aarch64 backend in GCC (15) stores the tuning
> parameteres of all
> backends under gcc/config/aarch64/tuning_models/. Since these
> parameters are 
> hardcoded for each CPU, this RFC proposes a technique to support the
> adjustment
> of these parameters at runtime. This allows easier experimentation
> with more
> aggressive parameters to find optimal numbers.
> 
> The tuning data is fed to the compiler in JSON format, which was
> primarily 
> chosen for the following reasons:
> 
> * JSON can represent hierarchical data. This is useful for
> incorporating the
> nested nature of the tuning structures.
> * JSON supports integers, strings, booleans, and arrays.
> * GCC already has support for parsing and printing JSON, removing the
> need for
> writing APIs to read and write the JSON files.
>  
> Thus, if we take the following example of some tuning parameters:
> 
> static struct cpu_addrcost_table generic_armv9_a_addrcost_table =
> {
>     {
>       1, /* hi  */
>       0, /* si  */
>       0, /* di  */
>       1, /* ti  */
>     },
>   0, /* pre_modify  */
>   0, /* post_modify  */
>   2, /* post_modify_ld3_st3  */
>   2, /* post_modify_ld4_st4  */
> };
> 
> static cpu_prefetch_tune generic_armv9a_prefetch_tune =
> {
>   0,                  /* num_slots  */
>   -1,                 /* l1_cache_size  */
>   64,                 /* l1_cache_line_size  */
>   -1,                 /* l2_cache_size  */
>   true,                       /* prefetch_dynamic_strides */
> };
> 
> static struct tune_params neoversev3_tunings =
> {
>   &generic_armv9_a_addrcost_table,
>   10, /* issue_rate  */
>   AARCH64_FUSE_NEOVERSE_BASE, /* fusible_ops  */
>   "32:16",    /* function_align.  */
>   &generic_armv9a_prefetch_tune,
>   AARCH64_LDP_STP_POLICY_ALWAYS,   /* ldp_policy_model.  */
> };
> 
> We can represent them in JSON as:
> 
> {
>   "tune_params": {
>     "addr_cost": {
>       "addr_scale_costs": { "hi": 1, "si": 0, "di": 0, "ti": 1 },
>       "pre_modify": 0,
>       "post_modify": 0,
>       "post_modify_ld3_st3": 2,
>       "post_modify_ld4_st4": 2
>     },
>     "issue_rate": 10,
>     "fusible_ops": 1584,
>     "function_align": "32:16",
>     "prefetch": {
>       "num_slots": 0,
>       "l1_cache_size": -1,
>       "l1_cache_line_size": 64,
>       "l2_cache_size": -1,
>       "prefetch_dynamic_strides": true
>     },
>     "ldp_policy_model": "AARCH64_LDP_STP_POLICY_ALWAYS"
>   }
> }
> 
> ---
> 
> [2] Methodology 
> 
> Before the internal tuning parameters are overridden with user
> provided ones, we
> must ensure the validity of the provided data.
> 
> This is done using a "base" JSON schema, which contains information
> about the 
> tune_params data structure used by the aarch64 backend.
> 
> Example:
> 
> {
>   "tune_params": {
>     "addr_cost": {
>       "addr_scale_costs": {
>         "hi": "int",
>         "si": "int",
>         "di": "int",
>         "ti": "int"
>       },
>       "pre_modify": "int",
>       "post_modify": "int",
>       "post_modify_ld3_st3": "int",
>       "post_modify_ld4_st4": "int"
>     },
>     "issue_rate": "int",
>     "fusible_ops": "uint",
>     "function_align": "string",
>     "prefetch": {
>       "num_slots": "int",
>       "l1_cache_size": "int",
>       "l1_cache_line_size": "int",
>       "l2_cache_size": "int",
>       "prefetch_dynamic_strides": "boolean"
>     },
>     "ldp_policy_model": "string"
>   }
> }
> 
> Using this schema, we can:
>       * Verify that the correct datatypes have been used.
>       * Verify if the user provided "key" or tuning parameter
> exists.
>       * Allow user to only specify the required fields (in nested
> fashion), 
>       eliminating the need to list down every single paramter if
> they only
>       wish to experiment with some.
>       
> The schema is currently stored as a raw JSON string in
> config/aarch64/aarch64-json-schema.h.


Does the schema get used to do validation anywhere?  FWIW (and as
posted in another followup), for SARIF the DejaGnu tests validate the
generated json against a schema; see gcc/testsuite/lib/scansarif.exp;
there's also run-sarif-pytest which allows DejaGnu to run Python
scripts to verify properties of the generated json.  The latter is
probably overkill for the aarch64 tuning use-case, but is very helpful
for SARIF, which has deeply nested json, cross-references, duplication
and de-duplication, etc (so regexps aren't expressive enough for
testing it)


[...snip...]
> 

> ---
> 
> [3] Testing 
> 
> To test out the functionality for this change, we have to ensure the
> following
> things are happening correctly:
> 
> 1. The JSON tunings printer is able to print back the correct values,
> especially 
> when it comes to trickier datatypes like enums. 
> 2. The error handling works as expected, espcially in the case of
> incorrect JSON
> syntax, incorrect datatypes, and incorrect tuning data structure. 
> 3. During GCC invokation, the values from JSON are correctly loaded
> in
> aarch64_tune_params. 

The error-handling seems to use plain "error (msg);" which uses
input_location, which isn't going to tell users where in the JSON the
problem is.  Do the error messages contain enough context for people to
debug this?  Or are we assuming that this is an advanced feature that
doesn't need a lot of UX polish?

As noted in a footnote in another reply, for sarif parsing, the user
gets physical and logical locations within the JSON (and metadata about
violations of the spec):

In JSON property '/runs/0/results/0/level':
bad-level.sarif:12:20: error: unrecognized value for 'level': 'mostly harmless' 
[SARIF v2.1.0 §3.27.10]
   12 |           "level": "mostly harmless",
      |                    ^~~~~~~~~~~~~~~~~

but libsarifreplay.cc here is using libgdiagnostics.h to manage things,
rather than diagnostic-core.h (and thus has a different way of managing
locations.  I have posted patches in the past that wired up the json
parser to location_t and diagnostic-core.h, if that would be useful).


> To test these, we make use of a combination of regression tests (in
> gcc.target/aarch64/aarch64-json-tunings/) as well as self-tests to
> check the
> contents of aarch64_tune_params during the GCC build.
> 
> ---
> 
> [4] Limitations:
> 
> Lack of comments in JSON:
>       * JSON does not have the ability to store comments, which
> leads to the 
>       loss of useful information that is provided in the form of
> comments in 
>       the header files. A workaround is to have a dummy "comment"
> key and
>       ignore it when parsing. (e.g., "comment": "parameter
> description")

Note that gcc/json-parser.cc can support reading C/C++-style comments
if you enable a flag when creating the parser object, but these
comments are discarded (not round-tripped like e.g. XML can support). 
They've helpful for DejaGnu tests of sarif-replay fwiw for verifying
that it emits good error messages on bogus json inputs.

FWIW I have experimental code for writing out json::value trees as
JSON5, which does support comments (also, for that matter, for CBOR,
though that doesn't support comments).  I don't yet have parsing for
JSON5.

> 
> No enum support in JSON:
>       * The current workaround for this is to use strings instead
> of enums,
>       but we lose out on the ability to pass enum as values, as
> well as doing
>       bitwise operations on the enums, something used quite
> frequently for
>       some parameters. 

Not sure if this is helpful, but I suppose the json could contain
something like:

"prop" : { "binop": "or", "values": ["ENUM1", "ENUM2", "ENUM3"]}

to do the equivalent of (ENUM1 | ENUM2 | ENUM 3).  Though that might
not be very amenable to schema-validation.

> 
> No type distinction in JSON:
>       * JSON uses the "number" type which allows signed and
> unsigned integers
>       as well as floats but provides no distinction between them.

Yes, this is a pain.

> 
> Storing the JSON schema:
>       * The JSON schema is currently stored as a raw JSON string
> in
>       aarch64-json-schema.h. This is helpful in exposing the file
> to the
>       testing framework, but is not the cleanest solution.
>       
>       * Theoretically, the schema could be stored in the
> installation
>       directory, but this interferes with the idea of having self-
> tests for
>       the JSON parser.

How are the self-tests using the schema?

>       
> Maintaing the printer/parser routines and JSON schema:
>       * Any change in the aarch64 tuning format will result in the
> need for 
>       manual changes to be made to the routines for the JSON
> tunings printer,
>       parser, and schema.     
>       

Thanks; hope this is constructive
Dave

Re: [RFC PATCH 0/5] aarch64: Support for user-defined aarch64 tuning parameters in JSON

Reply via email to