Hi, Currently we save the entire register content for caller-saved register, even though only lower 4/8 bytes are used, as in the case of SSE math without vectorizer. Is it possible to only save the used portion of register content for caller-saved register?
Thanks. H.J.