Fixed shadowed variables in assembly macros for Cortex-M convolution
Fixed type promotions in _f64 matrix and transform code