arm_cmplx_mag_q15 reworked to be more accurate, at the cost of speed.
New function arm_cmplx_mag_fast_q15 introduced to retain the previous
faster (but less accurate) behavior.
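
For illustration only, here is a minimal scalar sketch of what a Q15 complex magnitude computes. It is not the CMSIS-DSP implementation of either function: the names ref_cmplx_mag_q15 and isqrt_u32 are hypothetical, and the interleaved (re, im) input layout, the Q2.14 output format and the truncating integer square root are assumptions made for this example.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: floor(sqrt(x)) using the classic bit-by-bit method. */
uint32_t isqrt_u32(uint32_t x)
{
    uint32_t res = 0;
    uint32_t bit = 1u << 30;          /* highest power of four that fits */

    while (bit > x) {
        bit >>= 2;
    }
    while (bit != 0) {
        if (x >= res + bit) {
            x -= res + bit;
            res = (res >> 1) + bit;
        } else {
            res >>= 1;
        }
        bit >>= 2;
    }
    return res;
}

/* Hypothetical reference, not part of CMSIS-DSP.
 * src: num_samples interleaved (re, im) pairs in Q1.15 (assumed layout).
 * dst: num_samples magnitudes in Q2.14 (assumed output format). */
void ref_cmplx_mag_q15(const int16_t *src, int16_t *dst, size_t num_samples)
{
    assert(src != NULL && dst != NULL);

    for (size_t i = 0; i < num_samples; i++) {
        int32_t re = src[2 * i];
        int32_t im = src[2 * i + 1];

        /* Each square is at most 2^30, so the sum fits in 32 unsigned bits. */
        uint32_t sum = (uint32_t)(re * re) + (uint32_t)(im * im);

        /* sqrt of a Q2.30 value is in Q1.15 scale; halve it to get Q2.14. */
        dst[i] = (int16_t)(isqrt_u32(sum) >> 1);
    }
}
```

A faster variant would typically trade some of this precision for fewer operations in the square root step; the sketch above makes no attempt to model that.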
Improved the Python wrapper to support the new functions.
Solved most of the f16 issues, but some build issues
remain with gcc10q4.
Two functions revert to the scalar version when built with gcc on M55
(since the Helium versions of those functions do not build).
(For our internal use. In the short term, we won't provide support for it.)
CMSIS-DSP: Update to cmake build for the testing framework
CMSIS-NN: Implementation of arm_fully_connected_s8
Uses an API and quantization scheme compatible with TF Lite.
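
As a rough illustration of TF Lite style 8-bit quantization in a fully connected layer, the sketch below accumulates in 32 bits, rescales, adds the output zero point and saturates to int8. It is not the arm_fully_connected_s8 API: the function name ref_fully_connected_s8 and all of its parameters are hypothetical, and the floating-point effective_scale (assumed to be input_scale * weight_scale / output_scale) stands in for the fixed-point multiplier and shift that integer-only kernels typically use.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical reference, not the CMSIS-NN API.
 * Quantization model assumed: real_value = scale * (q - zero_point).
 * weights is row-major, out_len rows of in_len values. */
void ref_fully_connected_s8(const int8_t *input, const int8_t *weights,
                            const int32_t *bias, int8_t *output,
                            size_t in_len, size_t out_len,
                            int32_t input_zp, int32_t weight_zp,
                            int32_t output_zp, double effective_scale)
{
    assert(input != NULL && weights != NULL && output != NULL);

    for (size_t o = 0; o < out_len; o++) {
        /* 32-bit accumulator, seeded with the (optional) bias. */
        int32_t acc = (bias != NULL) ? bias[o] : 0;

        for (size_t i = 0; i < in_len; i++) {
            acc += ((int32_t)input[i] - input_zp) *
                   ((int32_t)weights[o * in_len + i] - weight_zp);
        }

        /* Rescale to the output domain, rounding half away from zero. */
        double scaled = (double)acc * effective_scale;
        int32_t q = (int32_t)(scaled >= 0.0 ? scaled + 0.5 : scaled - 0.5);
        q += output_zp;

        /* Saturate to the int8 range. */
        if (q < -128) q = -128;
        if (q > 127) q = 127;
        output[o] = (int8_t)q;
    }
}
```

Using a double for the scale keeps the example short; it is a simplification chosen here and should not be read as how the library performs requantization.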