arm_cmplx_mag_q15 reworked to be more accurate but less efficient. New function arm_cmplx_mag_fast_q15 introduced to retain previous (less accurate) but faster behavior. Improvement to Python wrapper to support the new functions.