arm_mat_solve_lower_triangular_f32 and arm_mat_solve_upper_triangular_f32 had constraints which could be removed to make the function more generic.