arm neon optimization for layernorm fp32/bf16s/fp16s (#5746) #4688
linux-mips64-cpu-gcc.yml
on: push
linux-gcc-mips64el
10m 6s
linux-gcc-mipsisa64r6el
11m 24s