Skip to content

Commit

Permalink
SPU LLVM: Minor SUMB AVX-512 path optimization
Browse files Browse the repository at this point in the history
- Tweak shuffle to allow LLVM to emit a cheap blend instruction instead of the expensive VPERMI2W instruction
  • Loading branch information
Whatcookie authored and Nekotekina committed Jan 27, 2023
1 parent 6532d2b commit 813f7b5
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion rpcs3/Emu/Cell/SPURecompiler.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -7709,7 +7709,7 @@ class spu_llvm_recompiler : public spu_recompiler_base, public cpu_translator

const auto ax = vdbpsadbw(a, zeroes, 0);
const auto bx = vdbpsadbw(b, zeroes, 0);
set_vr(op.rt, shuffle2(ax, bx, 0, 8, 2, 10, 4, 12, 6, 14));
set_vr(op.rt, shuffle2(ax, bx, 0, 9, 2, 11, 4, 13, 6, 15));
return;
}

Expand Down

0 comments on commit 813f7b5

Please sign in to comment.