Are you sure? My understanding is that PLP and PLA
increment the stack
pointer *before* fetching the byte off the stack.
[...]
Maybe you're thinking of PHP and PHA ([SP]:=A;
SP:=SP+1), which take one
cycle less than PLP/PLA?
You're right, I was confused.