On 1/8/19 1:23 PM, Tapley, Mark via cctalk wrote:
Why so (why surprising, I mean)? Understood an
unrolled loop executes
faster...
That can't always be true, can it?
I'm thinking of an architecture where the instruction cache is slow to
fill and multiple overlapping operations are involved and branch
prediction assumes a branch taken. I'd say it was very close in that case.
--Chuck