[cctalk] Re: Delay slots, was: Re: Re: early microprocessor limited pipelining [was: Intel 8086 - 46 yrs. ago]

13 Jun 2024

On Thu, 13 Jun 2024 at 18:22, Jonathan Stone via cctalk <
cctalk(a)classiccmp.org> wrote:

...

 On Thursday, June 13, 2024 at 03:00:22 PM PDT, Maciej W. Rozycki via
 cctalk <cctalk(a)classiccmp.org> wrote:

  The architecture designers cheated however even in the original ISA in
  that moves from the MD accumulator did interlock.  I guess they figured
  people (either doing it by hand or by writing a compiler) wouldn't get
  that right anyway. ;)

 I always assumed that was because the latency of multiply, let alone
 divide, was far too many cycles for anyone to plausibly schedule "useful"
 instructions into. Wasn't r4000 divide latency over 60 cycles?

The MIPS R4000 manual
https://groups.csail.mit.edu/cag/raw/documents/R4400_Uman_book_Ed2.pdf
says that double-precision divide is 36 cycles, and double-precision square
root is 112 (!).

What's interesting about that is that GCC's model of the R4000 says that
divide is 69 cycles; I'm not sure of the reason for the discrepancy.
https://gcc.gnu.org/git/?p=gcc.git;a=blob_plain;f=gcc/config/mips/4000.md;h…

-Henry
