PDP-11/45 RSTS/E boot problem

Fritz Mueller fritzm at fritzm.org
Wed Feb 6 20:55:45 CST 2019


On 2/6/19 6:25 PM, Jon Elson via cctalk wrote:
> I'm thinking it is bad memory.  It seems unlikely bus problems could 
> alter only ONE BIT per word, so I think it is just a bad memory chip, 
> and finding multiple words where the 010000 bit is now turned on sure 
> looks like that kind of problem.

So, there was an issue specifically relating to bit 12 on the front 
panel (d'oh!), which I have now cleared up.

Furthermore, the "authoritative" sequence of 16 words obtained from the 
front panel last night, after addressing this issue, is:

PA:171600: 016162 004767 000224 000414 016700 016152 016702 016144
PA:171620: 004767 000206 000405 012404 012467 016124 000167 177346

...and, as it turns out, this exact sequence also occurs within the ls 
binary, on disk (per "od"):

0004220 016162 004767 000224 000414 016700 016152 016702 016144
0004240 004767 000206 000405 012404 012467 016124 000167 177346

So, the memory there _seems_ fine with the latest info at our disposal. 
It looks like the question boils down to either "how did that part of 
the binary get to that part of memory?", or "how did we end up executing 
out of that part of memory?"
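
The match between the front-panel words and the "od" output can be checked mechanically rather than by eye. The following is only a minimal sketch: the two word lists are copied from the dumps above, and the helper names (parse_octal_words, diff_words) are made up for illustration. It also reports the XOR of any differing pair, so a single stuck bit such as 010000 would be easy to spot.

```python
# Words examined from the front panel starting at PA 171600 (from the message).
PANEL = """
016162 004767 000224 000414 016700 016152 016702 016144
004767 000206 000405 012404 012467 016124 000167 177346
"""

# Words shown by "od" in the ls binary starting at offset 0004220 (from the message).
DISK = """
016162 004767 000224 000414 016700 016152 016702 016144
004767 000206 000405 012404 012467 016124 000167 177346
"""


def parse_octal_words(text):
    """Turn whitespace-separated octal tokens into a list of integers."""
    return [int(tok, 8) for tok in text.split()]


def diff_words(a, b):
    """Return (index, a_word, b_word, xor) for every position that differs."""
    return [(i, x, y, x ^ y) for i, (x, y) in enumerate(zip(a, b)) if x != y]


panel = parse_octal_words(PANEL)
disk = parse_octal_words(DISK)
mismatches = diff_words(panel, disk)

print(f"{len(panel)} words compared, {len(mismatches)} mismatches")
for i, x, y, d in mismatches:
    # The xor column shows exactly which bits differ between the two copies.
    print(f"word {i}: panel {x:06o}  disk {y:06o}  xor {d:06o}")
```

For the two dumps quoted here it reports no mismatches, which is consistent with the memory at that location holding a faithful copy of that part of the binary.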

Could still be a memory issue _elsewhere_ that lands us there, of 
course...  Could also be a translation error lurking in the KT11, or a 
CPU bug not found by any of the DEC diagnostic suites.

I will scope the refresh clock when I get home tonight, and I'm planning 
on hauling out the logic analyzer for an IR trace this weekend...

    --FritzM.


P.S. One idea that popped into my head recently, after a suggestion here 
to check the KT11 address translation adders, and my response "but the 
diagnostics!"...  A bug in one of the carry lookahead generators used 
between the bit slices of that adder could cause a mistranslation on 
only a fairly selective subset of virtual addresses, and this might 
conceivably be missed by the KT11 diagnostics?  *IF* that's the case and 
we can chase the IR trace upstream to the point of an unlucky 
mistranslation, it should then be fairly easy to track down and fix 
in the hardware.
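
To illustrate why such a fault could hide from diagnostics, here is a rough sketch of the idea, not a model of the actual KT11 logic. It assumes the usual PDP-11 relocation arithmetic (physical address = (PAF + VA<12:06>) followed by VA<05:00>), and it assumes, purely for illustration, a 12-bit adder built from three 4-bit slices with the carry into the top slice lost. The slice width, the fault location, and the PAF value 001620 are all hypothetical.

```python
def add_sliced(a, b, slices=3, width=4, broken_carry_into=None):
    """Add a and b one slice at a time, low slice first.

    If broken_carry_into names a slice index, the carry into that slice is
    forced to 0 (the hypothetical carry-lookahead fault).
    """
    mask = (1 << width) - 1
    result, carry = 0, 0
    for s in range(slices):
        if s == broken_carry_into:
            carry = 0  # assumed fault: the carry into this slice is dropped
        total = ((a >> (s * width)) & mask) + ((b >> (s * width)) & mask) + carry
        result |= (total & mask) << (s * width)
        carry = total >> width
    return result  # carry out of the top slice is discarded


def translate(va, paf, broken_carry_into=None):
    """Relocate a virtual address within one page using page address field paf."""
    block = (va >> 6) & 0o177  # VA<12:06>: block number within the page
    dib = va & 0o77            # VA<05:00>: displacement in block, passed through
    return (add_sliced(paf, block, broken_carry_into=broken_carry_into) << 6) | dib


PAF = 0o1620  # hypothetical page address field, chosen only for this example

# Walk every word address in one 8 KB page and collect the mistranslated ones.
bad = [
    va
    for va in range(0, 0o20000, 2)
    if translate(va, PAF) != translate(va, PAF, broken_carry_into=2)
]

print(f"{len(bad)} of {0o20000 // 2} word addresses mistranslate")
if bad:
    print(f"first {bad[0]:06o}, last {bad[-1]:06o}")
```

With these assumed values only the top 16 blocks of the page (512 of 4096 words, starting at virtual 016000) come out wrong, and a different PAF can make the fault vanish entirely. A diagnostic that never hits a PAF and block number combination that carries across that slice boundary would pass.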

