Some architectures (I'm thinking of the latest Intel CPUs) have a small loop cache
whose aim is to keep a loop entirely within that cache. That cache operates at the
full speed of the instruction fetch/execute cycle (actually I think it holds the
decoded uOps), i.e. you can't go faster. Both the L1 cache penalty and the
instruction decode time are avoided.
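For concreteness, a minimal C sketch (my illustration, not measured; the loop-buffer
behavior is an assumption about CPUs that have one): a body this small is the kind
of thing such a cache can replay without touching the I-cache or the decoders:

    /* Illustrative only: a summation loop whose body compiles to a
       handful of instructions.  On CPUs with a decoded-uOp loop
       buffer, a loop this small can be replayed from that buffer,
       skipping I-cache fetch and decode on every iteration. */
    long sum_tight(const long *a, long n)
    {
        long sum = 0;
        for (long i = 0; i < n; i++)
            sum += a[i];    /* roughly load/add/inc/cmp/branch per pass */
        return sum;
    }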
TTFN - Guy
On Jan 8, 2019, at 2:43 PM, Chuck Guzis via cctalk
<cctalk at classiccmp.org> wrote:
On 1/8/19 1:23 PM, Tapley, Mark via cctalk wrote:
Why so (why surprising, I mean)? Understood, an unrolled loop executes faster...
That can't always be true, can it?
I'm thinking of an architecture where the instruction cache is slow to fill,
multiple overlapping operations are involved, and branch prediction assumes a
branch taken. I'd say it would be very close in that case.
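As a sketch (not something I've timed), compare the same sum loop unrolled four
ways: fewer loop-control instructions per element, but a body several times larger,
which is exactly what can spill out of a small loop cache or a slow-to-fill I-cache:

    /* Sketch of a 4x-unrolled sum.  Less branch/index overhead per
       element, but unroll far enough and the loop no longer fits in
       a small loop cache (or fills the I-cache slowly), so "unrolled
       is faster" stops being automatic. */
    long sum_unrolled(const long *a, long n)
    {
        long sum = 0;
        long i;
        for (i = 0; i + 4 <= n; i += 4)
            sum += a[i] + a[i+1] + a[i+2] + a[i+3];
        for (; i < n; i++)          /* leftover elements */
            sum += a[i];
        return sum;
    }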
--Chuck