AMD's 65nm Preview Part 2 - The Plot Thickens (Updated with Information from AMD)
by Anand Lal Shimpi on December 21, 2006 12:12 AM EST- Posted in
- CPUs
Brisbane Performance Issues Demystified: Higher Latencies to Blame
As you'll remember from Part 1, for some reason, our 65nm Athlon 64 X2 5000+ performed slower than our 90nm part. We had contacted AMD before publication of the article but didn't receive a response until after we were well underway with Part 2. AMD's explanation for the reduced performance? Higher memory latencies.
We wanted to investigate exactly how much higher, thus we turned to CPU-Z's latency benchmark to give us a quick indication of how things had changed.
CPU | CPU-Z Latency (8192KB, 128-byte) |
AMD Athlon 64 X2 5000+ (65nm) | 122 cycles (46.92 ns) |
AMD Athlon 64 X2 5000+ (90nm) | 121 cycles (46.54 ns) |
A single cycle increase in memory access latency, or 0.4ns, is a slight increase but not enough to cause the sort of performance deltas we saw in Quake 4 and Half Life 2, something else was amiss. Luckily it was another metric that CPU-Z's latency test reported that helped us understand the cause of the poor performance: L2 cache access latency.
CPU | CPU-Z L2 Cache Latency | ScienceMark 2.0 L2 Cache Latency |
AMD Athlon 64 X2 5000+ (65nm) | 20 cycles | 20 cycles |
AMD Athlon 64 X2 5000+ (90nm) | 12 cycles | 12 cycles |
Updated - 1/5/07: Although AMD previously did not mention any issues with our findings, we were contacted today and informed that the latency information both ScienceMark and CPU-Z produced is incorrect. The Brisbane core's L2 latency should be 14 cycles, up from 12 cycles and not 20 cycles. This would help explain the relatively low impact on application performance that we've seen across the board. We are still waiting to hear back from AMD on a handful of other issues regarding Brisbane and will update you as soon as we have more information.
The original K8 core, in both 130nm and 90nm flavors, had a 12-cycle L2 cache. With Brisbane, as reported by both CPU-Z and ScienceMark, 65nm K8 now has a 20-cycle L2 cache. Generally speaking you move to a higher latency cache if you're planning on introducing a larger cache size, but a quick glance at AMD's roadmaps doesn't show anything larger than a 1MB L2 per core for the next year. The argument for higher clock speeds isn't valid either as the highest clock speed on AMD's roadmaps thus far is only 3.2GHz.
Luckily the performance impact of the higher latency L2 cache isn't noticeable in all applications, thanks to the K8's on-die memory controller, but make no mistake - the new core is slower. We couldn't figure out why AMD made the change and with most of our key AMD contacts on vacation due to the holidays, we still have no official response on the matter. Rest assured that if/when we learn more we will let you know.
Updated: AMD has given us the official confirmation that L2 cache latencies have increased, and that it purposefully did so in order to allow for the possibility of moving to larger cache sizes in future parts. AMD stressed that this wasn't a pre-announcement of larger cache parts to come, but rather a preparation should the need be there to move to a vastly larger L2. Thankfully the performance delta isn't huge, at least in the benchmarks that we saw, so AMD's decision isn't too painful - especially as it comes with the benefit of a cooler running core that draws less power; ideally we'd like the best of all worlds but we'll take what we can get. Note that none of AMD's current roadmaps show any larger L2 parts (other than the usual 2x1MB offerings), which tells us one of two things: either AMD has some larger L2 parts that it's planning on releasing or AMD is being completely honest with the public in saying that the larger L2 parts will only be released if necessary.
52 Comments
View All Comments
OcHungry - Wednesday, January 3, 2007 - link
I was wondering if there was a way to PM or email you.I would like to bring to your attention a few concerns regarding the forum and the moderators that you need to be aware of. Is there an email or can I PM you in any way?
I would appreciate your respond.
Bayad shoma bedaneed.
tashakore
SilverMirage - Saturday, December 23, 2006 - link
I understand that we are looking at limiting games by the CPU and not the graphics card, but lets be a little realistic...some of those performance per watt figures aren't what an end user is going to be seeing when comparing his options.Wwhat - Saturday, December 23, 2006 - link
Perhaps a weird thought but AMD bought that new ZRAM process that theoretically could put a huge cache on a small space with the flaw that it had so-so latency, then they later bought version 2 of the same but so far they didn't use it.Now what if this is an experiment with that technology, or a preamble, because at some time you would expect them to start using stuff they bought the license off, although at the time of the ZRAM announcement people were projecting use in a very far future that might not be what AMD has in mind, or the situation might have changed as to the performance of current versions of ZRAM.
What do you think, any link to it?
Tujan - Friday, December 22, 2006 - link
If you look,you'll see a story here in Anandtech.com for the first AMD X2s dated June 2005. The processor there was the 'top notch,for something unheard of in the processor industry. And even so doing with DDR400 memory.What is strange in these processor scenarios. There is Moores law. And there is the 'business quarter law. The article here,of wich I am commenting to,if the first of the detail in the "new processors"-the 65Nm processors. The AMDX2xs where also "new" when the 2005 articles was published. Today,as of reading this article,the "old"(and then top notch)is no longer being supported ..or as per use,no longer maintaining the subject cache sizes. AND well,point me if I am not correct,but the "old"processors are no longer going to be manufactured.
Well I have usually considered that in these events of "new,and old"technology that somehow,if something is 18 months past.I can actually afford it. This changes a little with the new Intel setups. Since the Intel lineups finally seem to break this cycle of the previous as I explained here.
For an entire year,I saw article after article,putting AMDs top notch as base line for performance. Ney user could address something of a culminating relationship to what had performance and what they could afford. And AMDs "top notch" an "industry standard" wich of course nobody could afford. At least if somebody was passing the buck,it wasn't happening with me.
Annyway,I would just like to say I wish AMD good luck.Yet I cannot be ashamed to say that I can now put together a system for less than 1000$ with the same parts as that '"top notch"industry standard seen for the permiating part of 2005.
Am I better for thinking that way ? 18 months passed,and my dollars are spendable but not supported ? I dont think that anybody could consider an AMD setup a 'low-end setup if for example the 4800+1Mb l2 can be had for 230$.
That is the enevitable consequence is the final exhaustion of the supply of the component. Yet I could say that I have a "top notch" 2005 version of AMD technology. With 2007 PARTS! Being on the exhausted end,I dont know who could feel better about this.
Wish AMD luck. Still with their record,I probably should say that I do not wish to look forward to exhaustion,at the same time as extingusishment,as in the mentioning of taking a break to what one pays for.
mino - Thursday, December 21, 2006 - link
Just my pint into the fire:X2 4200+EE & GF6150 board (MSI K9NGM2-FID)
$240 (170+75)
E6300 $ G965 board (ASUS P5B-VM)
$285 (185+100)
Conclusion:
Anything cheaper is K8 vs. Netburst so Intel is no contender.
Anything more expensive is K8 vs. C2D clock/clock so AMD is no contender (4800+65nm is more expensive than E6400 it matches by perf.)
For decent IGP-free boards the difference is comparable.
So, going for
stock performance the choice is simple:
<$300 for CPU+MB combo go AMD X2
>$300 for CPU+MB combo go Intel C2D
for overclocking:
<$240 for CPU+MB combo go AMD X2
$240-$285 -> wait and then Intel C2D
>$285 for CPU+MB combo go Intel C2D
for power consumption (i.e. PC's for office use):
AMD X2 3800+EE to 5000+EE (anything above or down is a waste of money in this case)
for single core:
<$190 for CPU + MB combo -> go AMD A64
$190-$230 for CPU + MB combo -> go AMD X2 at $240
>$240 see dual-core recommendations
That IMO sums up the whole Desktop PC market as of now.
mino - Thursday, December 21, 2006 - link
$190-$230 for CPU + MB combo -> go AMD X2 at $240should be:
$190-$230 for CPU + MB combo -> go AMD X2 3800+ at $200
mino - Thursday, December 21, 2006 - link
The prices are Newegg based.Shintai - Thursday, December 21, 2006 - link
Anand, can you test the 65nm K8 with lower res in games and a broader selection of games so we can more truely see the difference?http://www.firingsquad.com/hardware/amd_athlon_64_...">http://www.firingsquad.com/hardware/amd_athlon_64_...
If these numbers hold water, then 65nm K8s is a disaster in terms of gaming performance.
mongoosesRawesome - Thursday, December 21, 2006 - link
I'd be interested in seeing these same performance/watt graphs using the 965 chipset. The 680i is a power hog.mino - Thursday, December 21, 2006 - link
The same power hog is 590SLI for AMD.Actually 965 vs. RD580 would hurt Intel even more ... So, go figure.