AMD Athlon 64 4000+ & FX-55: A Thorough Investigation
by Anand Lal Shimpi on October 19, 2004 1:04 AM EST
Posted in CPUs
Workstation Applications
Visual Studio 6
Carried over from our previous CPU reviews, we continue to use Visual Studio 6 for a quick compile test. We are still using the Quake 3 source code as our test and measure compile time in seconds. The results are pretty much in line with what we've seen in the past.
Without compiler speed optimizations for NetBurst, the Pentium 4 architecture is not very well suited for highly branchy applications such as compiling and code optimization. The Athlon 64 is far better suited for this type of usage pattern and thus AMD wins the compiler test.
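For readers who want to run a similar compile-time test, here is a minimal sketch of how the timing could be scripted. It is not the harness we used for this review; the names `time_runs` and `summarize` are hypothetical, and the build step is passed in as a callable so that nothing about the actual compiler command is assumed.

```python
import statistics
import time
from typing import Callable, Dict, List


def time_runs(build: Callable[[], None], repeats: int = 3) -> List[float]:
    """Run `build` several times and return each wall-clock duration in seconds."""
    durations = []
    for _ in range(repeats):
        start = time.perf_counter()  # monotonic, high-resolution timer
        build()
        durations.append(time.perf_counter() - start)
    return durations


def summarize(durations: List[float]) -> Dict[str, float]:
    """Report the fastest and the median run; lower is better for compile time."""
    return {"best": min(durations), "median": statistics.median(durations)}
```

In practice `build` would wrap whatever command drives the compile, for example a `subprocess.run(..., check=True)` call, with a clean step between runs so that each pass rebuilds everything.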
SPECviewperf 8
For our next set of professional application benchmarks we turn to SPECviewperf 8. SPECviewperf is a collection of application traces taken from some of the most popular professional applications and compiled into a single set of benchmarks, which is used to estimate performance in the applications being modeled. With version 8, SPEC has significantly improved the quality of the benchmark, making it a better real-world indicator of performance.
We have included SPEC's official description of each one of the 8 tests in the suite.
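Each viewset reports a single composite number built from several individual tests. As we understand SPEC's documentation, that composite is a weighted geometric mean of the per-test frame rates; treat that as an assumption here, since the descriptions quoted in this article do not spell it out. The sketch below shows the calculation, with a hypothetical function name and made-up inputs rather than SPEC's actual weights.

```python
import math
from typing import Sequence


def weighted_geometric_mean(values: Sequence[float], weights: Sequence[float]) -> float:
    """Weighted geometric mean: exp(sum(w * ln(v)) / sum(w))."""
    if not values or len(values) != len(weights):
        raise ValueError("values and weights must be non-empty and the same length")
    if any(v <= 0 for v in values):
        raise ValueError("frame rates must be positive")
    total_weight = sum(weights)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    log_sum = sum(w * math.log(v) for v, w in zip(values, weights))
    return math.exp(log_sum / total_weight)


# Made-up frame rates and weights, purely for illustration.
composite = weighted_geometric_mean([4.0, 9.0], [0.5, 0.5])
```

A geometric mean keeps one very fast test from dominating the composite, which is why a score of this kind tends to reward consistent performance across the whole viewset.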
3dsmax Viewset (3dsmax-03)
"The 3dsmax-03 viewset was created from traces of the graphics workload generated by 3ds max 3.1. To insure a common comparison point, the OpenGL plug-in driver from Discreet was used during tracing.
The models for this viewset came from the SPECapc 3ds max 3.1 benchmark. Each model was measured with two different lighting models to reflect a range of potential 3ds max users. The high-complexity model uses five to seven positional lights as defined by the SPECapc benchmark and reflects how a high-end user would work with 3ds max. The medium-complexity lighting models uses two positional lights, a more common lighting environment.
The viewset is based on a trace of the running application and includes all the state changes found during normal 3ds max operation. Immediate-mode OpenGL calls are used to transfer data to the graphics subsystem."
The biggest surprise here is the large performance hit (13%) that the Athlon 64 takes when moving down to a single channel memory subsystem. The Pentium 4 560 also does better than expected, coming in on the heels of the Athlon 64 FX-55.
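A percentage like the 13% above is a relative drop measured against the faster configuration. The sketch below shows the arithmetic; the function name is hypothetical and the two scores are invented to illustrate the calculation, not taken from our results.

```python
def percent_drop(baseline: float, reduced: float) -> float:
    """Return how far `reduced` falls below `baseline`, as a percentage of `baseline`."""
    if baseline <= 0:
        raise ValueError("baseline score must be positive")
    return (baseline - reduced) / baseline * 100.0


# Invented scores: 20.0 for a dual channel setup, 17.4 for single channel.
drop = percent_drop(20.0, 17.4)
```

With those invented scores the drop works out to about 13%. Dividing by the faster score matters: the same gap expressed against the slower score would read as a gain of roughly 15%.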
CATIA Viewset (catia-01)
"The catia-01 viewset was created from traces of the graphics workload generated by the CATIA™ V5R12 application from Dassault Systemes.
Three models are measured using various modes in CATIA. Phil Harris of LionHeart Solutions, developer of CATBench2003, supplied SPEC/GPC with the models used to measure the CATIA application. The models are courtesy of CATBench2003 and CATIA Community. The car model contains more than two million points. SPECviewperf replicates the geometry represented by the smaller engine block and submarine models to increase complexity and decrease frame rates. After replication, these models contain 1.2 million vertices (engine block) and 1.8 million vertices (submarine).
State changes as made by the application are included throughout the rendering of the model, including matrix, material, light and line-stipple changes. All state changes are derived from a trace of the running application. The state changes put considerably more stress on graphics subsystems than the simple geometry dumps found in older SPECviewperf viewsets.
Mirroring the application, draw arrays are used for some tests and immediate mode used for others."
The single channel Athlon 64 3400+ does exceptionally poorly here, with the dual channel Athlon 64 parts holding a significant performance advantage. By now it's no surprise to see the FX-55, 4000+ and 3800+ at the top of the charts.
Interestingly enough, the CATIA benchmark appears to favor Intel's Prescott core over Northwood.
EnSight (ensight-01)
"The ensight-01 viewset replaces the Data Explorer (dx) viewset. It represents engineering and scientific visualization workloads created from traces of CEI's EnSight application.
CEI contributed the models and suggested workloads. Various modes of the EnSight application are tested using both display-list and immediate-mode paths through the OpenGL API. The model data is replicated by SPECviewperf 8.0 to generate 3.2 million vertices per frame.
State changes as made by the application are included throughout the rendering of the model, including matrix, material, light and line-stipple changes. All state changes are derived from a trace of the running application. The state changes put considerably more stress on graphics subsystems than the simple geometry dumps found in older viewsets.
Mirroring the application, both immediate-mode and display-list modes are measured."
No 925X-based test system would complete this test, so all we have are AMD chips to look at. Once again we see that the 3400+ is seriously crippled by its single channel memory interface.
Lightscape Viewset (light-07)
"The light-07 viewset was created from traces of the graphics workload generated by the Lightscape Visualization System from Discreet Logic. Lightscape combines proprietary radiosity algorithms with a physically based lighting interface.
The most significant feature of Lightscape is its ability to accurately simulate global illumination effects by precalculating the diffuse energy distribution in an environment and storing the lighting distribution as part of the 3D model. The resulting lighting "mesh" can then be rapidly displayed."
Maya Viewset (maya-01)
"The maya-01 viewset was created from traces of the graphics workload generated by the Maya V5 application from Alias.
The models used in the tests were contributed by artists at NVIDIA. Various modes in the Maya application are measured.
State changes as made by the application are included throughout the rendering of the model, including matrix, material, light and line-stipple changes. All state changes are derived from a trace of the running application. The state changes put considerably more stress on graphics subsystems than the simple geometry dumps found in older viewsets.
As in the Maya V5 application, array element is used to transfer data through the OpenGL API."
Pro/ENGINEER (proe-03)
"The proe-03 viewset was created from traces of the graphics workload generated by the Pro/ENGINEER 2001™ application from PTC.
Two models and three rendering modes are measured during the test. PTC contributed the models to SPEC for use in measurement of the Pro/ENGINEER application. The first of the models, the PTC World Car, represents a large-model workload composed of 3.9 to 5.9 million vertices. This model is measured in shaded, hidden-line removal, and wireframe modes. The wireframe workloads are measured both in normal and antialiased mode. The second model is a copier. It is a medium-sized model made up of 485,000 to 1.6 million vertices. Shaded and hidden-line-removal modes were measured for this model.
This viewset includes state changes as made by the application throughout the rendering of the model, including matrix, material, light and line-stipple changes. The PTC World Car shaded frames include more than 100MB of state and vertex information per frame. All state changes are derived from a trace of the running application. The state changes put considerably more stress on graphics subsystems than the simple geometry dumps found in older viewsets.
Mirroring the application, draw arrays are used for the shaded tests and immediate mode is used for the wireframe. The gradient background used by the Pro/E application is also included to better model the application workload."
When the Athlon was first released, it was a very solid performer in Pro/ENGINEER, but back then very few companies would consider Athlon workstations. With the Athlon 64, times have obviously changed, but the performance advantage has not: AMD continues to lead the way in the proe-03 viewset.
SolidWorks Viewset (sw-01)
"The sw-01 viewset was created from traces of the graphics workload generated by the Solidworks 2004 application from Dassault Systemes.
The model and workloads used were contributed by Solidworks as part of the SPECapc for SolidWorks 2004 benchmark.
State changes as made by the application are included throughout the rendering of the model, including matrix, material, light and line-stipple changes. All state changes are derived from a trace of the running application. The state changes put considerably more stress on graphics subsystems than the simple geometry dumps found in older viewsets.
Mirroring the application, draw arrays are used for some tests and immediate mode used for others."
Unigraphics (ugs-04)
"The ugs-04 viewset was created from traces of the graphics workload generated by Unigraphics V17.
The engine model used was taken from the SPECapc for Unigraphics V17 application benchmark. Three rendering modes are measured -- shaded, shaded with transparency, and wireframe. The wireframe workloads are measured both in normal and anti-aliased mode. All tests are repeated twice, rotating once in the center of the screen and then moving about the frame to measure clipping performance.
The viewset is based on a trace of the running application and includes all the state changes found during normal Unigraphics operation. As with the application, OpenGL display lists are used to transfer data to the graphics subsystem. Thousands of display lists of varying sizes go into generating each frame of the model.
To increase model size and complexity, SPECviewperf 8.0 replicates the model two times more than the previous ugs-03 test."
89 Comments
RaistlinZ - Tuesday, October 19, 2004
I may have missed it, but does anyone know if the Athlon 64 4000+ will be multiplier unlocked like the FX-53 is? That's the only thing I see that would differentiate the two chips.
RaistlinZ - Tuesday, October 19, 2004
Illissius - Tuesday, October 19, 2004
Re: the necessity of Prescott. You are missing one very important consideration: Prescott has iAMD64 support. (Although it is currently disabled, no doubt because Intel has intentions of selling you the same processor twice.) A simple die shrink of Northwood would not.
I half suspect one of the reasons for Prescott's problems could be that AMD's 64-bit extensions don't mesh very well with a NetBurst architecture, but they had to shoehorn it in anyway, and had to make a lot of unappealing design decisions in the process. (I've never designed a processor, though, so this is just baseless speculation.) I'd be interested in seeing 64-bit enabled chips on a Pentium M architecture...
CrystalBay - Tuesday, October 19, 2004
Moore's law is dead... :(
Runamile - Tuesday, October 19, 2004
Awesome read. Great job. And HOLY COW does Intel get their a$$ handed to them!
I would have liked to see some price/performance curves too. That would have summed it up quite nicely.
hertz9753 - Tuesday, October 19, 2004
Athlon 64 3700+ 2.4GHz 1MB 64-bit
Athlon 64 3400+ 2.4GHz 512KB 64-bit
Athlon 64 3400+ 2.2GHz 1MB 64-bit
araczynski - Tuesday, October 19, 2004
nice, but luckily i still see no reason to upgrade my 2.4@3.3, at least not for a few measly benchmark FPS.
hertz9753 - Tuesday, October 19, 2004
AlphaFox - Tuesday, October 19, 2004
I'd like to see some kind of comparison with an OC XP Mobile. I have one running at 2.46GHz and not really sure how it stacks up here...
PrinceGaz - Tuesday, October 19, 2004
An excellent article, well done.
About the only thing missing was a bit of overclocking of the FX-55 to see if the introduction of strained silicon considerably increased the headroom. Obviously it has allowed them to ship parts rated at 2.6GHz which they weren't previously able to do, but how much better is the FX-55 compared to a CG-stepping FX-53? Does the use of strained silicon mean the FX-55 is a new stepping?