Sizing 'em up
If you haven't caught on by now, the Radeon HD 4870 X2 is, well, really frickin' fast. We'll get to the game benchmarks shortly, but we can quantify the X2's prowess in several ways. Here it is compared to the most relevant competitors and some older cards of the same class.
|
Peak pixel fill rate (Gpixels/s) |
Peak bilinear texel filtering rate (Gtexels/s) |
Peak bilinear FP16 texel filtering rate (Gtexels/s) |
Peak memory bandwidth (GB/s) |
|
| GeForce 8800 GTX | 13.8 | 18.4 | 18.4 | 86.4 |
| GeForce 9800 GTX | 10.8 | 43.2 | 21.6 | 70.4 |
| GeForce 9800 GX2 | 19.2 | 76.8 | 38.4 | 128.0 |
| GeForce GTX 260 | 16.1 | 36.9 | 18.4 | 111.9 |
| GeForce GTX 280 | 19.3 | 48.2 | 24.1 | 141.7 |
| Radeon HD 2900 XT | 11.9 | 11.9 | 11.9 | 105.6 |
| Radeon HD 3870 | 12.4 | 12.4 | 12.4 | 72.0 |
| Radeon HD 3870 X2 | 26.4 | 26.4 | 26.4 | 115.2 |
| Radeon HD 4850 | 10.0 | 25.0 | 12.5 | 63.6 |
| Radeon HD 4870 | 12.0 | 30.0 | 15.0 | 115.2 |
| Radeon HD 4870 X2 | 24.0 | 60.0 | 30.0 | 230.4 |
Those are the theoreticals. Here's how the cards measure in 3DMark's synthetic tests.


For what it's worth, I continue to be befuddled by the Gtexels/second numbers coming out of 3DMark Vantage. The units have to be off; they don't match the capabilities of the cards. We first discovered this problem and asked FutureMark about it in early June. They have been very polite about telling us several times since then that the people who might fix this problem are on vacation. (Note to self: try to get job at FutureMark.) For now, we'll continue to assume the relative performance measured here tracks well, even if the units reported are incorrect.
And the numbers look quite nice for the 4870 X2, which beats out its semi-ostensible competition in the dual GeForce GTX 260 SLI in both tests and proves to be easily the fastest "single card" anywhere in these two key metrics.
Then again, shader processing is quickly becoming the primary performance constraint in newer games. Here's how the 4870 X2 stacks up in that regard.
|
Peak shader arithmetic (GFLOPS) |
||
| Single-issue | Dual-issue | |
| GeForce 8800 GTX | 346 | 518 |
| GeForce 9800 GTX | 432 | 648 |
| GeForce 9800 GX2 | 768 | 1152 |
| GeForce GTX 260 | 477 | 715 |
| GeForce GTX 280 | 622 | 933 |
| Radeon HD 2900 XT | 475 | - |
| Radeon HD 3870 | 496 | - |
| Radeon HD 3870 X2 | 1056 | - |
| Radeon HD 4850 | 1000 | - |
| Radeon HD 4870 | 1200 | - |
| Radeon HD 4870 X2 | 2400 | - |
Yeah, uh, 2.4 teraflops will probably do you pretty well. Nothing else out there even comes close. 3DMark has some synthetic shader tests that will give us a sense of the X2's delivered shader performance.




None of the multi-GPU solutions performance particularly well in the GPU cloth and GPU particles tests, including the X2. Its additional sideport bandwidth doesn't seem to be of any help, either; two 4870 cards in CrossFire perform similarly in those two tests.
The parallax occlusion mapping and Perlin noise tests are another story altogether. The X2 is the fastest single-card config, and dual X2s are even faster than three GeForce GTX 260 cards.
