The theoryand practice
To see how the changes to the GeForce GTX 260 affect its position in the grand scheme, here's a comparison of some recent graphics cards. Note that we've derived these numbers from the actual clock speeds of the cards we're testing rather than from the stock clock speeds established by the GPU vendors.
|
Peak pixel fill rate (Gpixels/s) |
Peak bilinear texel filtering rate (Gtexels/s) |
Peak bilinear FP16 texel filtering rate (Gtexels/s) |
Peak memory bandwidth (GB/s) |
Peak shader arithmetic (GFLOPS) |
||
| Single-issue | Dual-issue | |||||
| GeForce 9500 GT | 4.4 | 8.8 | 4.4 | 25.6 | 90 | 134 |
| GeForce 9600 GSO | 6.7 | 26.6 | 13.3 | 38.5 | 259 | 389 |
| GeForce 9600 GT | 11.6 | 23.2 | 11.6 | 62.2 | 237 | 355 |
| GeForce 9800 GT | 9.6 | 33.6 | 16.8 | 57.6 | 339 | 508 |
| GeForce 9800 GTX+ | 11.8 | 47.2 | 23.6 | 70.4 | 470 | 705 |
| GeForce 9800 GX2 | 19.2 | 76.8 | 38.4 | 128.0 | 768 | 1152 |
| GeForce GTX 260 | 16.1 | 36.9 | 18.4 | 111.9 | 477 | 715 |
| GeForce GTX 260 216 SPs | 18.1 | 46.7 | 23.3 | 117.9 | 607 | 910 |
| GeForce GTX 280 | 19.3 | 48.2 | 24.1 | 141.7 | 622 | 933 |
| Radeon HD 4650 | 4.8 | 19.2 | 9.6 | 16.0 | 384 | - |
| Radeon HD 4670 | 6.0 | 24.0 | 12.0 | 32.0 | 480 | - |
| Radeon HD 3850 | 11.6 | 11.6 | 11.6 | 57.6 | 464 | - |
| Radeon HD 4850 | 10.0 | 25.0 | 12.5 | 63.6 | 1000 | - |
| Radeon HD 4870 | 12.0 | 30.0 | 15.0 | 115.2 | 1200 | - |
| Radeon HD 4870 X2 | 24.0 | 60.0 | 30.0 | 230.4 | 2400 | - |
Like I said before, the GTX 260 Reloaded comes very, very close to the original GeForce GTX 280. By contrast, the Radeon HD 4870's theoretical throughput is unchanged by the addition of more RAM.
In theory, it would seem that the GTX 260 Reloaded should have the edge in fill rate and texturing capacity, while the 4870 1GB ought to have more shader power. However, these things are often complicated in actual use by the quirks and varying efficiencies of the GPU architectures in question. Here's how things shake out when we measure them with 3DMark's directed benchmark tests.






This is pretty much the inverse of what we expected: the 4870 1GB proves faster in the color and texture fill rate tests, while the GTX 260 Reloaded takes three out of the four shader processing tests. Funny how that works. Another intriguing result: the GTX 260 Reloaded outruns the GTX 280 in the GPU cloth and particles benchmarks. This result suggests Nvidia may be using only one or some subset of the GT200's thread processing clusters for certain types of work, such as vertex processing. In that case, the GTX 260 Reloaded's higher shader clocks would matter more than the GTX 280's additional TPC.
| Friday night topic: The trouble with Best Buy | 133 |