The theory—and practice
To see how the changes to the GeForce GTX 260 affect its position in the grand scheme, here's a comparison of some recent graphics cards. Note that we've derived these numbers from the actual clock speeds of the cards we're testing rather than from the stock clock speeds established by the GPU vendors.

Peak
pixel
fill rate
(Gpixels/s)
Peak bilinear
texel
filtering
rate
(Gtexels/s)
Peak bilinear
FP16 texel
filtering
rate
(Gtexels/s)
Peak
memory
bandwidth
(GB/s)
Peak shader
arithmetic (GFLOPS)
Single-issue Dual-issue
GeForce 9500 GT 4.4 8.8 4.4 25.6 90 134
GeForce 9600 GSO 6.7 26.6 13.3 38.5 259 389
GeForce 9600 GT 11.6 23.2 11.6 62.2 237 355
GeForce 9800 GT 9.6 33.6 16.8 57.6 339 508
GeForce 9800 GTX+ 11.8 47.2 23.6 70.4 470 705
GeForce 9800 GX2 19.2 76.8 38.4 128.0 768 1152
GeForce GTX 260 16.1 36.9 18.4 111.9 477 715
GeForce GTX 260 216 SPs 18.1 46.7 23.3 117.9 607 910
GeForce GTX 280 19.3 48.2 24.1 141.7 622 933
Radeon HD 4650 4.8 19.2 9.6 16.0 384 -
Radeon HD 4670 6.0 24.0 12.0 32.0 480 -
Radeon HD 3850 11.6 11.6 11.6 57.6 464 -
Radeon HD 4850 10.0 25.0 12.5 63.6 1000 -
Radeon HD 4870 12.0 30.0 15.0 115.2 1200 -
Radeon HD 4870 X2 24.0 60.0 30.0 230.4 2400 -

Like I said before, the GTX 260 Reloaded comes very, very close to the original GeForce GTX 280. By contrast, the Radeon HD 4870's theoretical throughput is unchanged by the addition of more RAM.

In theory, it would seem that the GTX 260 Reloaded should have the edge in fill rate and texturing capacity, while the 4870 1GB ought to have more shader power. However, these things are often complicated in actual use by the quirks and varying efficiencies of the GPU architectures in question. Here's how things shake out when we measure them with 3DMark's directed benchmark tests.

This is pretty much the inverse of what we expected: the 4870 1GB proves faster in the color and texture fill rate tests, while the GTX 260 Reloaded takes three out of the four shader processing tests. Funny how that works. Another intriguing result: the GTX 260 Reloaded outruns the GTX 280 in the GPU cloth and particles benchmarks. This result suggests Nvidia may be using only one or some subset of the GT200's thread processing clusters for certain types of work, such as vertex processing. In that case, the GTX 260 Reloaded's higher shader clocks would matter more than the GTX 280's additional TPC.