Putting those new shaders to work
| |Peak shader arithmetic (TFLOPS)|Memory bandwidth (GB/s)|
|GeForce GTX 280|0.6|142|
|GeForce GTX 480|1.3|177|
|GeForce GTX 580|1.6|192|
|Radeon HD 5870|2.7|154|
|Radeon HD 6970|2.7|176|
|Radeon HD 7970|3.8|264|
The first couple of tests above, the cloth and particles simulations, primarily use vertex and geometry shaders to do their work. In those tests, the 7970 easily outperforms the 6970, but it's not quite as fast as the two Fermi-based GeForces. As we've noted, vertex processing remains a strength of Nvidia's architecture.
Boy, things turn around in a hurry once we move into the last three tests, which rely on pixel shader throughput. True to form, AMD's older GPUs tend to outrun the GeForces in these tests, since they're quite efficient with pixel-centric workloads. Even so, Tahiti is substantially faster. In a couple of cases, the 7970 delivers on its potential to crank out over twice the FLOPS of the GeForce GTX 580.
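For a quick sanity check on that "over twice the FLOPS" figure, here is a minimal sketch that computes the ratio of theoretical peaks from the table above. It assumes the first numeric column in that table is peak shader arithmetic in TFLOPS; the names `PEAK_TFLOPS` and `flops_ratio` are ours, purely for illustration, and the numbers are theoretical peaks rather than measured results.

```python
# Theoretical peak shader arithmetic, copied from the table above.
# Assumption: the values are in TFLOPS.
PEAK_TFLOPS = {
    "GeForce GTX 280": 0.6,
    "GeForce GTX 480": 1.3,
    "GeForce GTX 580": 1.6,
    "Radeon HD 5870": 2.7,
    "Radeon HD 6970": 2.7,
    "Radeon HD 7970": 3.8,
}


def flops_ratio(card: str, baseline: str) -> float:
    """Return how many times the peak FLOPS of `baseline` that `card` offers."""
    return PEAK_TFLOPS[card] / PEAK_TFLOPS[baseline]


if __name__ == "__main__":
    baseline = "GeForce GTX 580"
    for card in PEAK_TFLOPS:
        ratio = flops_ratio(card, baseline)
        print(f"{card}: {ratio:.2f}x the peak FLOPS of the {baseline}")
```

On paper, the 7970's peak works out to a bit under 2.4 times the GTX 580's, so a pixel shader test that shows the Radeon at better than double the GeForce's throughput is running fairly close to that theoretical gap.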
GPU computing performance
These results are instructive. When we move from pixel shaders into DirectCompute performance, the Fermi-based GeForces recapture the lead from the Cypress- and Cayman-based Radeons. The Radeons have much higher theoretical FLOPS peaks, but the GeForces tend to be more efficient here. Tahiti, though, changes the dynamic. The Radeon HD 7970 outruns the GTX 580 and is nearly 50% faster than the Cypress-based Radeon HD 5870.
LuxMark is a ray-traced rendering test that uses OpenCL to harness any compatible processor to do its work. As you can see, we've even included the Core i7-980X CPU in our test system as a point of comparison. Obviously, though, the 7970 is the star of this show. The newest Radeon nearly doubles the throughput of its elder siblings and nearly triples the performance of the Fermi-based GeForces.
We've only run a couple of GPU computing tests, so our results aren't the last word on the matter, but Tahiti may be the best GPU computing engine out there. AMD appears to have combined two very desirable traits in this chip's shader array: much higher utilization (and thus efficiency) than previous DX11-class Radeons, and gobs of FLOPS in the given chip area.