With a little help from AMD, researchers at North Carolina State University have been able to increase the performance of general-purpose GPU computing by an average of over 20%. Their method requires a "fused architecture" in which the CPU and GPU reside on the same die and share both last-level cache and system memory. AMD's Llano-based A-series APUs qualify, as do Intel's Sandy Bridge processors.
As this press release explains, the approach involves using the CPU to prefetch data for the GPU. Dr. Huiyang Zhou, the associate professor of electrical and computer engineering who co-authored the team's paper on the subject, contends that this approach works because it allows CPUs and GPUs to focus on their strengths. Each one is capable of fetching data from memory at "approximately the same speed," but GPUs are much quicker to crunch the data once they have it. With the CPU fetching data for the GPU, which can then pull the data directly from cache, the researchers observed performance increases as high as 113%.
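To make the prefetching idea concrete, here is a toy model in Python. It is only a sketch, not the researchers' implementation. The `SharedCache` class, the `gpu_hit_rate` function, and the `prefetch_distance` parameter are all hypothetical names invented for illustration. The model treats the shared last-level cache as a small LRU cache and has a pretend CPU helper touch each address a few steps before the pretend GPU asks for it.

```python
from collections import OrderedDict


class SharedCache:
    """Tiny LRU cache standing in for the shared last-level cache (assumption)."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.lines = OrderedDict()

    def touch(self, addr):
        """Access addr. Return True on a hit; on a miss, fill the line and return False."""
        if addr in self.lines:
            self.lines.move_to_end(addr)  # mark as most recently used
            return True
        self.lines[addr] = True
        if len(self.lines) > self.capacity:
            self.lines.popitem(last=False)  # evict the least recently used line
        return False


def gpu_hit_rate(addresses, capacity, prefetch_distance):
    """Fraction of simulated GPU accesses that hit in the shared cache.

    prefetch_distance is how many accesses ahead the hypothetical CPU helper
    runs; 0 disables prefetching entirely.
    """
    cache = SharedCache(capacity)
    # The CPU helper gets a head start on the first few addresses.
    for addr in addresses[:prefetch_distance]:
        cache.touch(addr)
    hits = 0
    for i, addr in enumerate(addresses):
        ahead = i + prefetch_distance
        if prefetch_distance > 0 and ahead < len(addresses):
            cache.touch(addresses[ahead])  # CPU pulls a future line into the cache
        if cache.touch(addr):  # GPU access
            hits += 1
    return hits / len(addresses)


if __name__ == "__main__":
    stream = list(range(1000))  # streaming access pattern, every address unique
    print("no prefetch:  ", gpu_hit_rate(stream, capacity=64, prefetch_distance=0))
    print("with prefetch:", gpu_hit_rate(stream, capacity=64, prefetch_distance=8))
```

In this simplified model, a streaming workload never hits the cache on its own, while a helper that stays a few accesses ahead turns every GPU access into a hit. If the helper runs further ahead than the cache can hold, the lines are evicted before the GPU reaches them, which is one reason some workloads would benefit more than others.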
Obviously, some workloads will benefit more than others. The fact that this approach requires a processor with integrated graphics is also rather limiting, at least in practical terms. Even the fastest integrated GPUs are painfully slow compared to discrete graphics hardware. Thanks to CPU World for the tip.