Thanks for the link.
Looks like they are running multiple WU to improve utilization....but still only getting 55% usage on the card.
I must also note that my Radeon HD 7950 with 28 CUs is running 1200MHz core and 1700MHz memory (4000 GigaFLOPS SP), whilst my old Radeon HD 6850 with 12 CUs is only running at 860Mhz core an 1250MHz memory (1700 GigaFLOPS SP). Running 5 work units with my 7950, each taking 45 minutes to complete, equals 9 minutes per completed WU at 55% utilization. Meanwhile, my 6850s running 5 work units take around 77 minutes to complete, so 15.5 minutes per completed task with 90% utilization.
Running many WU leads to CPU/bus bottleneck...
Sounds like he is certainly improving on the standard performance, but still using a kludge instead of natively optimized client.
I'm not up for that much work right now.
But again, thanks for passing on the information.