x264 HD video encoding
This benchmark tests performance with one of the most popular H.264 video encoders, the open-source x264. The results come in two parts, for the two passes the encoder makes through the video file. I've chosen to report them separately, since that's typically how the results are reported in the public database of results for this benchmark. These scores come from the newer, faster version 0.59.819 of the x264 executable.
I'm at a bit of a loss to express the reality of what we're seeing. Across a broad mix of applications, the Xeon W5580 is by far the fastest processor we've ever tested. Yes, this is a very high-end part, but Intel's new architecture is unquestionably effective.
We've included this final test largely just to satisfy our own curiosity about how the different CPU architectures benefit from SSE extensions and the like. SiSoft Sandra's "multimedia" benchmark is intended to show off the benefits of "multimedia" extensions like MMX, SSE, and SSE2. According to SiSoft's FAQ, the benchmark actually does a fractal computation:
This benchmark generates a picture (640x480) of the well-known Mandelbrot fractal, using 255 iterations for each data pixel, in 32 colours. It is a real-life benchmark rather than a synthetic benchmark, designed to show the improvements MMX/Enhanced, 3DNow!/Enhanced, SSE(2) bring to such an algorithm.
The benchmark is multi-threaded for up to 64 CPUs maximum on SMP systems. This works by interlacing, i.e. each thread computes the next column not being worked on by other threads. Sandra creates as many threads as there are CPUs in the system and assignes [sic] each thread to a different CPU.
The benchmark contains many versions (ALU, MMX, (Wireless) MMX, SSE, SSE2, SSSE3) that use integers to simulate floating point numbers, as well as many versions that use floating point numbers (FPU, SSE, SSE2, SSSE3). This illustrates the difference between ALU and FPU power.
The SIMD versions compute 2/4/8 Mandelbrot point iterations at once - rather than one at a time - thus taking advantage of the SIMD instructions. Even so, 2/4/8x improvement cannot be expected (due to other overheads), generally a 2.5-3x improvement has been achieved. The ALU & FPU of 6/7 generation of processors are very advanced (e.g. 2+ execution units) thus bridging the gap as well.
We're using the 64-bit version of the Sandra executable, as well.
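For readers who want a concrete picture of the workload the FAQ describes, here is a minimal sketch in Python. It is not Sandra's code. The function names, the region of the complex plane, and the fixed column stride per thread are all our own assumptions; the FAQ's wording suggests threads may instead claim the next free column as they go. Only the 640x480 image size, the 255-iteration cap, and the idea of splitting columns across threads come from the quoted text.

```python
from concurrent.futures import ThreadPoolExecutor

# Image size and iteration cap taken from the quoted FAQ.
WIDTH, HEIGHT, MAX_ITER = 640, 480, 255


def escape_count(cx, cy, max_iter=MAX_ITER):
    """Count iterations of z -> z*z + c before |z| exceeds 2, capped at max_iter."""
    zx = zy = 0.0
    for i in range(max_iter):
        if zx * zx + zy * zy > 4.0:
            return i
        zx, zy = zx * zx - zy * zy + cx, 2.0 * zx * zy + cy
    return max_iter


def columns_for(worker, n_workers, width=WIDTH):
    """Assumed interleaving: worker k takes columns k, k + n, k + 2n, ..."""
    return range(worker, width, n_workers)


def render_columns(worker, n_workers, width=WIDTH, height=HEIGHT):
    """Compute every column assigned to one worker; returns {column: [counts]}."""
    result = {}
    for x in columns_for(worker, n_workers, width):
        # Hypothetical viewport: real axis [-2.5, 1.0], imaginary axis [-1.25, 1.25].
        cx = -2.5 + 3.5 * x / (width - 1)
        result[x] = [
            escape_count(cx, -1.25 + 2.5 * y / (height - 1)) for y in range(height)
        ]
    return result


def render(n_workers=4, width=WIDTH, height=HEIGHT):
    """Render the image as a list of columns, one worker thread per column set."""
    image = {}
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        jobs = pool.map(
            lambda w: render_columns(w, n_workers, width, height), range(n_workers)
        )
        for part in jobs:
            image.update(part)
    return [image[x] for x in range(width)]
```

Calling `render(n_workers=4)` produces 640 columns of 480 iteration counts each. Python threads won't actually run this arithmetic in parallel, so the sketch shows only how the work is divided, not the speedup. The SIMD versions the FAQ mentions would run the inner loop of `escape_count` on 2, 4, or 8 points at once.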
Well, OK, then.