x264 HD video encoding
This benchmark tests performance with one of the most popular H.264 video encoders, the open-source x264. The results come in two parts, for the two passes the encoder makes through the video file. I've chosen to report them separately, since that's typically how the results are reported in the public database of results for this benchmark. These scores come from the newer, faster version 0.59.819 of the x264 executable.
I'm at a bit of a loss to express the reality of what we're seeing. Across a broad mix of applications, the Xeon W5580 isby farthe fastest processor we've ever tested. Yes, this is a very high end part, but Intel's new architecture is unquestionably effective.
We've included this final test largely just to satisfy our own curiosity about how the different CPU architectures handle from SSE extensions and the like. SiSoft Sandra's "multimedia" benchmark is intended to show off the benefits of "multimedia" extensions like MMX, SSE, and SSE2. According to SiSoft's FAQ, the benchmark actually does a fractal computation:
This benchmark generates a picture (640x480) of the well-known Mandelbrot fractal, using 255 iterations for each data pixel, in 32 colours. It is a real-life benchmark rather than a synthetic benchmark, designed to show the improvements MMX/Enhanced, 3DNow!/Enhanced, SSE(2) bring to such an algorithm.
The benchmark is multi-threaded for up to 64 CPUs maximum on SMP systems. This works by interlacing, i.e. each thread computes the next column not being worked on by other threads. Sandra creates as many threads as there are CPUs in the system and assignes [sic] each thread to a different CPU.
The benchmark contains many versions (ALU, MMX, (Wireless) MMX, SSE, SSE2, SSSE3) that use integers to simulate floating point numbers, as well as many versions that use floating point numbers (FPU, SSE, SSE2, SSSE3). This illustrates the difference between ALU and FPU power.
The SIMD versions compute 2/4/8 Mandelbrot point iterations at once - rather than one at a time - thus taking advantage of the SIMD instructions. Even so, 2/4/8x improvement cannot be expected (due to other overheads), generally a 2.5-3x improvement has been achieved. The ALU & FPU of 6/7 generation of processors are very advanced (e.g. 2+ execution units) thus bridging the gap as well.
We're using the 64-bit version of the Sandra executable, as well.
Well, OK, then.
|Raspberry Pi Compute Module 3 flaunts a quad-core SoC||11|
|Imagination Technologies freshens up mid-range PowerVR GPUs||1|
|be quiet! unveils entry-level Pure Base 600 chassis||15|
|Sapphire launches Radeon RX 460 with 1024 SPs in China||10|
|Google RAISR upsamples thumbnails for massive bandwidth savings||56|
|Biostar's Z270 boards race to the finish||20|
|Synology RT2600ac offers up speedy Wi-Fi and tight controls||5|
|Deals of the week: a gaming monitor and system components||17|
|Nintendo reveals Switch launch date, pricing, and initial line-up||70|