Before we dive into the test results, let's have a quick review of what makes the Pentium M unique. The quick-and-dirty line on the Pentium M is that it's a Pentium III core mated to a Pentium 4 bus, and that's not entirely inaccurate. However, the Pentium M is much more than just that.
Yes, it is based on the Pentium III, or more properly, the P6 core that started out in the Pentium Pro processor, which evolved into the Pentium II and then Pentium III. And the Pentium M does use essentially the same bus protocol as the Pentium 4, quad-pumped and everything. But the Pentium M has been extensively modified for better performance, higher clock speeds, and lower power consumption. In fact, the Pentium M's main pipeline is somewhat longer than the 10 stages in the original P6 core, although Intel is coy on exactly how many stages are involved. The number is probably closer to the 12 stages in the Athlon 64 than to the 20 stages in the original Pentium 4 Netburst architecture or the 31 stages in the P4 Prescott. Other factors aside, longer pipelines generally mean higher clock speeds and lower clock-for-clock performance. As we'll see, the Pentium M hits clock speeds similar to the Athlon 64 and delivers comparable performance at those speeds.
The Pentium M we're playing with here is actually the second generation of Pentium M, code-named Dothan. (Our review of the original Pentium M "Banias" core is here.) Dothan is manufactured on Intel's 90nm fab process, and it packs a healthy 2MB of L2 cache RAM onboard (along with the corresponding logic for prefetching data into the cache.) That's in addition to a 64KB L1 cache evenly subdivided between data and instruction caches. Thanks to the die shrink, Dothan's 140 million transistors are packed into a die that's only 84mm2, nearly the same size as the original Pentium M Banias core, which had only 1MB of L2 cache. Compare that, if you dare, to the P4 Prescott's 122mm2 die size, or the massive 192mm2 die of the 130nm Athlon 64. The 90nm Athlon 64 "Winchester" also has an 84mm2 die, but that chip has only 512K of L2 cache. I don't have the exact numbers, but I believe 90nm Opterons with 1MB of L2 are expected to be about 100mm2.
The impressive thing about the Pentium M is that the entire processor core was designed, massaged, and tweaked in order to cut down on the amount of power it required. Intel's Israel-based design team used extensive statistical analysis in order to guide its decisions in making tradeoffs between performance and power consumption, and the Pentium M CPU is the result of that process. That's not to say that the Pentium M is full of compromises that harm performance. To the contrary, some of the very best types of power optimizations are performance enhancements, because getting work done in fewer CPU cycles can save power. Also, the Pentium M team didn't lean too aggressively toward saving power because the CPU is only a small part of overall system power consumption in a laptop, where things like the hard drive and LCD display can dominate the battery life equation. For these reasons, the Pentium M may very well make good sense as a desktop processor, even when raw performance is one of the user's primary concerns.
Intel has produced some very informative papers on the Pentium M's design, and I can't go into too much depth about such things here, but I would encourage you to read them if you would like more info. There's one on power savings and another on microarchitecture and performance. I will give you the highlights, though, of some of the changes made to increase the Pentium M's performance and power efficiency. Among them:
Intel claims micro-ops fusion cuts micro-ops by over 10% in Banias, leading to performance gains of 5% for integer code and 9% for floating-point. The additional logic for micro-ops fusion does consume more power, but Intel says the additional performance offsets this effectan instruction sequence requires less energy to complete. The Dothan core apparently fuses even more instructions, although we don't yet have any details on which or how many.
|The TR Podcast 162: Apple's biggest and Nvidia's fastest||12|
|ARM announces faster Cortex-M core for embedded apps||7|
|Nvidia wants to sell you LED-infused SLI bridges||30|
|Microsoft unveils a wireless display dongle of its own||37|
|Micro Center selling AOC's 24'' G-Sync monitor for $450||25|
|Steam storefront revamped with Discovery Update||16|
|Reversible, USB Type-C cables can pass DisplayPort signals alongside data and power||47|
|Early deal of the week: Delicious SSD discounts||20|
|New Gmail accounts no longer require Google+||24|
|You married well.||+52|