The Pentium 4's new look
Wondering how Prescott got to have so many more transistors? The answer is that Prescott is a serious overhaul of the Netburst microarchitecture all Pentium 4s share. In fact, Prescott is arguably a more major revamp than the P6 core got during its long tenure at the heart of the Pentium Pro, Pentium II, and Pentium III processors. There are too many changes to cover in depth here, but I will attempt to summarize them and talk about the most significant modifications of the chip's design.
The watchwords for the Prescott changes are "higher clock frequencies." Virtually all the modifications to the Prescott core are intended to produce high performance while allowing the chip to run at clock speeds of 4GHz and beyond. Many of the radical elements of the original Netburst design are present here in even more radical form, including the deep main pipeline, execution trace cache, and ample amounts of speculative logic and prefetching. Most of these changes represent tradeoffs of various types, between, say, higher clock speeds and higher clock-for-clock performance, or, in many cases, between higher latencies and better peak performance. Generally, Prescott has been tuned for higher clock frequencies, and the choices Intel's design team has made reflect that emphasis.
With that said, we'll let the bullets start flying on our summary of Prescott's new features.
By making each stage of the pipeline less complex, Intel increases the processor's tolerance for running at higher clock speeds. In doing so, though, Intel's engineers have chosen to reduce clock-for-clock performance. This change, by itself, would significantly lower the number of instructions per clock (IPC) the Pentium 4 can execute. Higher clock speeds can offset a lower IPC, but Prescott starts out at only 3.4GHz, and Northwood runs at that speed, too.
Fortunately, there are a number of countervailing forces to take into account. For one thing, instruction latencies vary; not all instructions use all stages of the pipeline. More importantly, Prescott includes a whole raft of enhancements aimed at increasing its clock-for-clock performancesome in very specific ways. That's what the rest of these bullet points are about.
Before we move on, I should point out once more that taken in context, a lower IPC isn't necessarily a bad thing. Higher or lower IPCs in processor design are tradeoffs, and need not evoke a value judgment. What is true of the Pentium 4, and of Prescott more so than prior revisions, is that Intel has chosen to go full-bore the way of lower IPC and higher clock speeds. This "speed demon" approach to processor design seems to fit reasonably well with Intel's technological prowess in chip fabrication.
Like previous Netburst processors, Prescott's L1 instruction cache is an unconventional execution trace cache that holds decoded micro-ops for the processor's RISC-like core instead of CISC-style x86 instructions. Prescott's execution trace cache still holds roughly 12,000 micro-ops, but the chip can now encode more types of micro-ops into the trace cache, making it more efficient.
Of course, programs must be rewritten or recompiled to take advantage of SSE3 instructions, so we won't see SSE3's benefits immediately.
For instance, Prescott's branch prediction unit has been improved to avoid branch mispredictions, which will be more costly than ever with Prescott's long pipeline. One of the enhancements is the addition of an indirect branch predictor, borrowed from the work of the Pentium M team.
Another key change is a new shifter/rotator block added to one of the chip's simple arithmetic logic units, or ALUs. You will recall that the Pentium 4's simple ALUs run at twice the speed of the rest of the chip; that's still true for Prescott, and now one of the ALUs can handle shift and rotate operations. Also, Prescott now does integer multiplication in a dedicated integer multiplier instead of using the floating-point multiplier, as previous Netburst chips did.
There are also store-to-load forwarding enhancements, improvements to SSE/2/3 and x87 multimedia performance, and more.
All told, Prescott is a rather different animal from the Northwood and Willamette chips that precede it and share the Pentium 4 name. These changes will affect performance in ways that are difficult to predict. Instruction latencies will be higher, except where they're lower. The same is true for performance in general, and that's why we run the benchmarks.
Prescott pullin' the juice
There has been some concern, leading up to Prescott's launch, about how much power the chip will consume and how much heat it will produce. The key spec Intel provides in this realm is TDP, or Thermal Design Power. TDP is not, however, a peak power load number; it is a thermal design guideline. As Intel puts it, "The TDP is not the maximum power that the processor can dissipate." So we have something to go on there, but perhaps not much.
Northwood's TDP at 3.2GHz is 82W, while the Extreme Edition's is about 92W. Prescott's TDP at 3.2GHz is 103W. So yeah, this thing pulls some juice and generates some heat.
To manage Prescott's thermal prowess, Intel has created a new specification for thermals that allows for finer-grained control of fan speeds based on a value returned from the CPU. This value is set "based on the power dissipation of each unit," according to Intel, and combined with the thermal diode temp, will dictate safe fan speeds for coolers. Implementing this scheme will require motherboard changes, but not changes to the actual cooler designs. In fact, Intel-approved coolers for current Pentium 4s should work for Prescott at its initial speed grades.
Intel is also pushing a verification program for ATX cases, trying to ensure enclosures have proper venting and the like. Clearly, Intel is squeezing all it can from ATX while waiting for the new BTX form factor to arrive in force.
So the hundred dollar question is: will Prescott work with my motherboard? The answer is, as with so many things in life, it depends. These first Prescott chips drop into 478-pin sockets, just like Northwoods. Newer motherboards from top vendors have probably been ready for Prescott for some time, but they will have to provide adequate power for Prescott, and not all older motherboards can. So Intel's answer is, "Check with your motherboard manufacturer." We checked with Abit about our IC7-G test platform, and they were able to provide us with a Prescott-ready BIOS. Once we flashed to it, the Prescott ran like a champ on our board. Depending on your motherboard's age and power design, your mileage may vary.
|Friday night topic: Light bulbs? Yep, light bulbs||201|
|Smoky Back Room types, we need you||0|
|International Women's Day Shortbread||63|
|Newest Thermaltake Urban case has dual doors||22|
|Deal of the week: Discounted Windows and cheap storage||11|
|MSI gaming barebones has Mini-ITX mobo, external overclocking button||32|
|Fan-made Morrowind remake looks amazing||34|
|Thursday Night Shortbread||41|