Intel. Strikes. Back. The massive 18-core Core i9-7980X and 16-core Core i8-7960X are the chipmaker’s response to AMD’s Ryzen Threadripper, which has been eating Intel’s lunch for many months.
But can Goliath Intel rise from its stunning defeat to challenge David AMD to a re-match? Can it shake off rumors of clock speeds and high temperatures? To find out you’ll have to read on.
Because there’s so much to say about Core i9, we put the prices, features and FAQs into a separate story you’ll want to read for background. For this review, we’ll walk through some of the under-the-hood issues directly related to performance, and then we’ll dive into the benchmarks.
Core i9, under the hood
Core i9 is the first new “Core i” Intel has introduced in 10 years. The company guarded the secret so closely that it even intentionally mislabeled the first round of chips (including our review sample) as “Core i7” to throw off leakers. Our 16- and 18-core samples are labeled correctly, however.
Like most major Intel launches, the Core i9 family represents a new platform, not just a new CPU, which means a new chipset, the X299, and a new socket, the LGA2066, all incompatible with previous CPUs.
The new platform also does something no previous one did by unifying two CPU families. Before today, if you wanted the company’s latest Kaby Lake core, you had to buy a motherboard using the LGA1151 socket. And if you wanted to buy, say, a 6-core Skylake CPU such as Intel’s Core i7-6800K, you had to buy an LGA2011 V3-based motherboard.
With X299 and LGA2066, you can now pick your poison, because the platform encompasses everything from a 4-core Core i5 Kaby Lake CPU to an 18-core Core i9 Extreme Edition, which is a Skylake CPU. For clarity’s sake, the Kaby Lake CPUs, also called Kaby Lake-X, are the Core i5-7640X and the Core i7-7740X. The rest of the Core i7 and Core i9 CPUs are Skylake, collectively called Skylake-X.
The Core X series is made up of CPUs constructed with Skylake-X cores and Kaby Lake-X cores. The monster 18-core part is due in October.
This union has been greeted with some confusion and trepidation. It’s likely that X299 motherboards will be expensive. Some are rightly wondering who would buy a $350 motherboard to install a $250 CPU.
Intel’s motives for the Kaby Lake-X may actually be a nod to the overclocking sports. Unlike LGA1151-socketed Kaby Lake chips, Kaby Lake-X chips have no integrated graphics capability. In fact, we’ve been told the chips physically have no IGP on the die at all. This allows the two Kaby Lake-X chips to overclock potentially far higher than the LGA1151 versions. At the recent Computex show in Taipei, in fact, Intel said a record was set for the highest overclock of a Kaby Lake chip, and it was on X299.
In a perfect world, we’d all have 18-core CPUs, but the truth is there are those who buy cheap CPUs on nice motherboards. Kaby Lake-X is for you.
PCIe lanes: Still being rationed
Still, having Kaby Lake-X and Skylake-X on the same socket is bound to create confusion. Exhibit A is the PCIe rationing. With the Core i9-7900X, for example, you get quad-channel memory support and 44 PCIe Gen 3 lanes directly from the CPU. If you were to drop a Core i7-7740K into the same build, the motherboard drops down to dual-channel memory support. Perhaps worse, the PCIe lanes drop down to 16, because that’s the maximum supported by the Kaby Lake core. That means some of the slots on a motherboard would fall back in performance or be completely disabled.
While Kaby Lake-X’s 16-lane limit is due to the CPU’s design, Intel dials back PCIe lanes on Skylake-X intentionally. Rather than the 44 lanes the 10-core version gets, the 6-core and 8-core versions of Skylake-X get just 28 lanes. From what we understand, there’s no technical reason for it, just “market segmentation,” which is a business school way of saying, “so we can charge you more.” Oy.
Intel VROC
Even more controversial than PCIe rationing is Intel’s VROC, or Virtual RAID on CPU. It’s a nifty feature on Skylake-X that allows a user to configure up to 20 NVMe PCIe drives in RAID into a single bootable partition.
The problem? Intel apparently intends to charge more money for the feature. Details haven’t been released, but vendors at Computex told us they believed RAID 0 would be free, RAID 1 would cost $99, and RAID 5 and RAID 10 could cost $299. Once you’ve ponied up the cash, you get a hardware dongle that unlocks the feature.
It gets worse: VROC will work only with Intel SSDs and pricier Skylake-X parts. If you buy Kaby Lake-X, you’re shut out. VROC also applies only to PCIe RAID that runs directly through the CPU’s PCIe lanes. X299 still supports various RAID 0, 1, 5, 10 through the chipset, but the chipset RAID won’t touch the performance you get from VROC.
We’ll reserve final judgment until Intel confirms pricing. Considering the face-palms it caused at Computex, we’re interested to see how this shakes out if it ever does.
How Core i9 changes Skylake
Once you’ve gotten beyond the platform confusion and controversy, there’s a reward. The Skylake-X chip itself is indeed something to admire, because it’s built unlike any previous high-end Intel consumer chip.
Previous “enthusiast” or “extreme” CPUs have mostly been the same. That is, a 4-core Haswell Core i7-4770K wasn’t all that different from from an 8-core Haswell-E Core i7-5960X except for the support of quad-channel RAM.
With Skylake-X, Intel breaks from tradition, with some major tinkering under the hood. The most noticeable is an increase in Mid-Level Cache (MLC), or L2 cache: Intel has quadrupled it to 1MB per core, up from 256MB in last year’s Broadwell-E and the majority of Intel’s CPUs. The Last-Level Cache (L3) actually gets smaller, with 1.375MB per core versus the 2.5MB of the previous Broadwell-E chip, but Intel compensates with the larger MLC and also the use of a non-inclusive cache design. Compared to the inclusive design in Broadwell-E, which might keep data that’s not needed, non-inclusive cache attempts to track what should be in the cache so it can more efficiently use the available space.
Intel also ditches the ring bus architecture it has used for several years (including Kaby Lake and Skylake) for a new mesh architecture. If you think of a quad-core CPU as four homes connected by a bus line that makes stops at each home, it’s perfectly fine until you add, say, 12 or 18 homes to the community. You could connect two bus lines, but that still isn’t as fast as simply driving from one home to the next, which is what the new mesh architecture does.
Intel’s use of a mesh design clearly puts it on a better footing to compete with Threadripper, as more and more cores are added to CPUs. AMD’s Ryzen series uses something it calls an Infinity Fabric, which is essentially a super-high-speed mesh network.
The last feature worth noting is the improved Turbo Boost Max 3.0. This is the feature wherein Intel identifies the “best” CPU core at the factory and gives it a little more boost speed. With Broadwell-E CPUs, only one core was chosen. With Skylake-X, two cores are identified as the “best” and allowed to run a couple of hundred megahertz faster.
18-core Core i9 performance
For our performance testing, we ejected the 10-core Core i9-7900X from the socket of our Asus Prime X299-Deluxe and installed our 18-core Core i9-7980X. Other than that, the components haven’t changed from our original Core i9-7900X review, including: a GeForce GTX 1080 Founders Edition, 32GB of DDR4/2600 RAM, and a HyperX 240GB Savage SATA SSDs. For our Adobe Premiere CC 2017 test, the source project and the target drive used a Plextor M8pe PCIe SSD in all but the Core i5 and the Ryzen 5 CPUs. This exception is due to a problem with the Ryzen 5’s motherboard, which failed to recognize the Plextor drive. A Samsung 960 Pro NVMe SSD was swapped in. The AMD Ryzen Threadripper 1950X remains the same as when we first reviewed the chip.
Due to time constraints, some of the tests also feature scores from a Core i9-7960X—the 16-core version of the chip. The CPU was used in a pair of identical Falcon Northwest Talon systems we used for our upcoming Threadripper vs. Core i9 showdown. Although that system features completely different GPUs, the CPU is bone stock, so the CPU-only tests are valid to compare.
Cinebench R15 performance
Our first test is CineBench R15, a free 3D rendering test based on Maxon’s professional Cinema4D engine. It’s almost entirely CPU bound and also scales very well as you increase the number of CPU cores and threads.
The top dog is not a surprise: Intel’s 18-core Core i9-7980X, with the 16-core Core i9-7960X getting the silver medal. AMD’s Threadripper 1950X, once undisputed among consumer CPUs, has to settle for the bronze.
But the placing of that Threadripper 1950X is nothing to be ashamed of. Yes, AMD fans, yes, we know: It costs significantly less. Let’s just acknowledge that now so you can read the rest of this review without constantly want to scream, “but it cost a ton less!” Just silently repeat that phrase after every test result you see.
Multi-threaded performance isn’t the end-all be all. The sad truth is the vast majority of applications and games just don’t use all of those cores, so we also use CineBench to measure single-threaded performance. Surprise: The Core i9-7980X comes out on top, pulling ahead of even the higher clocked Core i7-7700K CPU. For the most part we really see three tiers of performance, with the Kaby Lake and Skylake-X chips on top, and then Broadwell and Zen-based chips under that.
To keep this all in perspective, we’re really not looking at a huge difference between a Skylake-X chip and a Broadwell-E or Zen chip. But the winner is clearly Core i9 and Skylake-X.
POV Ray performance
The Persistence of Vision Raytracer actually traces its lineage back to the day of the Commodore Amiga, and it continues to be supported by an active community of developers. Like Cinebench, it favors high core- and thread-count chips. The results are predictable, with 18-core Core i9-7980X on top. That 16-core Ryzen Threadripper 1950X does pretty well, but two more cores pay real dividends.
As we still want to know how CPUs do under much lighter loads, we also run POV Ray using a single thread. The results again favor high-clock-speed quad-cores, but the Skylake-X chips hang close, and the Zen and Broadwell-E chips aren’t that far behind. The only dog here is AMD’s ancient Vishera-based FX CPU.
Blender performance
Our next test is the open-source Blender modeler. It’s a popular app used to create effects for many indie movies. Blender performance results can vary greatly by the workload. For example, sample benchmarks provided for Intel’s quad-core Kaby Lake and AMD’s Ryzen don’t really scale with core count. For this test, we run the popular Mike Pan BMW benchmark. The winners again are Intel’s two latest Core i9 CPUs, with the Threadripper 1950X coming in slightly behind them.
It’s a good showing for all three but, again, Blender’s performance very much depends on what model and what you’re doing. We’ve also found Blender to be very sensitive to the operating system.
Because these are the big-nerd chips, we decided to throw something heavier at them in the form of the Gooseberry benchmark file. It’s a benchmark frame from the Blender Institute’s upcoming open-source movie: Cosmos Laundromat. While the BMW benchmark takes a couple of minutes or three to run, Gooseberry pushes it to more than 20 minutes to render a frame.
The results in Gooseberry look great for the new Core i9s and definitely paint a worse picture for the 16-core Threadripper 1950X in our Falcon Northwest Talon PC. Even worse, a test on our reference machine was a bit slower, too.
Performance
Intel sent us the Core i9-7900X in an Asus Prime X299-Deluxe motherboard. We ran the testbed with the Anniversary Update build of Windows 10. Yes, we know, the world has moved on to the Creators Update, but in order to compare it with past CPUs we stuck with this earlier build.
All of the systems (except where noted) used a GeForce GTX 1080 Founders Edition, 32GB of DDR4/2133 RAM, and HyperX 240GB Savage SATA SSDs. For our Adobe Premiere CC 2017 test, the source project and the target drive used a Plextor M8pe PCIe SSD in all but the Core i5 and the Ryzen 5 CPUs. This exception is due to a problem with the Ryzen 5’s motherboard, which failed to recognize the Plextor drive. A Samsung 960 Pro NVMe SSD was swapped in.
Where we sourced from our previous tests, those tests used the same Nvidia drivers, the same OS, and the same hardware that we used for this Core i9-7900X review. We did, however, decide to update the testbed for the original 10-core Broadwell-E Core i7-6950X. That test was originally conducted on a very early Asrock X99 motherboard that didn’t fully support Intel’s new Turbo Boost Max 3.0 technology. This time around, we used the same Asus X99-Deluxe II that we used for testing the two Broadwell-E chips in the comparison pool.
Cinebench R15 performance
Our first test is Maxon’s Cinebench R15. It’s a free benchmark based on the same rendering engine used in Maxon’s Cinema4D product. It scales well with core count and frequency and is pretty much a pure CPU test. The results speak well for the the 10-core CPUs when compared to the 8-core parts. Even though we’re increasing thread count by only 22 percent, we’re seeing almost a 30-percent increase in performance.
The difference between the 10-core Broadwell-E Core i7-6850X and the 10-core Skylake-X Core i9-7900X is less than expected. According to Intel, you might see up to a 10-percent difference in multi-threaded tasks and up to 15 percent in single-threaded tasks when compared to the Broadwell-E 10-core. In Cinebench, we’re seeing just about 3.5 percent.
What changed? The motherboard. What we’re likely seeing is a result of more than a year of tuning by Asus of its X99 platform. It just pushes the CPU far harder and far faster than the first motherboard. Our initial review of the CPU in this test gave it a score of 1,792, which is quite a bit off from the 2,107 we’re seeing from it now. Other initial reviews put the chip in the low-1,800s. If that score remained true, Skylake-X would be almost 20 percent faster than Broadwell-E.
We also ran Cinebench R15 limited to just a single thread. Because the vast majority of applications and games still rely on a single thread, the performance here is just as important as it is on multi-threaded tests. The 10-core Broadwell-E now drops back a few spots, as its clock speeds can’t keep up with the 10-core Skylake-X chip’s. You can also see that the CPUs with the higher clock speeds move ahead of the 6- and 8-core chips. All except for the 10-core Core i9-7900X.
Large 8- and 10-core chips have had trouble keeping up with the spry quad-cores in high clock speeds. Intel started fixing that in Broadwell-E, but if this Cinebench result holds true, Skylake-X has the potential to hang with Kaby Lake just fine.
POV-Ray performance
The second test we’ll highlight is the POV-Ray. It’s a free ray tracer that goes back to the Commodore Amiga. Using the built-in benchmark, we saw the 10-core Core i9 outpace the 10-core Core i7 by about 8 percent, which is closer to Intel’s 10-percent claim. Against the 8-core Core i7-6900K, we’re seeing Skylake-X outpace it by roughly 36 percent.
POV-Ray also has a single-threaded test. Although the Core i9-7900X can’t quite keep up with the spry Core i7-7700K, it’s pretty close. We’re seeing about a 13-percent gap between the Core i9 and the Core i7-6950X, too, which is just a hair shy of Intel’s claim of 15 percent.
Blender performance
Our last 3D rendering test is the open-source Blender test. Using Peter Pan’s popular BMW benchmark we’re seeing a scant 2 percent gap between the 10-core Core i9 and 10-core Core i7 chips.
For kicks, we also used Blender on AMD’s custom Ryzen workload (which you can find here.) The performance difference between Core i9 and Core i7 is minimal.
WinRAR performance
Moving on to compression tests, we used WinRAR’s built-in benchmark to measure the compression performance of the various chips. One thing you’ll notice is we no longer break out the performance of the Ryzen 5 1600X and the Core i5-7600K CPUs. That’s because both of those were tested with the RAM set at DDR4/2933. Memory bandwidth doesn’t matter that much in 3D rendering tests, but it definitely can tilt the scales in compression tests. Rather than cloud the results, we’re dropping them.
One thing you’ll notice is that the 10-core Core i9 suddenly takes second place to the 10-core Core i7 chip. We also tried this test with the latest beta version of WinRAR and saw no change. We surmised this might be the cache design of the new chip, but after talking with Intel, the company suggested it could be the new mesh design.
7-Zip performance
A second compression test we ran is 7-Zip’s built-in benchmark. Like WinRAR, we suspect it prefers the cache design of the Core i7-6950X more than the new one in the Core i9-7900X. It’s not enough to matter, but the upshot is there are going to be some applications favor Broadwell-E over Skylake-X.
Adobe Premiere Creative Cloud 2017 performance
For video editing, we tasked the CPUs with exporting a video shot by our studio on a Sony 4K camera. The project was exported using the Blu-ray preset and the Maximum Render option enabled in Premiere, which helps when video is resized. We also opted to use the Mercury software engine, which relies on the CPU for the encode rather than the GPU. Many use the GPU for encoding today, but CPU encoding is still the standard for image quality.
The Core i9-7900X registered about a 7 percent advantage over the Core i7-6950X CPU, which isn’t bad, and close to the “up to 10 percent” Intel promised.
We know people will say none of this matters because “I use my GPU for encoding,” so we also ran the same test using the Mercury CUDA engine in Premiere CC 2017. This means the GeForce GTX 1080 was tasked for much of the heavy lifting.
We saw an immediate improvement in export times, but if you look at the results—the CPU still matters. In fact, the 10-core boxes still win by a decent amount. If you were encoding a multi-hour project, the 10 cores would be worth the extra cash.
Handbrake encode
Our last encoding test uses the free Handbrake to convert a 30GB 1080p MKV file using Handbrake 0.9.9 Android Tablet preset. The test is multi-threaded and scales well with clock speed. The winner is the Core i9-7900X, which comes in—cha-ching—about 10 percent faster than the 10-core Core i7-6950X. We’re also seeing nice scaling: The 10-core is about 30 percent faster than the 8-core Core i7-6900K and 60 percent faster than the 6-core Core 7-6800K.
3DMark Fire Strike performance
For gaming performance, we first run Futuremark’s 3DMark Fire Strike. We’re reporting only the physics portion of the test, as that’s the only one that matters for the CPU. The test uses a real-world physics engine that scales well with core count. Oddly, the Core i7-6950X nudges the Core i9-7900X out of the way, perhaps because of the cache difference between the chips or the mesh architecture. Note, though, that this is a theoretical test of what a game could do if it stressed all those cores. In reality, games don’t devote this much to game physics.
Tomb Raider performance
Moving on to a real game, we use Ubisoft’s older Tomb Raider to measure CPU performance by running the game at 1920×1080 resolution and the normal preset. At this low game setting and relatively low resolution for a GeForce GTX 1080, the only difference we’re likely seeing is clock speed. Each CPU’s cache can occasionally move the needle, too.
The Ryzen 7 1800X chip performance is off, likely due to code that isn’t optimized for its microarchitecture. Case in point, Rise of the Tomb Raider recently received an update that greatly helped Ryzen out. And for the most part, it’s not an issue at higher game settings where the GPU is the bottleneck on performance.
You can also see from our results why Ryzen’s performance was so confusing: CPUs don’t matter in conventional gaming as much as people wish they would.
Tom Clancy’s Rainbow Six Siege performance
We’ll close out our gaming performance of Skylake-X with the more modern Tom Clancy’s Rainbow Six Siege. The Core i9-7900X is slightly slower than the Kaby Lake and 8- and 10-core Broadwell-E chips. We suspect, again, that the game slightly favors the cache design of Broadwell-E and Kaby Lake, but it’s not a big deal. Lackluster Ryzen performance is again possibly linked to game optimization.
IPC performance
One difficult thing to discern with most of these benchmarks is just how “efficient” each CPU microarchitecture is. One way to tease that out is by running a test using a single thread, with all the CPUs locked to the same clock speed. We locked most of the CPUs here to 2.5GHz and turned off any Turbo Boost. We then ran Cinebench R15.037 (which we used to generate scores for the older CPUs).
We can see that instructions per clock (IPC) has built itself up slowly from the days of Sandy Bridge. Skylake-X comes out in front of even Kaby Lake, surprisingly.
Note: The FX-8370 is fast because we couldn’t underclock the CPU to 2.5GHz, so we just used its score running at 4GHz. Yes, the performance of Vishera at 4GHz is still below that of a Haswell CPU running at 2.5GHz. Woof.
IPC isn’t everything, so we’ll close this out with a big chart of Cinebench R15 scores that we’ve personally run on various CPU models. Keep in mind, they’re not exactly one-to-one, as some CPUs have higher-clocked RAM, or run on DDR3 instead of DDR4. But Cinebench is mostly a CPU benchmark, so memory bandwidth doesn’t affect it as much as it would in some tests.
Conclusion
If you point your eyes at that last chart, which includes just about every Intel design represented since Sandy Bridge, you can only conclude that the new Core i9-7900X is the fastest consumer CPU ever produced by Intel. There’s just no argument. The fact that it’s being offered at $1,000 compared to the $1,723 tag on the previous 10-core part is another reason for the PC community to cheer.
The problem is, it’s a different world now. At $1,000 for a 10-core chip, you’re paying about 100 percent over an 8-core Ryzen 7 chip for about 30 percent more performance. Even worse, we still don’t know what price AMD set for its 12-core and 16-core Threadripper chips. If AMD introduces a 12-core CPU at $850, as some predict, a 10-core Core i9 for $1,000 loses its luster.
For now, the Core i9-7900X reigns as the fastest consumer CPU on the planet. But it should be looking over its shoulder, as will we, for Threadripper.
Sоurсе: pcworld.com