by Ilya Gavrichenkov
05/16/2011 | 07:59 PM
It is believed that memory frequency and timings do not have that much influence over the performance in contemporary systems. So, it only makes sense to invest into high-seed memory modules if the other system components, such as CPU, graphics card and hard drive, are already running at the top of their ability. This reasoning didn’t emerge out of thin air. True, the tests show that improving the memory sub-system settings in Phenom II, Core i7 and Core i5 systems will only provide about 3-7% speed gain, which is a minor improvement.
However, these conclusions were first made a while back, so they are primarily true for the previous generation platforms. As for the influence the memory sub-system speed has on the overall performance of contemporary LGA1155 systems, we haven’t yet discussed anything like that. And it obviously makes little sense to translate the old results onto the new
Although the specifications of the
This internal reorganization had exceptionally positive effect on the memory sub-system performance. The memory controller got closer to the computational cores, so to speak. Now it is not only in the same semiconductor die as the L3 cache and the CPU cores, but also isn’t logically separated from them, because it is connected to the same ring bus, which connects all inter-processor components with one another. This connection to the ring bus working at the processor frequency ensured a significant increase in bandwidth between the processor cores, L3 cache and memory controller.
We clearly see that processors from the
As we see,
Increased efficiency is not the only advantage of the new memory controller.
The memory frequency in Core processors is set as the base clock generator frequency (BCLK) times a corresponding multiplier. The multipliers supported by each processor type actually determine the acceptable frequency intervals.
However, do not forget that new LGA1155 systems do not allow overclocking by raising the base clock generator frequency. The changes made to the clocking algorithms in LGA1155 systems allow only a miniscule adjustment of the BCLK frequency: in most cases the system will immediately lose its stability if the BCLK is increased by more than 5% above the nominal setting of 100 MHz. In other words, while in LGA1156 systems we could increase BCLK frequency to overclock not only the processor, but also the memory, this approach will not work in the new LGA1155 platforms. Now overclocking is merely the multiplier adjustment, so it is quite logical that the new
At the same time you do not need any tricks to ensure that
To check out whether our assumptions are true, we put together an LGA1155 system on a quad-core Core i5-2500K processor from the overclocking-friendly K-series that features an unlocked clock frequency multiplier. We completed the system with a pair of DDR3-2100 memory modules from GeIL: GeIL EVO ONE PC3-17000, that support a wide range of frequencies and latencies. As a result, in the end our test system consisted of the following components:
In nominal mode the technologies responsible for interactive management of the processor clock frequency, namely – Turbo Boost and Intel Enhanced SpeedStep – remained active.
In overclocked mode Turbo Boost technology was disabled, but Intel Enhanced SpeedStep remained up and running. The CPU clock frequency was set at 4.7 GHz.
The memory was tested in the following modes, which represent the settings of the today’s most popular DDR3 SDRAM kits:
First of all we are going to check out the synthetic tests of the memory sub-system performance. We will test the actual bandwidth and latency using Cachemem benchmark built into AIDA64 utility.
The obtained results reveal a few interesting things. First of all, I would like to point out right away that there is a significant difference in actual memory sub-system speeds between the memory modules with different frequencies and timings. By simply increasing the DDR3 SDRAM speed from 1067 MHz to 2133 MHz we stimulate a gigantic 60% increase in practical bandwidth. We haven’t seen anything like that in systems based on other processors, which indicates clearly that there are really no serious bottlenecks on the bus fragment between the processor cores and system memory.
Secondly, it is quite symptomatic that not only the read speed, but also the write speed depends on the memory modules frequency. There was no dependency like that in the previous-generation systems at all, or it was really minimal. This peculiarity of the
Thirdly, I have to say that DDR3 memory modules frequency has a greater effect on the memory sub-system performance than their timings. In fact, lower timings produce just a little lower practical latency, while by simply setting the memory frequency one 266-MHz increment higher we can easily outdo the effect from lowering the timings.
As a result, we can conclude that it definitely makes sense to use overclocker memory in LGA1155 systems. However, we should have our preferences set for higher frequency rather than lower timings. Anyway, now we are talking only about the results of synthetic benchmarks, which serve to estimate the system performance during work with the memory.
To estimate the average platform performance, PCMark Vantage measures the speed of actual popular algorithms users work with every day. And here we no longer see any dramatic performance differences between the systems featuring memory modules with different specifications. The memory frequency increase by one 266-MHz increment produces a barely noticeable 1-2% performance gain. And the performance difference between the system equipped with the fastest DDR3-2133 and the slowest DDR3-1067 memory is only 5% in nominal mode and 6% in overclocked mode.
According to a popular 3DMark 11 graphics test, the graphics sub-system performance doesn’t really depend on the memory speed at all.
However, besides the general graphics score, 3DMark 11 also generates another score, which is particularly interesting in our specific case – Physics rating. This number is produced by a specific physics test that emulates the work of a complex mechanical system with a large number of objects.
It turns out that the mathematical calculations performed within this test are pretty sensitive to the memory speed. And by simply increasing its frequency you can significantly boost the performance up to 15-20%. Note that the effect from the increase in the memory sub-system performance is most noticeable in an overclocked system. However, when our test Core i5-2500K works at its nominal frequency, most of the performance growth occurs in the interval between DDR3-1067 and DDR3-1600. Faster memory modules have less obvious effect on the performance in the physics test.
To test the processors performance during data archiving we take WinRAR archiving utility. Using maximum compression rate we archive a folder with multiple files 1.1 GB in total size.
Different applications react differently to the changes in the memory sub-system parameters. And although the average dependency between the performance and memory frequency or timings is usually not very prominent, other situations are also possible. Archiving is actually one of these situations: you can’t underestimate the importance of the memory sub-system performance here. It is remarkable that when our Core i5-2500K overclocked to 4.7 GHz works with the slow DDR3-1067 or DDR3-1333 SDRAM, it is slower than a non-overclocked processor working in tandem with faster DDR3-1866 or DDR3-2133 SDRAM. In fact, this is not surprising at all, because the 266 MHz increase in the memory frequency leads to about 5-10% acceleration in data compression speed. The memory timings have a much smaller effect: one increment either way causes about 2-3% change in the compression time.
We measured the performance in Adobe Photoshop using our own benchmark made from Retouch Artists Photoshop Speed Test that has been creatively modified. It includes typical editing of four 10-megapixel images from a digital photo camera.
The memory sub-system speed does affect the overall performance during image processing, but its influence is not so visible. Even if we compare the time it takes to complete the test using the slowest memory vs. the time it takes to complete the same test using the fastest memory, the results won’t exceed 3.5% for a non-overclocked system and 5.5% for an overclocked one.
In order to measure how fast our testing participants can transcode a video into H.264 format we used x264 HD benchmark. It works with an original MPEG-2 video recorded in 720p resolution with 4 Mbps bitrate. I have to say that the results of this test are of great practical value, because the x264 codec is also part of numerous popular transcoding utilities, such as HandBrake, MeGUI, VirtualDub, etc.
The results are almost the same as we have just seen in Photoshop. This process also doesn’t care much for the memory sub-system performance.
We use special Cinebench test to measure the final rendering speed in Maxon Cinema 4D.
Looks like low dependence of the system performance on the memory sub-system speed and timings settings is typical for
At the same time, there are some applications that use operating memory very actively, and therefore react immediately to any changes in its speed. These applications are 3D games.
As you know, it is the graphics subsystem that determines the performance of the entire platform equipped with pretty high-speed processors in the majority of contemporary games. Therefore, we do our best to make sure that the graphics card is not loaded too heavily during the test session: we select the most CPU-dependent tests and all tests are performed without antialiasing and in far not the highest screen resolutions. In other words, obtained results allow us to analyze not that much the fps rate that can be achieved in systems equipped with contemporary graphics accelerators, but rather how well contemporary processors can cope with gaming workload.
As we can see, gamers should really take memory speed into consideration. Of course, the situation is different in different games, but all in all one 266-MHz increment increase in the memory frequency produces about 2% gain in fps rate in nominal mode and about 3-4% in a system with an overclocked processor. Therefore, choosing the right memory for a gaming computer should be taken seriously enough. Slow DDR3 SDRAM modules may turn into a system bottleneck that will prevent the processor and graphics card from unveiling their true potential. Especially, since there are some games (in our case it is F1 2010), where you can gain a fps or two by simply playing around with memory timings. Not to mention a significant performance boost resulting from the increase in the memory frequency.
Another modification of the memory controller that happened during the launch of the new processors on Sandy Bridge microarchitecture deserves our most positive feedback. Intel engineers not only managed to fix the issues in the memory controller of the previous generation Westmere processors, but also to create a new controller, which turned out the highest performing of all existing modifications. Due to elimination of all major bottlenecks between the computational cores and the memory controller, Sandy Bridge proved to be more dependent on the specifications of the DDR3 SDRAM modules in the system than the predecessors or competitors.
However, it doesn’t change the situation in a larger scale. Every time when we discussed the effects of memory speed on the overall performance in certain configurations, we arrived at the conclusion that these effects were quite insignificant. This conclusion that we made back in the days for Socket AM3 and LGA1156 systems proved true one more time. It is also valid for Sandy Bridge based platforms and is backed up by the test results. The results show that the 266 MHz increase in the memory frequency produces only 2-4% growth in the average performance. And by setting all latencies one step lower we can only boost the performance by 1-2% at best.
However, all this doesn’t mean that you should disregard the need to make an educated decision on the best memory for your LGA1155 system. A slight practical effect from the use of faster memory is an average picture. At the same time, there are applications that work with large amounts of data and their performance depends much greater on the DDR3 SDRAM specifications. Among applications like that are, for example, some contemporary games, where you can gain a few extra frames per second by simply upgrading your memory.
This uncertainty together with pretty wide range of DDR3 SDRAM prices on modules with different specifications do not allow us to give specific recommendations regarding the best memory choices for Sandy Bridge platform. However, in general terms, you should keep in mind two things. Firstly, the memory frequency is of greater importance for the overall system performance than the memory timings. Secondly, the additional financial investments into faster memory may not pay back in the long run. In particular, high-speed DDR3-2133 and DDR3-1866 modules may cost 1.5-2 times more than the ordinary DDR3-1333 SDRAM.
Therefore, we believe that inexpensive DDR3-1600 SDRAM with not very aggressive timings would be the most reasonable choice for contemporary LGA1155 systems: in our opinion, memory like that offers the best price-to-performance ratio today.