News
 

Bookmark and Share

(0) 

Modern microprocessors contain up to sixteen high-performance processing cores that all need bandwidth to fetch enough data to be efficient. Despite that all leading-edge chips include techniques to hide long memory access latencies and maximize bandwidth, researchers from North Carolina State University claim that their technologies can improve bandwidth and boost performance of chips.

It is not a secret that each core within a multi-core central processing unit (CPU) needs to retrieve data from memory that is not stored on its chip. There is a limited pathway – or bandwidth – these cores can use to retrieve that off-chip data. As chips have incorporated more and more cores, the bandwidth has become increasingly congested – slowing down system performance.

One of the ways to expedite core performance is called prefetching. Each chip has its own small memory component, called a cache. In prefetching, the cache predicts what data a core will need in the future and retrieves that data from off-chip memory before the core needs it. Ideally, this improves the core’s performance. But, if the cache’s prediction is inaccurate, it unnecessarily clogs the bandwidth while retrieving the wrong data, which actually slows the chip’s overall performance.

The researchers from the NC State University propose two techniques: one improved efficiency of prefetching and another allocates bandwidth required to particular cores. Unfortunately, the research is highly theoretical and may not contain practical value.

“The first technique relies on criteria we developed to determine how much bandwidth should be allotted to each core on a chip. Some cores require more off-chip data than others. By better distributing the bandwidth to the appropriate cores, the criteria are able to maximize system performance,” said Dr. Yan Solihin, associate professor of electrical and computer engineering at NC State and co-author of a paper describing the research.

The researchers use easily-collected data from the hardware counters on each chip to determine which cores need more bandwidth.

“The second technique relies on a set of criteria we developed for determining when prefetching will boost performance and should be utilized as well as when prefetching would slow things down and should be avoided," said Mr. Solihin.

These criteria also use data from each chip’s hardware counters. The prefetching criteria would allow manufacturers to make multi-core chips that operate more efficiently, because each of the individual cores would automatically turn prefetching on or off as needed.

Utilizing both sets of criteria, the researchers were able to boost multi-core chip performance by 40%, compared to multi-core chips that do not prefetch data, and by 10% over multi-core chips that always prefetch data. Given the fact that all modern chips use prefetching (except, perhaps, many-core graphics chips), it means that the allocation techniques can boost performance by only 10% in certain cases.

Tags: Intel, AMD, IBM, Oracle, Fujitsu

Discussion

Comments currently: 0

Add your Comment




Related news

Latest News

Friday, May 24, 2013

6:09 pm | Second-Generation Kinect Sensor for Windows Due in 2014 – Microsoft. Microsoft Discloses Additional Details About Kinect 2

4:24 pm | New Technique May Open Up an Era of Atomic-Scale Semiconductor Devices. Atom-Scale Semiconductor Devices May Be Incoming, Thanks to New Researchers

Thursday, May 23, 2013

11:30 pm | Kinect Support Is Not Mandatory for Xbox One Video Games – Microsoft. Microsoft Will Not Require Compulsory Support of Kinect from Xbox One Games

11:20 pm | Thermaltake Publishes List of PSUs Compatible with Intel Cori i “Haswell” Chips. 20 PSUs from Thermaltake Are Compatible with Next-Gen Intel Chips

11:10 pm | European Amazon Stores Start to List Xbox One with €599 Price-Tag. Microsoft Xbox One May Cost €599 in Europe, If First Listings Are Correct

9:28 pm | Apple to Assemble Macs in Texas, Set to Manufacture Parts Across the U.S. Apple’s Plan to Move Production Back to U.S. Gets Shape

9:12 pm | Microsoft Confident in Lack of Quality Issues with Xbox One Hardware. Microsoft Vows Xbox One Will Not Have RROD-Like Issues

8:52 pm | AMD Officially Launches New-Generation APUs for Mobile Applications [UPDATED]. AMD Introduces Kabini, Temash and Richland Accelerated Processing Units

6:51 pm | OCZ Reveals Vertex 450 Solid-State Drives: High-End Performance at Mainstream Prices. OCZ Introduces New SSDs Based on Indilinx Barefoot 3 Controller

3:40 pm | Nvidia Unveils GeForce GTX 780: GK110-Based Consumer Solution for $649. Nvidia’s Cut Down Titan LE Becomes GeForce GTX 780