Highly Defined: ATI Radeon HD 2000 Architecture Review (page 4)


Category: Video

by Anton Shilov , Yaroslav Lyssenko, Alexey Stepin

[ 05/15/2007 | 07:34 AM ]


When 64 Equals 320: Unified Shader Processor of ATI Radeon HD 2000

ATI’s new shader processor is more complex than those in the Radeon X1000 or the Nvidia GeForce 8000 series.

Each pixel processor of the R500 series contained 2 scalar and 2 vector ALUs and a branch execution unit. Thus, it was capable of executing up to 4 instructions per clock cycle plus 1 branch instruction. The new shader processor of the R600 chip incorporates 5 scalar ALUs, each capable of executing one floating-point MAD (Multiply-ADD) instruction per cycle; one of the five can also execute transcendental instructions such as SIN, COS, LOG and EXP. The sixth unit in the R600 shader processor is a branch execution unit responsible for flow control instructions (comparisons, loops, subroutine calls). First introduced in the R520, this unit works with the dispatch processor to accelerate the processing of shaders with dynamic branching.

Besides that, each subunit is equipped with a dedicated array of general-purpose registers. In theory, each ALU should be able to access the registers of other shader processors, but it is unclear whether this is the case in practice. Integrating the general-purpose registers into the shader processor helps make GPUs more scalable, because reducing or increasing the number of shader processors automatically reduces or increases the number of registers.

Interestingly, ATI/AMD prefers to quote the total number of execution units (320) rather than mention the 64 shader processors with 5 ALUs in each. This approach is no worse than others, but you should realize that it is not quite correct to compare the number of ALUs in the GeForce 8800 directly with that in the Radeon HD 2000.
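The arithmetic behind the “64 equals 320” heading can be written out as a minimal sketch. The two counts come from the text above; the function and variable names are purely illustrative.

```python
# Figures taken from the article: 64 shader processors, 5 scalar ALUs in each.
SHADER_PROCESSORS = 64
ALUS_PER_PROCESSOR = 5


def total_alus(processors: int, alus_per_processor: int) -> int:
    """Return the total number of scalar ALUs across all shader processors."""
    return processors * alus_per_processor


r600_alus = total_alus(SHADER_PROCESSORS, ALUS_PER_PROCESSOR)
print(r600_alus)  # 320
```

The branch execution unit in each shader processor is deliberately left out of this count, since it handles flow control rather than arithmetic.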

We know that each of the Nvidia G80’s 128 shader processors (whose architecture still remains a mystery, by the way) can execute two scalar instructions, a MAD and a MUL, per clock cycle. Each of the AMD R600’s 64 shader processors can perform up to 5 instructions (including one transcendental instruction) plus a flow control instruction. Considering the difference in the frequencies of the execution units between the R600 and the G80, we can expect them to deliver similar overall performance. The developers’ data confirm this: Nvidia estimates the computing power of its GeForce 8800 GTX at approx. 520 gigaflops, whereas AMD estimates its Radeon HD 2900 XT at 475 gigaflops.
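The two vendor estimates can be roughly reproduced with a back-of-the-envelope calculation. The sketch below rests on assumptions not stated on this page: a shader clock of 1.35 GHz for the GeForce 8800 GTX, an engine clock of 0.742 GHz for the Radeon HD 2900 XT, and the usual convention that a MAD counts as 2 floating-point operations and a MUL as 1. The function name is hypothetical.

```python
def peak_gflops(units: int, flops_per_unit_per_clock: int, clock_ghz: float) -> float:
    """Theoretical peak throughput in gigaflops: units x flops per clock x clock (GHz)."""
    return units * flops_per_unit_per_clock * clock_ghz


# G80: 128 scalar processors, each issuing MAD (2 flops) + MUL (1 flop) per clock.
# Assumption: 1.35 GHz shader clock (not given in the article).
g80 = peak_gflops(units=128, flops_per_unit_per_clock=3, clock_ghz=1.35)

# R600: 64 shader processors x 5 ALUs, each issuing one MAD (2 flops) per clock.
# Assumption: 0.742 GHz engine clock (not given in the article).
r600 = peak_gflops(units=64 * 5, flops_per_unit_per_clock=2, clock_ghz=0.742)

print(f"G80:  {g80:.0f} gigaflops")
print(f"R600: {r600:.0f} gigaflops")
```

Under these assumptions the results land close to the 520 and 475 gigaflops quoted above. These are theoretical peaks: they say nothing about how often all five ALUs of an R600 shader processor can actually be kept busy.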
