Bookmark and Share


In order to demonstrate capabilities of its SpursEngine processor, Toshiba exhibited its Qosmio notebook with special FaceMation application installed at CEATEC exhibition in Japan. As expected, the new chip can easily perform geometry morphing and other operations given that it features four processing engines.

The FaceMation application by Toshiba can create a 3D model of a face and then perform various transformations with it as well as with hair. When face morphing and other operations rely on software, Toshiba Qosmio notebook powered by dual-core Intel Core 2 Duo processor was loaded by 80% while only rendering at 16 frames per second speed. When the same task was carried out by Toshiba SpursEngine, rendering speed rose to 30fps, whereas CPU load was 30%, according to a news-story at PC Watch web-site.

The SpursEngine can also unload decoding and encoding of MPEG-2 and H.264 from CPU, whereas synergetic processing engines (SPEs) can detect user’s motion commands, e.g. “stop playback”.

SpursEngine is a co-processor that integrates four of Cell high-performance RISC core SPEs, half the number of the full configuration, hardware dedicated to decoding and encoding of MPEG-2 and H.264 video, XDR memory interface as well as PCI Express interface. By combining the high level, real time processing software of the SPEs with the hardware video codecs, the SpursEngine realizes an optimized balance of processing flexibility and low power consumption. The prototype of SpursEngine operates at a clock frequency of 1.5GHz and consumes power at 10W to 20W.

Spurs Engine can be used in both consumer electronics and computer applications. For example, in the PC space it could process graphics or physics. Nevertheless, considering that four SPEs can offer tangible advantages over dual-core x86 chips, but would hardly rival contemporary graphics processors that feature 64 – 128 or even more processing engines, Toshiba’s new development will hardly find home in general-purpose PCs.


Comments currently: 4
Discussion started: 10/04/07 06:42:34 AM
Latest comment: 09/25/08 05:54:05 AM


Good way for Sony to get rid of all of those defective Cell Procs that don't make the PS3 grade. Sony is on to a winner with these Cell's, with this and the add on boards coming out, think we might see more companies developing this kinda proc, escpecially with the AMD Torrenza approach
0 0 [Posted by:  | Date: 10/05/07 12:43:00 AM]

There is a glaring and technically worrying misconception in the article at the end, and that is that the number of stream processors is all that matters (Spursengine/Cell vs GPUs). This is by far not the case, of way more importance is what the stream processors are capable of. And even though programmers complain about Cell being hard to program for, it is nothing compared to porting general purpose code to a GPU. Getting that to run fast is a real nightmare, and general purpose code is a lot different to running some shaders... Some algorithms are inherently unsuited for GPU processing, and that's even more than are already unsuited for Cell processing (though, thanks to the PS3 and Cell being used in supercomputers, there are breakthroughs where someone ports something to Cell that was said to be not possible before rather frequently!)

I'll give you three examples:

1) People at IBM tell me they can decode 2 1920x1080 H.264 streams in realtime on the PS3. I doubt Nvidia or ATI GPU can handle that, and we have no clue for how many tasks of the H.264 decoding pipeline they actually use the main CPU, because the task is inherently unsuited for GPU processing (CABAC aka Arithmetic Coding comes to mind!). There are realtime 1080p H.264 encoders for Cell (see: ). There's some software out that speeds up H.264 encoding called Badaboom that supposedly gets a 2x speedup on a 1440x1080 video. Ofcourse we have no idea what quality parameters both use, but let's just assume they both try to make their solution appear as fast as possible: A G80 would still only be about 1.5x as fast as a Cell, even though it has hundreds of stream processors more. And again it's questionable how much of the whole task in the GPU solution actually runs on the GPU...

2) Folding@home runs on PS3s Cell and Nvidia/ATI GPU. While GPUs are a lot faster at folding the jobs they're assigned by Stanford, the nature of these jobs also needs to be accounted for. On their FAQ page the F@h people say that only a very limited set of tasks can be run on the GPU client (very fast though!) and that only normal CPUs run the full breadth of simulations. And Cell is somewhere in between the two, meaning it lends itself for more tasks than a GPU, but still less than a CPU (assumably that's only for the tasks where double precision matters!). And it is also a lot faster than a CPU! ;-)

3) Let's look at a general purpose task like Raytracing. IBMs Realtime Cell raytracer vs the Saarland GPU one (that Intel also loves!), doing the Stanford bunny:
A PS3 with a somewhat limited Cell (6 SPUs of 8) is a little over 3x as fast as an Nvidia 8800 GTX in primary rays, but ofcourse in raytracing you atleast want secondary rays (what's the point of raytracing if you don't have reflections, right?), and here the PS3 is over 4x as fast as the 8800 GTX! And that is with the 8800 having many times the transistors, stream processors and theoretical GFLOPS of Cell!

Let's just face the facts: While modern GPUs *can* be used for other purposes than 3D gfx, it does not mean that they were *built* with anything else in mind! 3D FPS is what sells these things, not GFLOPS or protein folds per second!... Cell however was built to cover more breadth by having better programmability and more flexibility. I'm not sure, but I think it is still impossible to daisychain GPU stream processors to sequentially work on some data like you can on Cell (e.g. in Video decoding: IDCT -> Motion Estimation -> YUV scaling)..
0 0 [Posted by: DeeKay2  | Date: 09/25/08 05:54:05 AM]


Add your Comment

Related news

Latest News

Monday, July 28, 2014

6:02 pm | Microsoft’s Mobile Strategy Seem to Fail: Sales of Lumia and Surface Remain Low. Microsoft Still Cannot Make Windows a Popular Mobile Platform

12:11 pm | Intel Core i7-5960X “Haswell-E” De-Lidded: Twelve Cores and Alloy-Based Thermal Interface. Intel Core i7-5960X Uses “Haswell-EP” Die, Promises Good Overclocking Potential

Tuesday, July 22, 2014

10:40 pm | ARM Preps Second-Generation “Artemis” and “Maya” 64-Bit ARMv8-A Offerings. ARM Readies 64-Bit Cores for Non-Traditional Applications

7:38 pm | AMD Vows to Introduce 20nm Products Next Year. AMD’s 20nm APUs, GPUs and Embedded Chips to Arrive in 2015

4:08 am | Microsoft to Unify All Windows Operating Systems for Client PCs. One Windows OS will Power PCs, Tablets and Smartphones