AMD Stream Processor First to Break 1 Teraflop Barrier

June 16, 2008
AMD Stream Processor First to Break 1 Teraflop Barrier

At the International Supercomputing Conference, AMD today introduced its next-generation stream processor, the AMD FireStream 9250, specifically designed to accelerate critical algorithms in high-performance computing (HPC), mainstream and consumer applications.

Leveraging the GPU design expertise of AMD’s Graphics Product Group, AMD FireStream 9250 breaks the one teraflop barrier for single precision performance. It occupies a single PCI slot, for unmatched density and with power consumption of less than 150 watts, the AMD FireStream 9250 delivers an unprecedented rate of performance per watt efficiency with up to eight gigaflops per watt.

Customers can leverage AMD’s latest FireStream offering to run critical workloads such as financial analysis or seismic processing dramatically faster than with CPU alone, helping them to address more complex problems and achieve faster results. For example, developers are reporting up to a 55x performance increase on financial analysis codes as compared to processing on the CPU alone, which supports their efforts to make better and faster decisions. Additionally, the use of flexible GPU technology rather than custom accelerators assists those creating application-specific systems to enhance and maintain their solutions easily.

The AMD FireStream 9250 stream processor includes a second-generation double-precision floating point hardware implementation delivering more than 200 gigaflops, building on the capabilities of the earlier AMD FireStream 9170, the industry’s first GP-GPU with double-precision floating point support. The AMD FireStream 9250’s compact size makes it ideal for small 1U servers as well as most desktop systems, workstations, and larger servers and it features 1GB of GDDR3 memory, enabling developers to handle large, complex problems.

AMD enables development of the FireStream family of processors with its AMD Stream SDK, designed to help developers create accelerated applications for AMD FireStream, ATI FireGL and ATI Radeon GPUs. AMD takes an open-systems approach to its stream computing development environment to ensure that developers can access and build on the tools at any level. AMD offers published interfaces for its high-level language API, intermediate language, and instruction set architecture; and the AMD Stream SDK’s Brook+ front-end is available as open source code.

In keeping with its open systems philosophy, AMD has also joined the Khronos Compute Working Group. This working group’s goals include developing industry standards for data parallel programming and working with proposed specifications like OpenCL. The OpenCL specification can help provide developers with an easy path to development across multiple platforms.

“An open industry standard programming specification will help drive broad-based support for stream computing technology in mainstream applications,” said Rick Bergman, senior vice president and general manager, Graphics Product Group, AMD. “We believe that OpenCL is a step in the right direction and we fully support this effort. AMD intends to ensure that the AMD Stream SDK rapidly evolves to comply with open industry standards as they emerge.”

The growth of the stream computing market has accelerated over the past few years with Fortune 1000 companies, leading software developers and academic institutions utilizing stream technology to achieve tremendous performance gains across a variety of applications.

“Stream computing is increasingly important for mainstream and consumer applications and is no longer limited to just the academic or engineering industries. Today we are truly seeing a fundamental shift in emerging system architectures,” said Jon Peddie, president, Jon Peddie Research. “As the industry’s only provider of both high-performance discrete GPUs and x86-compatible CPUs, AMD is uniquely well-suited to developing these architectures.”

AMD customers, including ACCIT, Centre de Physique de Particules de Marseille, Neurala and Telanetix are using the AMD Stream SDK and current AMD FireStream, ATI FireGL or ATI Radeon boards to achieve dramatic performance gains on critical algorithms in HPC, workstation and consumer applications. Currently, Neurala reports that it is achieving 10-200x speedups over the CPU alone on biologically inspired neural models, applicable to finance, image processing and other applications.

AMD is also working closely with world class application and solution providers to ensure customers can achieve optimum performance results. Stream computing application and solution providers include CAPS entreprise, Mercury Computer Systems, RapidMind, RogueWave and VizExperts. Mercury Computer Systems provides high-performance computing systems and software designed for complex image, sensor, and signal processing applications. Its algorithm team reports that it has achieved 174 GFLOPS performance for large 1D complex single-precision floating point FFTs on the AMD FireStream 9250.

AMD plans to deliver the FireStream 9250 and the supporting SDK in Q3 2008 at an MSRP of $999 USD. AMD FireStream 9170, the industry’s first double-precision floating point stream processor, is currently available for purchase and is competitively priced at $1,999 USD.

Source: AMD

4.6 /5 (36 votes)  

Filter


Move the slider to adjust rank threshold, so that you can hide some of the comments.


Display comments: newest first

Graeme
Jun 17, 2008

Rank: 4 / 5 (3)
Nvidia Tesla-10 Series processor has beaten AMD to the Teraflop speed. It has 240 cores.
Egnite
Jun 17, 2008

Rank: 3.5 / 5 (2)
"the industry's first double-precision floating point stream processor, is currently available for purchase and is competitively priced at $1,999 USD."

Would this be the Nvidia Tesla-10 then? At half the price, I know which I'll be buying for the next upgrade :-)
ryuuguu
Jun 17, 2008

Rank: 4.3 / 5 (3)
at 700w the Telse-10 produces a lot of heat. Also it does not just plug into a PCI slot.
fleem
Jun 17, 2008

Rank: 4.7 / 5 (3)
Stream computing has some advantages and some disadvantages of both pure parallel (multi-core with totally separate, but synchronous threads) processing and pure serial (single core--one thread) processing. So when comparing hardware platforms its a little hard to compare apples to apples. It depends on how many pipelines, cores, pipeline lag, and application requirements. Granted, most apps that need tons of CPU can be distributed.
donjoe0
Jun 20, 2008

Rank: 4.5 / 5 (2)
"It occupies a single PCI slot, for unmatched density"

Funny you should use a picture of a dual-slot card to illustrate that. :rolleyes:
Rank 4.6 /5 (36 votes)
Tags

Relevant PhysicsForums posts
  • Calling function with no input argument
    created6 hours ago
  • Force free body diagram problem on gym equipment
    created7 hours ago
  • Empirical data regarding shower heads and water
    created15 hours ago
  • feed hold button on CNC lathe
    createdFeb 09, 2012
  • RFAC in Fortran
    createdFeb 09, 2012
  • dynamics 2/32
    createdFeb 08, 2012
  • More from Physics Forums - General Engineering

More news stories

Japan scientist makes 'Avatar' robot

A Japanese-developed robot that mimics the movements of its human controller is bringing the Hollywood blockbuster "Avatar" one step closer to reality.

Electronics / Robotics

created 11 hours ago | popularity 5 / 5 (5) | comments 4

Google to make home entertainment system: report

Google will mirror Apple's winning hardware-software formula with an Android-powered entertainment system that wirelessly streams content through homes, the Wall Street Journal reported on Thursday.

Electronics / Consumer & Gadgets

created 23 hours ago | popularity 4.5 / 5 (2) | comments 0

Intel packs performance and reliability into its latest SSD 520 series

Intel Corporation announced today its fastest, most robust client/consumer solid-state drive (SSD) to date, the Intel Solid-State Drive 520 Series (Intel SSD 520), a 6 gigabit-per-second (gbps) SATA III SSD ...

Electronics / Hardware

created Feb 07, 2012 | popularity 5 / 5 (1) | comments 4

Google rumored to have built Heads-Up-Display glasses prototype

(PhysOrg.com) -- 9to5Google is reporting that they have received a tip from someone they believe to be a reliable source saying that Google is working on a Heads-Up-Display (HUD) pair of eye-glasses. The per ...

Electronics / Consumer & Gadgets

created Feb 08, 2012 | popularity 4.3 / 5 (8) | comments 2 | with audio podcast weblog

Apple to debut 'iPad 3' in March: report

Apple will unveil a new version of its market-ruling iPad table computer in March, according to a report in Dow Jones-owned technology blog All Things D.

Electronics / Consumer & Gadgets

created Feb 09, 2012 | popularity 2 / 5 (20) | comments 0


NASA sees wide-eyed cyclone Jasmine

Cyclone Jasmine's eye has opened wider on NASA satellite imagery, as it moves through the Southern Pacific Ocean.

NASA sees Giovanna reach cyclone strength, threaten Madagascar

Tropical Storm 12S built up steam and became a cyclone on February 10, 2012 as NASA's Terra satellite passed overhead. Residents of east-central Madagascar should prepare for this cyclone to make landfall ...

CIA website offline, Anonymous takes credit

The website of the Central Intelligence Agency was unresponsive on Friday after the hacker group Anonymous claimed to have knocked it offline.

Complex wiring of the nervous system may rely on a just a handful of genes and proteins

Researchers at the Salk Institute have discovered a startling feature of early brain development that helps to explain how complex neuron wiring patterns are programmed using just a handful of critical genes. ...

The power of estrogen -- male snakes attract other males

A new study has shown that boosting the estrogen levels of male garter snakes causes them to secrete the same pheromones that females use to attract suitors, and turned the males into just about the sexiest ...

New error-correcting codes guarantee the fastest possible rate of data transmission

Error-correcting codes are one of the triumphs of the digital age. They’re a way of encoding information so that it can be transmitted across a communication channel — such as an optical fiber o ...