Feeds

Oak Ridge goes gaga for Nvidia GPUs

Fermi chases Cell for HPC dough

High performance access to file storage

Oak Ridge National Laboratories may not be the first customer that Nvidia will have for its new "Fermi" graphics processor, which was announced yesterday, but it will very likely be one of the largest customers.

Oak Ridge, one of the giant supercomputing centers managed and funded by the US Department of Energy to do all kinds of simulations and supercomputing design research, has committed to using the GPU co-processor variants of the Fermi chips, the kickers to the current Tesla GPU co-processors, in a future hybrid system that would have ten times the floating point oomph of the fastest supercomputer installed today.

Depending on the tests you want to use, the most powerful HPC box in the world is either the Roadrunner hybrid Opteron-Cell massively parallel custom blade box made by IBM for Los Alamos National Laboratory, or the Jaguar massively parallel XT5 machine at Oak Ridge, which uses only the Opterons to do calculations.

The Roadrunner machine relies on the Cell chips, which are themselves a kind of graphics processor with a single Power core linked into it, to do the heavy lifting on floating point calculations. The compute nodes in the Roadrunner are comprised of a two-socket blade server using dual-core Opteron processors running at 1.8GHz.

Advanced Micro Devices has six-core Istanbul Opterons in the field that are pressing up against the 3GHz performance barrier. But shifting to these faster x64 chips would not radically improve the overall performance of the Roadrunner machine.</p

Going faster miles an hour

Each Opteron blade uses HyperTransport links out to the PCI-Express bus to link to two dual-socket Cell blades. Each Cell processor is running at 3.2GHz, and has eight vector processors (which are used to do the graphics in the Sony PlayStation 3, among other tasks the Cell chips were created to do). The Cell chips also include one 64-bit Power core to manage these on-chip vector processors, which deliver 12.8 gigaflops of double-precision performance each.

Each Opteron core gets its own Cell chip to do its math for it, like the blonde who isn't dating the nerd but the nerd thinks is, and the beauty is that the x64 applications using the message passing interface (MPI) protocol to run parallelized applications runs with minor modifications on the hybrid Opteron-Cell box.

And each server node (one x64 node with two Cell co-processor nodes) can deliver 409.6 gigaflops of double-precision floating-point math. On the Linpack Fortran benchmark test, the Roadrunner with 129,600 cores is able to deliver 1.1 petaflops of sustained performance.

While the Jaguar XT5 machine at Oak Ridge is powerful, weighing in at 1.06 petaflops, it has to rely on its 150,152 cores to do the math. What Jaguar needs is some powerful nerds so its blondes can run code, and it looks like the next generation of machines at the supercomputer center are going to be using the Fermi GPUs.

High performance access to file storage

Next page: Hybrid futures

More from The Register

next story
Bono bests Bezos in Fortune's 'World's 50 Greatest Leaders' list
That Apple CEO? #33. The US president and UK prime minister? M.I.A.
Google slashes cloud storage to $0.026 per GB. Your move, Amazon
Wipes third off compute, two thirds off storage, Big Query costs plummet
WTF: Twitter bug temporarily kills THAT Oscar selfie
Loadsa tweets caught in Senseless Uncoupling drama
Microsoft: Let's be clear, WE won't read your email – but the cops will
Redmond rewrites T&Cs AGAIN – and taps up privacy warriors for help
Why it's time to wrap brains around software-defined networking
Gartner says under 500 orgs use SDN in production, but it's condensing in the cloud
Nvidia, VMware join to pipe high-quality 3D graphics from the cloud
NaviSite to be first desktop-as-a-service provider running VMware virty infrastructure on Nvidia GPUs
Big Yellow loses its head... again: Symantec, we need to talk
Enormo security firm needs to get serious about acquisitions
prev story

Whitepapers

HP Global 2000 mobile risk report
Download this report to see the alarming realities regarding the sheer number of applications vulnerable to attack, as well as the most common and easily addressable vulnerability errors.
3 Big data security analytics techniques
Applying these Big Data security analytics techniques can help you make your business safer by detecting attacks early, before significant damage is done.
High performance access to file storage
In this whitepaper learn about the new approach Avere is pioneering for the problems of providing high performance access to a common file storage environment.
The benefits of software based PBX
Why you should break free from your proprietary PBX and how to leverage your existing server hardware.
SANS - Survey on application security programs
In this whitepaper learn about the state of application security programs and practices of 488 surveyed respondents, and discover how mature and effective these programs are.