Futuristic 48-Core Intel Chip Could Reshape How Computers are Built (w/ Video)

December 3, 2009
Futuristic Intel Chip

Enlarge

Single Chip Cloud Computer has 48 Intel cores and runs at as low as 25 watts

(PhysOrg.com) -- Researchers from Intel Labs demonstrated an experimental, 48-core Intel processor, or "single-chip cloud computer," that rethinks many of the approaches used in today's designs for laptops, PCs and servers.

Researchers from Labs demonstrated an experimental, 48-core Intel , or "single-chip cloud computer," that rethinks many of the approaches used in today's designs for laptops, PCs and servers. This futuristic chip boasts about 10 to 20 times the processing engines inside today's most popular Intel Core-branded processors.

The long-term research goal is to add incredible scaling features to future computers that spur entirely new software applications and human-machine interfaces. The company plans to engage industry and academia next year by sharing 100 or more of these experimental chips for hands-on research in developing new software applications and programming models.

This video is not supported by your browser at this time.

While Intel will integrate key features in a new line of Core-branded chips early next year and introduce six- and eight-core processors later in 2010, this prototype contains 48 fully programmable Intel processing cores, the most ever on a single silicon chip. It also includes a high-speed on-chip network for sharing information along with newly invented power management techniques that allow all 48 cores to operate extremely energy efficiently at as little as 25 watts, or at 125 watts when running at maximum performance (about as much as today's Intel processors and just two standard household light bulbs).

Intel plans to gain a better understanding of how to schedule and coordinate the many cores of this experimental chip for its future mainstream chips. For example, future laptops with processing capability of this magnitude could have "vision" in the same way a human can see objects and motion as it happens and with high accuracy.

Imagine, for example, someday interacting with a computer for a virtual dance lesson or on-line shopping that uses a future laptop's 3-D camera and display to show you a "mirror" of yourself wearing the clothes you are interested in. Twirl and turn and watch how the fabric drapes and how the color complements your skin tone.

This kind of interaction could eliminate the need of keyboards, remote controls or joysticks for gaming. Some researchers believe computers may even be able to read brain waves, so simply thinking about a command, such as dictating words, would happen without speaking.

Intel Labs has nicknamed this test chip a "single-chip cloud computer" because it resembles the organization of datacenters used to create a "cloud" of computing resources over the Internet, a notion of delivering such services as online banking, social networking and online stores to millions of users.

Cloud datacenters are comprised of tens to thousands of computers connected by a physically cabled network, distributing large tasks and massive datasets in parallel. Intel's new experimental research chip uses a similar approach, yet all the computers and networks are integrated on a single piece of Intel 45nm, high-k metal-gate silicon about the size of a postage stamp, dramatically reducing the amount of physical computers needed to create a cloud datacenter.

"With a chip like this, you could imagine a cloud datacenter of the future which will be an order of magnitude more energy efficient than what exists today, saving significant resources on space and power costs," said Justin Rattner, head of Intel Labs and Intel's Chief Technology Officer. "Over time, I expect these advanced concepts to find their way into mainstream devices, just as advanced automotive technology such as electronic engine control, air bags and anti-lock braking eventually found their way into all cars."

Futuristic 48-Core Intel Chip Could Reshape How Computers are Built
Enlarge


Cores Allow Software to Intelligently Direct Data for Efficiency

The concept chip features a high-speed network between cores to efficiently share information and data. This technique gives significant improvement in communication performance and energy efficiency over today's datacenter model, since data packets only have to move millimeters on chip instead of tens of meters to another computer system.

Application software can use this network to quickly pass information directly between cooperating cores in a matter of a few microseconds, reducing the need to access data in slower

off-chip system memory. Applications can also dynamically manage exactly which cores are to be used for a given task at a given time, matching the performance and energy needs to the demands of each.

Related tasks can be executed on nearby cores, even passing results directly from one to the next as in an assembly line to maximize overall performance. In addition, this software control is extended with the ability to manage voltage and clock speed. Cores can be turned on and off or change their performance levels, continuously adapting to use the minimum energy needed at a given moment.

Overcoming Software Challenges

Programming processors with multiple cores is a well-known challenge for the industry as computer and software makers move toward many-cores on a single silicon chip. The prototype allows popular and efficient parallel programming approaches used in cloud datacenter software to be applied on the chip. Researchers from Intel, HP and Yahoo's Open Cirrus collaboration have already begun porting cloud applications to this 48 IA core chip using Hadoop, a Java software framework supporting data-intensive, distributed applications as demonstrated by Rattner today.

Intel plans to build 100 or more experimental chips for use by dozens of industrial and academic research collaborators around the world with the goal of developing new software applications and programming models for future many-core processors.

"Microsoft is partnering with Intel to explore new hardware and software architectures supporting next-generation client plus cloud applications," said Dan Reed, Microsoft's corporate vice president of Extreme Computing. "Our early research with the single cloud computer prototype has already identified many opportunities in intelligent resource management, system software design, programming models and tools, and future application scenarios."

This milestone represents the latest achievement from Intel's Tera-scale Computing Research Program, aimed at breaking barriers to scaling future chips to 10s-100s of cores. It was co-created by Intel Labs at its Bangalore (India), Braunschweig (Germany) and Hillsboro, Ore. (U.S.) research centers. Details on the chip's architecture and circuits are scheduled to be published in a paper at the International Solid State Circuits Conference in February.

Provided by Intel (news : web)

4.4 /5 (32 votes)  

Filter


Move the slider to adjust rank threshold, so that you can hide some of the comments.


Display comments: newest first

SincerelyTwo
Dec 03, 2009

Rank: 4.3 / 5 (6)
That CPU might do the trick, I was hoping that eventually programmers can develop software without actually needing to be concerned about threading. At the most we could gives some 'direction' for the CPU through meta-instructions in special situations to do explicit threading, but in general threading should be implicit and managed by the CPU which would be able to determine how to best deal with certain types of information.

A mix of implicit/explicit threading through meta-instructions compiled in to the application, that would be such a fantastic approach and alleviate a lot of burden from programmers.

And when I talk about threading I mean both across all cores and on any single core, the CPU should decide when to use extra or share cores unless given explicit instructions. That way software will always run utilizing far more of the CPU's potential.
jselin
Dec 03, 2009

Rank: 2 / 5 (2)
What SincerelyTwo said ^^^

I'm amazed its left to the programmers to make use of the technology the way it is now. Software developers have to make the decision that such additional programming work is value added. This results in pricer software if you want to take advantage of multithreading :(

If it doesn't take full advantage of the cores the way tailored programming might then there should be a way for the programmer to choose to override.
PinkElephant
Dec 03, 2009

Rank: 5 / 5 (1)
SincerelyTwo/jselin, what you propose is impossible. Whether a program can be efficiently threaded depends entirely on the nature of the program's workflow and algorithms. Some tasks are just inherently serial (i.e. each step vitally depends on the output of a preceding step), and cannot be parallelized at all; other tasks are "embarassingly parallel", and can be easily farmed out to a set of worker threads. But many real-life tasks fall into serial, rather than parallel category. Thus, as a rule, most programs will not gain performance on massively parallel systems. Still, such systems would typically enable you to simultaneously run many non-threaded programs with no loss of performance. Of course, some useful tasks are parallelizable: e.g. tile-based rendering, image processing, breadth-first searches (like in chess games, or similar AI). So many-core systems aren't totally useless; but such tasks must still be explicitly implemented with a mind toward parallelism.
Zenmaster
Dec 04, 2009

Rank: 2.5 / 5 (2)
"Application software can use this network to quickly pass information directly between cooperating cores in a matter of a few microseconds, reducing the need to access data in slower off-chip system memory."

Can't be right. Since when does even off-chip memory access take a few microseconds...
Feldagast
Dec 04, 2009

Rank: not rated yet
Where is my talking computer?? Why read brain waves when our voices would do as much. Talking Star Trek and the enterprises main computer. Not the one from the 60's either.
Buyck
Dec 04, 2009

Rank: not rated yet
These 48 cores are realized on a 45nm chip (not for consumers). What if in 2015 the 22nm or even the 15nm chips are for sale to every consumer?! We will likely see a significant increase of many cores on commercial available chips in the coming years. I think by 2020-2025 a thousand cores or maybe even more.
plasticpower
Dec 04, 2009

Rank: not rated yet
]
Can't be right. Since when does even off-chip memory access take a few microseconds...


You can access registers and different levels of cache a magnitude faster than RAM (main memory) and thousands of times faster than non-volatile (dear god) memory such as hard drives. When you're passing information between machines, you're running it over layers and layers of devices. From the registers in the CPU through main memory via some sort of a bus to a device that's sitting on the bus and is probably sharing said bus with other similar devices, which then has to send information over some kind of networked connection and then the process is repeated in reverse on the receiving end.

Instead what Intel is doing is passing information between registers of multiple cores, the same thing that makes RISC machines so great, you aren't making all these accesses to main memory and beyond. This article doesn't really explain just how hard it would be to implement something like this.
Temple
Dec 04, 2009

Rank: not rated yet
Apple has already released its first step toward making it easy and intuitive for developers to make use of multiple cores/processors.

Whatever your particular OS religion, it's worth checking out the open-source (open-ish) Grand Central Dispatch.

There's some info here (follow the link in the article for an exhaustive review): http://arstechnic...inux.ars
SincerelyTwo
Dec 04, 2009

Rank: not rated yet
PinkElephant,

I realize that, which is why I dedicated a significant part of my statements to describing implicit *and explicit multi-threading approach. For example, it could be possible for multiple applications to be working on the 'current' core, in which case the CPU will decide to offload the work to another core without the programmer actually have to command the CPU to do such a thing.

I'm a programmer with regular experience in this, it's obvious to me that there are ways the systems can be architected to automate threading based on various conditions not directly tied to any application itself, but the size of the job and current consumption of a given core.

It's more about the distribution of work than it is segmenting of any given algorithm. Not 'automated parallelism', automated distribution* of work.

Explicit threading for parallelism.

Implicit threading for redistribution of work across multiple cores for general processing.
Nik_2213
Dec 04, 2009

Rank: not rated yet
http://en.wikiped...ansputer

IIRC, the Occam language written to work with the Transputer enabled automatic task distribution into parallel processes, pipelines etc. One benefit, IIRC, was that different cores ran asynchronously, so you could add more / faster hardware as easy as USB...
simulus
Dec 04, 2009

Rank: not rated yet
Increasing number of cores becomes meaningless at some point (so don't expect billion-core chips for example). More of something being equivalent to better is a meaningless addiction.

Between one and one billion cores, what is the optimum and do we have to design software around it? How long would that take?
vantomic
Dec 04, 2009

Rank: 5 / 5 (1)
"This kind of interaction could eliminate the need of keyboards, remote controls or joysticks for gaming. Some researchers believe computers may even be able to read brain waves, so simply thinking about a command, such as dictating words, would happen without speaking."

I have a real doubt about how this could work. If I used this setup to type these two simple sentences it would come out as...

"I have a real doubt as to about no wait I have a real doubt as to how this could work man my grammar sucks where is my beer ok I don't think this could work what no i didn't take the damn garbage out what this stupid software ARAGGSDGHGH stop"

"
PinkElephant
Dec 04, 2009

Rank: not rated yet
PinkElephant,
...
It's more about the distribution of work than it is segmenting of any given algorithm. Not 'automated parallelism', automated distribution* of work.


That's already being done by every single modern OS (beginning with the first flavors of UNIX.) The word you're looking for, is "scheduling". Of course, some OSes are better at scheduling and fair resource allocations, than others. My experience with Windows, for example, leads me to conclude that Microsoft can't be bothered to implement a decent scheduler. Very frequently, the entire computer hangs up (even on dual-core systems!) when a single task somehow manages to "hog" the CPU. One such routine culprit is (of all things) Outlook....
tkjtkj
Dec 05, 2009

Rank: not rated yet
25watts?? so, how come the video shows the test-bed to include a massive copper multi-pipe heatsink???
EvgenijM
Dec 05, 2009

Rank: not rated yet
They are a little bit late, nvidia fermi is going out soon. Besides, nvidia gpgpu already works in software for popular OS, unlike this architechtuure.
Velanarris
Dec 06, 2009

Rank: not rated yet
Cores don't need to use a bus to talk to main memory since Nehalem came out.

This isn't about memory registers it's about more processing power with fixed clock speeds. More cores means more calculative ability meaning less time even talking to memory. This is about streamlining the process through over pwoering the application. It will be interesting to see if this is the correct methodology going forward.
psommerfeld
Dec 07, 2009

Rank: not rated yet
25watts?? so, how come the video shows the test-bed to include a massive copper multi-pipe heatsink???


I'm wondering if they mean 25 - 125W PER CORE?! Not so power-efficient if this is the case! If these are full-blown cores than they I would expect them to be at least 25W/core (or 1000 W per package!)
Velanarris
Dec 07, 2009

Rank: not rated yet
25W over average die resistance figures is a lot of heat in a space that small.

PinkElephant
Dec 08, 2009

Rank: 5 / 5 (1)
I'm guessing they mean 125W for the entire 48-core chip. That would mean about 2.6 W per core when active (I'm also guessing 25W figure is when the chip is idling.) 2.6W is within a normal range of operation for small chips such as used in netbooks. Intel's own Atom line fits within such a power envelope, so perhaps those 48 cores are each derived from the Atom design... See here for more info about Atom:

http://en.wikiped...itecture
Rank 4.4 /5 (32 votes)
Related Stories
Relevant PhysicsForums posts
  • Calling function with no input argument
    created10 hours ago
  • Force free body diagram problem on gym equipment
    created11 hours ago
  • Empirical data regarding shower heads and water
    created18 hours ago
  • feed hold button on CNC lathe
    createdFeb 09, 2012
  • RFAC in Fortran
    createdFeb 09, 2012
  • dynamics 2/32
    createdFeb 08, 2012
  • More from Physics Forums - General Engineering

More news stories

Japan scientist makes 'Avatar' robot

A Japanese-developed robot that mimics the movements of its human controller is bringing the Hollywood blockbuster "Avatar" one step closer to reality.

Electronics / Robotics

created 15 hours ago | popularity 5 / 5 (5) | comments 5

Intel packs performance and reliability into its latest SSD 520 series

Intel Corporation announced today its fastest, most robust client/consumer solid-state drive (SSD) to date, the Intel Solid-State Drive 520 Series (Intel SSD 520), a 6 gigabit-per-second (gbps) SATA III SSD ...

Electronics / Hardware

created Feb 07, 2012 | popularity 5 / 5 (1) | comments 4

Google rumored to have built Heads-Up-Display glasses prototype

(PhysOrg.com) -- 9to5Google is reporting that they have received a tip from someone they believe to be a reliable source saying that Google is working on a Heads-Up-Display (HUD) pair of eye-glasses. The per ...

Electronics / Consumer & Gadgets

created Feb 08, 2012 | popularity 4.3 / 5 (8) | comments 2 | with audio podcast weblog

Apple to debut 'iPad 3' in March: report

Apple will unveil a new version of its market-ruling iPad table computer in March, according to a report in Dow Jones-owned technology blog All Things D.

Electronics / Consumer & Gadgets

created Feb 09, 2012 | popularity 1.9 / 5 (21) | comments 0

New Kindle Touch is an impressive e-reader

When it comes to reading digital books, tablets are all the rage. But there's a lot to like about simple e-readers, which over the past year have become both a lot cheaper and a lot less clunky.

Electronics / Consumer & Gadgets

created Feb 09, 2012 | popularity 5 / 5 (3) | comments 1


Google users warned of threat to smartphone wallets

Users of Google smartphone wallets were being warned on Friday that there is a way to crack pass codes intended to thwart thieves from going on illicit shopping sprees.

Anonymous knocks CIA website offline (Update)

The website of the Central Intelligence Agency was inaccessible on Friday after the hacker group Anonymous claimed to have knocked it offline.

Complex wiring of the nervous system may rely on a just a handful of genes and proteins

Researchers at the Salk Institute have discovered a startling feature of early brain development that helps to explain how complex neuron wiring patterns are programmed using just a handful of critical genes. ...

New error-correcting codes guarantee the fastest possible rate of data transmission

Error-correcting codes are one of the triumphs of the digital age. They’re a way of encoding information so that it can be transmitted across a communication channel — such as an optical fiber o ...

Putting the squeeze on planets outside our solar system

(PhysOrg.com) -- Using high-powered lasers, scientists at Lawrence Livermore National Laboratory and collaborators discovered that molten magnesium silicate undergoes a phase change in the liquid state, abruptly ...

The power of estrogen -- male snakes attract other males

A new study has shown that boosting the estrogen levels of male garter snakes causes them to secrete the same pheromones that females use to attract suitors, and turned the males into just about the sexiest ...