

A Blue Gene/P rack, which sports 1024 nodes, 4096 processor cores, and 2 TB of memory.  (Source: IBM)

With a high degree of space efficiency, reduced power consumption, and quicker deployment, IBM looks to use its Blue Gene systems, such as the Blue Gene/P setup pictured here, to take the cluster-dominated web-hosting market by SMP storm.  (Source: IBM)
Dreaming big; IBM looks to host entire internet on a single modified Blue Gene supercomputer

Thomas Watson, founder of IBM, is oft misquoted as stating that the world would really only need five computers.  Ironically, the frequently used, erroneous quotation may come true by the very hands of the business Watson created.

IBM launched an epic project with an almost unfathomable goal -- to develop a single supercomputer capable of running the entire internet as a web application.  The project, codenamed Kittyhawk (detailed in a white paper by IBM), created quite the stir in the internet technology community.

While the software details descend quickly into the realm of the cerebral, one figure that jumps off the page is the estimated core count and memory of the proposed finished system -- 67.1 million cores with 32PB of memory.

The system is based on IBM's Blue Gene/P architecture, which takes millions of cores and arranges them in a hierarchical architecture.  At the lowest level, four 850 MHz PowerPC cores run on a single chip, with built-in memory controllers and interconnects.  Each of these quad-core chips is known as a "node."  The next level up is the card, which contains 32 nodes.  Up a level, 16 cards compose a midplane.  A server rack has two midplanes, yielding a total of 1024 nodes, or 4096 cores.  Each server rack has 2TB of memory to play with.  A maximum of 16,384 racks can be networked to yield the final staggering metrics.  As each rack has an I/O bandwidth of 640Gb/s, a "full" 67.1m core system would sport 10.4Pb/s of bandwidth.
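The totals quoted above follow from multiplying out the hierarchy, and a short sketch makes the arithmetic easy to check.  The constant and function names here are illustrative and not taken from IBM's paper, and the conversion of 1 PB = 1024 TB is an assumption.  Note that the bandwidth product works out to 10,485,760 Gb/s, or roughly 10.5Pb/s in decimal units, slightly above the 10.4Pb/s figure quoted.

```python
# Figures as quoted in the article; names are illustrative only.
CORES_PER_NODE = 4          # quad-core PowerPC chip
NODES_PER_CARD = 32
CARDS_PER_MIDPLANE = 16
MIDPLANES_PER_RACK = 2
MAX_RACKS = 16_384
MEMORY_PER_RACK_TB = 2
IO_PER_RACK_GBPS = 640      # gigabits per second


def rack_totals():
    """Return (nodes, cores) for a single rack."""
    nodes = NODES_PER_CARD * CARDS_PER_MIDPLANE * MIDPLANES_PER_RACK
    return nodes, nodes * CORES_PER_NODE


def system_totals(racks=MAX_RACKS):
    """Scale the per-rack figures up to a system of `racks` racks."""
    nodes_per_rack, cores_per_rack = rack_totals()
    return {
        "nodes": racks * nodes_per_rack,
        "cores": racks * cores_per_rack,
        # Assumption: binary units, 1 PB = 1024 TB.
        "memory_pb": racks * MEMORY_PER_RACK_TB / 1024,
        "io_gbps": racks * IO_PER_RACK_GBPS,
    }


if __name__ == "__main__":
    print(rack_totals())
    print(system_totals())
```

Calling `system_totals()` with a smaller rack count gives the figures for a partial installation, which is the incremental, per-rack growth IBM describes.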

The design is certainly not unproven technology -- IBM's Blue Gene systems own 4 of the top 10 spots on the list of the fastest supercomputers on the planet, including the top spot, which is occupied by a Blue Gene/L system.  IBM's Blue Gene/P architecture is the successor to its L architecture.  The Kittyhawk project is designed around the newer, more powerful and efficient "P" architecture.

IBM argues that there are many advantages to using large SMP (symmetric multi-processing) systems for internet hosting.  Such systems beat clusters in power efficiency and space requirements.  However, clusters beat SMPs in terms of pricing and availability, due to the ability to utilize generic commercial hardware for much of the system.  This has led companies such as Sun Microsystems, Amazon.com, Google and Microsoft to adopt cluster hosting centers, which give the companies the ability to flexibly increase their capacity as demand mandates.

While it acknowledges the benefits of clusters, IBM feels that by using its proven, scalable Blue Gene architecture it can provide box solutions for web software jobs which beat cluster solutions at their own game.   IBM says that by choosing Blue Gene, web customers would be able to obtain large savings due to the more efficient infrastructure.  

The company also points out that a fast-growing web company often finds demand outpacing its available capacity.  Its web hosts can only add so much capacity at a time, due to the long validation and construction process needed to expand clusters.  IBM says its Blue Gene system is validated per rack, so new racks can be deployed quickly for increased capacity.  Additionally, it points out that clusters have relatively high failure rates, leading to loss of service or the need for backup systems, while its racks seldom fail.

IBM's current implementation is in a dead heat with current clusters, according to the company.  It can currently run most standard web software, including Linux, Apache, MySQL and Ruby on Rails.  Its conclusions are based on the SPECjbb2005 Java performance benchmark, which yielded an impressive 9565 Business Operations per second (BOPS) for the current implementation, and LAMP (Linux, Apache, MySQL, Perl/Python) software benchmarks, which yielded similarly encouraging results.  IBM's researchers feel that this is only the start and that an optimized version will blow away clusters.  They stated in the paper, "We hypothesize that for a large class of web-scale workloads the Blue Gene/P platform is an order of magnitude more efficient to purchase and operate than the commodity clusters in use today."

The current implementation makes use of a Linux microkernel, network-based management, software appliances and a quasi-stateless approach to provide high performance.  Parallel processing optimizations in both software and hardware, sophisticated error checking, and built-in, trackable administrative tools are among the many weapons in the system's arsenal.

One criticism of the system is that it uses IBM's proprietary PowerPC architecture.  This leaves system implementers with the choice of either programming in Linux for PowerPC, an unpopular choice, or being forced to use relatively clumsy ports of Linux/x86 code at the cost of performance.  Despite the landmark idea, IBM still finds itself a bit weighed down by its refusal to accept the worldwide dominance of the x86 architecture.

While these issues and the hiccups of a brand new approach still need to be ironed out, IBM stands to gain some significant customers in the web hosting world with its attractive setup, even if it can't accomplish the Herculean goal of Kittyhawk.



Comments

It shall be known as...
By DigitalFreak on 2/8/2008 1:53:04 PM , Rating: 5
Skynet




RE: It shall be known as...
By Shark Tek on 2/8/2008 2:06:28 PM , Rating: 2
Dang you got it first I was thinking the same thing.

The end is near !!!!


RE: It shall be known as...
By Shoal07 on 2/8/2008 2:15:42 PM , Rating: 5
I, for one, welcome our new synthetic overlords.


RE: It shall be known as...
By SectionEight on 2/8/2008 2:27:05 PM , Rating: 2
more like Helios.


RE: It shall be known as...
By deeznuts on 2/8/2008 2:47:50 PM , Rating: 5
Damn Scientologists!


RE: It shall be known as...
By nbachman on 2/8/2008 6:17:41 PM , Rating: 2
I hate those guys.


RE: It shall be known as...
By Kaleid on 2/8/2008 5:34:14 PM , Rating: 2
Instantly I thought about Deus Ex too.


RE: It shall be known as...
By 306maxi on 2/8/2008 4:17:40 PM , Rating: 5
As we all know from Terminator 3. Skynet is actually software.


RE: It shall be known as...
By Griswold on 2/8/2008 6:36:01 PM , Rating: 5
Yea software that runs on all computers connected by the internet - thats why it was impossible to stop it. Now, what do you think happens when one machine becomes the internet? It would only become easier for the software to take over control - and likewise easier to pull the plug.

Maybe this isnt so bad after all? :p


RE: It shall be known as...
By ImSpartacus on 2/8/2008 10:28:21 PM , Rating: 2
A program that attempts to run itself on all possible computers? We call that a virus...


RE: It shall be known as...
By Griswold on 2/9/2008 10:09:25 AM , Rating: 2
Yea and so did the folks at NORAD in the movie and thats where it went downhill. It wasnt a virus, it was more than that.


RE: It shall be known as...
By daftrok on 2/9/2008 12:38:14 PM , Rating: 2
It was SATAN! The internet is the devil! TURN OFF YOUR WIRELESS! RIP OUT THE ETHERNET CORDS!


RE: It shall be known as...
By Shawn5961 on 2/9/2008 1:20:14 PM , Rating: 3
Repent mechanical demon! Repent!


RE: It shall be known as...
By Xodus Maximus on 2/9/2008 2:25:06 PM , Rating: 5
Yeah the military version of Folding@HOME gone terribly wrong...


RE: It shall be known as...
By Clauzii on 2/10/2008 4:53:47 PM , Rating: 4
Cookie for You on that one!


RE: It shall be known as...
By burnttoy on 2/11/2008 7:04:08 AM , Rating: 2
Don't worry...

I've got enough C4 to blow up TEN supercomputers.


RE: It shall be known as...
By TSS on 2/8/2008 6:32:06 PM , Rating: 3
more like deep thought.

"i can't give you the ultimate porn unless i know the ultimate hyperlink"
"what is the ultimate hyperlink?"
"i'll build another computer for that, i'll just enjoy the porn."


RE: It shall be known as...
By BruceLeet on 2/9/2008 11:36:28 AM , Rating: 2
Well Skynet was the Internets..Software.

And having the Internets in one supercomputer would actually be a good thing, I know we're just makin cracks here. But seriously, its not Myles Dyson, its Barack Obama, hmm...just clicked in my head how much those two look alike lol.


RE: It shall be known as...
By del on 2/10/2008 11:34:05 PM , Rating: 2
lol... dayum you guys are funny :D

But seriously, folks: single point of failure is bad, just like that other guy said. I think IBM just wants to get back on top of their game (and possibly rule the universe). This thing would make crysis look like an old DOS game. Maybe that would be the best use of this computer... or not... I'm not sure crysis is optimized for parallel processing. It certainly won't spawn 67.1 mega-threads.

The only thing I can add to these comments, since everything else has been said, is that one doesn't need a nuclear missile to take out the Internet. My old computer teacher told me how it can be done with software. ;)


RE: It shall be known as...
By 1078feba on 2/11/2008 11:24:11 AM , Rating: 4
quote:
But seriously, folks: single point of failure is bad...


Yeah, and the last thing it says before we shut it down will be:

"Will I dream?"