backtop


Print 50 comment(s) - last by surt.. on Mar 15 at 2:37 PM

Intel says parallel software is more important for many-core CPUs like "Larrabee"

Multi-core processors have been in the consumer market for several years now. However, despite having access to CPUs with two, three, four, and more cores, there are still relatively few applications available that can take advantage of multiple cores. Intel is hoping to change that and is urging developers of software to think parallel.

Intel director and chief evangelist for software development products talked about thinking parallel in a keynote speech he delivered at the SD West conference recently. James Reinders said, "One of the phrases I've used in some talks is, it's time for us as software developers to really figure out how to think parallel." He also says that the developer who doesn’t think parallel will see their career options limited.

Reinders gave the attendees eight rules for thinking parallel from a paper he published in 2007 reports ComputerWorld. The eight rules include -- Think parallel; program using abstraction; program tasks, not threads; design with the option of turning off concurrency; avoid locks when possible; use tools and libraries designed to help with concurrency; use scalable memory; and design to scale through increased workloads.

He says that after half a decade of shipping multi-core CPUs, Intel is still struggling with how to use the available cores. The chipmaker is under increasing pressure from NVIDIA who is leveraging a network of developers to program parallel applications to run on its family of GPUs. NVIDIA and Intel are embroiled in a battle to determine if the GPU or CPU will be the heart of future computer systems.

Programming for processors with 16 or 32 cores takes a different approach according to Reinders. He said, "It's very important to make sure, if at all possible, that your program can run in a single thread with concurrency off. You shouldn't design your program so it has to have parallelism. It makes it much more difficult to debug."

Reinders talked about the Intel Parallel Studio tool kit in the speech, a tool kit for developing parallel applications in C/C++, which is currently in its beta release. Reinders added, "The idea here [with] this project was to add parallelism support to [Microsoft's] Visual Studio in a big way."

Intel says that it plans to offer the parallel development kit to Linux programmers this year or early next year. The CPU Reinders is talking about when he says many-core is the Larrabee processor. Intel provided some details on Larrabee in August of 2008.

One of the key features of Larrabee is that it will be the heart of a line of discrete graphics cards, a market Intel has not participated in. Larrabee is said to contain ten of more cores inside the discrete package. If Larrabee comes to be in the form Intel talked about last year it will be competing directly against NVIDIA and ATI in the discrete graphics market.

NVIDIA is also rumored to be eyeing an entry into the x86 market as well. Larrabee will be programmable in the C/C++ languages, just as NVIDIA's GPUs are via the firms CUDA architecture.



Comments     Threshold


This article is over a month old, voting and posting comments is disabled

RE: What am I missing here?
By Scrogneugneu on 3/15/2009 11:52:52 AM , Rating: 2
Do you know anything about programming?

Multiple threads can read from the same data all the same than a single thread. Concurrency problems only happen when 2 threads want to write to the same memory emplacement, reading can be infinite. The state of everything in the game can be read, but no change will happen to it until the next frame render, so each and every thread can read the same data at the same time.

This isn't mentioning that I talked about a 4 threads engine, and you picked up and went with a 20 threads engine. If you want to compute 20 characters' actions, splitting it in 4 threads requires each thread to compute 5 characters sequentially. One thread per character is very, very wasteful.

Plus, the advantage I pointed out was that you could manage more AI resources in the same time. You can go from there and add a lot of complexity to the handling of the AI, thus ending up with a much more intelligent character. Suppose we do, and the computation time goes up to 5ms per character. By taking your own numbers (supposing the data reuse you speak of saves us 1.5ms per character), we end up with 5 + (19x3.5) = 66.5ms sequentially. Using your 20 threads example, that would be 3 characters x 5ms = 15ms.

Threading isn't meant to gain tremendous speed on everything. It's meant to handle large workloads better. Nobody will implement threading on simple tasks, but the capacity to lower the additional cost per character on AI computation is huge. The same logic goes for physics.


"And boy have we patented it!" -- Steve Jobs, Macworld 2007

Related Articles
Intel Talks Details on Larrabee
August 4, 2008, 12:46 PM













botimage
Copyright 2014 DailyTech LLC. - RSS Feed | Advertise | About Us | Ethics | FAQ | Terms, Conditions & Privacy Information | Kristopher Kubicki