Red Storm: physical connections

  • Thread starter Thread starter Felger Carbon
  • Start date Start date
F

Felger Carbon

Red Storm is not scheduled to be fully operative until Aug, and the
Super Bowl kickoff is some hours away. Let's kill some time here. :)

Robert, Yousuf and I have been discussing Red Storm's characteristics.
This is a problem since the details of the mesh connections have not
been revealed yet. I believe that Robert and Yousuf started as
programmers who picked up some hardware knowledge. I was a nuts &
bolts hardware engineer (now retired) who had to make things work. In
the latter part of my career, the things were computers or computer
subsystems. Along the way I picked up some knowledge of programming.

I'm gonna apply my hardware point of view to some things we actually
know about Red Storm.

10,368 CPUs connected in a 27 by 16 by 24 mesh (X, Y, Z).
Topologically, this is a cube. But, the map is not the territory, as
someone once said. Let's discuss the territory - the physical
implementation of this cubic mesh.

There are 4 CPUs per board.
There are 8 boards per card cage.
There are 3 card cages per cabinet.
There are 108 cabinets. If you've been keeping track, that's 10,368
CPUs.
The cabinet measures 2ft wide, 4ft long (in the direction of air
movement), and about 7 or 8 feet high.
The physical arrangement of the cabinets is 27 cabinets wide by 4
cabinets deep.

Question: what's the simplest wiring arrangement for the X Y Z mesh?

Since the cabinets are arranged 27 wide, it's obvious how the X mesh
(dimension 27) is wired up; there is one cabinet wide by 4 deep for
each value of X. So the wire for X = 3 (for example) must connect to
just 4 cabinets, but to every CPU in those cabinets.

Hmm. If one axis is limited to 4 cabinets per value, I got a hunch
another axis can be limited to 27 cabinets per value, and the final
axis has to connect to every cabinet for every value. Let's identify
the 27 cabinets per value:

Let's conceptually divide each card cage in half. Each half-cage
contains 16 CPUs, and there are 6 half-cages per cabinet. Each
half-cage can contain all the 16 possible values of Y for given values
of X,Z. Six half-cages per cabinet times 4 cabinets deep equals 24,
which is the mesh's Z dimension. So:

Each of 27 X values must connect to 4 cabinets.
Each of 16 Y values must connect to all 108 cabinets.
Each of 24 Z values must connect to 27 cabinets.

Within each cabinet, just one X value connects to all 96 CPUs.
Within each cabinet, a given Y value connects to one CPU in each of
six half-cages.
Within each cabinet, a given Z value connects to all 16 CPUs in a
given half-cage.

Only one X value per cabinet.
All 16 Y values per cabinet.
Six Z values per cabinet.

The above information is what it takes to convert the Red Storm map
(mesh) to the Red Storm territory (physical implementation).

The message you should get out of this is that if a given 3D mesh-type
MPU doesn't conform in some logical way to the 2-D physical cabinet
layout, there's big trouble.

It's easier to see this if you consider the physical layout to be 27
cabinets wide by 24 half-cages deep, with 16 CPUs per half-cage.
Voila, a simple 27 by 16 by 24 mesh. ;-)

<http://www.lanl.gov/orgs/ccn/salishan2003/pdf/camp.pdf>

The above link is for a 77-slide presentation by the customers, not by
Cray. ~3.6 meg download. It covers (justifies) the design choices
made.

I have a bone to pick with slide 59, the one that describes a "full
crossbar" as some folks' Holy Grail. I call a 10,368 by 10,368
crossbar a "bottomless money pit". When you fill the pit with money,
you have a really well-connected MPU. But the pit is bottomless...
 
Felger Carbon said:
Red Storm is not scheduled to be fully operative until Aug, and the
Super Bowl kickoff is some hours away. Let's kill some time here. :)

There's some rumours that IBM is going to be doing a new Linux commercial
for this one, featuring Mohamed Ali. And it's even rumoured that AMD will do
its first Super Bowl commercial this time too. I gotta buddy with a 65" HDTV
too, so I'm definitely not going to be on the newsgroups during that time.
:-)
Robert, Yousuf and I have been discussing Red Storm's characteristics.
This is a problem since the details of the mesh connections have not
been revealed yet. I believe that Robert and Yousuf started as
programmers who picked up some hardware knowledge. I was a nuts &
bolts hardware engineer (now retired) who had to make things work. In
the latter part of my career, the things were computers or computer
subsystems. Along the way I picked up some knowledge of programming.

Yes, I started out as a programmer, ages ago. I'd consider myself more of a
hardware guy these days.

Yousuf Khan
 
Red Storm is not scheduled to be fully operative until Aug, and the
Super Bowl kickoff is some hours away. Let's kill some time here. :)

We can safely talk about this right through the Super Bowl from my
point of view. People who try to call or IM me at that time may find
that I'm not available because it's an ideal time to run errands, the
streets being empty.
Robert, Yousuf and I have been discussing Red Storm's characteristics.
This is a problem since the details of the mesh connections have not
been revealed yet. I believe that Robert and Yousuf started as
programmers who picked up some hardware knowledge. I was a nuts &
bolts hardware engineer (now retired) who had to make things work. In
the latter part of my career, the things were computers or computer
subsystems. Along the way I picked up some knowledge of programming.

I'm gonna apply my hardware point of view to some things we actually
know about Red Storm.

10,368 CPUs connected in a 27 by 16 by 24 mesh (X, Y, Z).
Topologically, this is a cube. But, the map is not the territory, as
someone once said. Let's discuss the territory - the physical
implementation of this cubic mesh.

There are 4 CPUs per board.
There are 8 boards per card cage.
There are 3 card cages per cabinet.
There are 108 cabinets. If you've been keeping track, that's 10,368
CPUs.
The cabinet measures 2ft wide, 4ft long (in the direction of air
movement), and about 7 or 8 feet high.
The physical arrangement of the cabinets is 27 cabinets wide by 4
cabinets deep.

Question: what's the simplest wiring arrangement for the X Y Z mesh?

Since the cabinets are arranged 27 wide, it's obvious how the X mesh
(dimension 27) is wired up; there is one cabinet wide by 4 deep for
each value of X. So the wire for X = 3 (for example) must connect to
just 4 cabinets, but to every CPU in those cabinets.

There is not a "the" wire for X=3. There are 768 of them, and they
must go to appropriate CPU's in appropriate cabinets containing X=2
and X=4. Why are you making this hard?
Hmm. If one axis is limited to 4 cabinets per value, I got a hunch
another axis can be limited to 27 cabinets per value, and the final
axis has to connect to every cabinet for every value. Let's identify
the 27 cabinets per value:

Let's conceptually divide each card cage in half. Each half-cage
contains 16 CPUs, and there are 6 half-cages per cabinet. Each
half-cage can contain all the 16 possible values of Y for given values
of X,Z. Six half-cages per cabinet times 4 cabinets deep equals 24,
which is the mesh's Z dimension. So:

Each of 27 X values must connect to 4 cabinets.
Each of 16 Y values must connect to all 108 cabinets.
Each of 24 Z values must connect to 27 cabinets.

Within each cabinet, just one X value connects to all 96 CPUs.
Within each cabinet, a given Y value connects to one CPU in each of
six half-cages.
Within each cabinet, a given Z value connects to all 16 CPUs in a
given half-cage.

Only one X value per cabinet.
All 16 Y values per cabinet.
Six Z values per cabinet.

The above information is what it takes to convert the Red Storm map
(mesh) to the Red Storm territory (physical implementation).

The message you should get out of this is that if a given 3D mesh-type
MPU doesn't conform in some logical way to the 2-D physical cabinet
layout, there's big trouble.

They chose an optimally-bad aspect ratio for their cabinets, maximum
number of *required* external connections for volume of CPU's
contained. They designed it that way, they gotta run the wires. The
big trouble is of their own choosing.

The reason some of us software types hang around the hardware types is
because:

1. Software types are forever inventing new stuff to talk about.
Since you don't have actually to *build* anything to create a new
software something or other, web pages are forever springing up with
new something-or-others all the time and the conversations are
endless. If a software concept is important enough to penetrate an
enclave of hardware types, then it's probably worth paying attention
to. Very effective spam filter.

2. We don't want the hardware types pulling any fast ones on us, and
what I think you are suggesting I would put into the category of a
fast one. Now, the light bulbs that shine in the crania of some of
the people with the actual money (*not* anyone at a national
laboratory) are sometimes pretty dim, but not so dim that they would
let the fast one you are proposing get by them. You imply by diagram
even to a lifer in Washington that every CPU is separately connected
in x, y, and z to its nearest neighbor in all three directions, it had
better be damn well be separately connected to its nearest neighbor in
all three directions.
It's easier to see this if you consider the physical layout to be 27
cabinets wide by 24 half-cages deep, with 16 CPUs per half-cage.
Voila, a simple 27 by 16 by 24 mesh. ;-)

<http://www.lanl.gov/orgs/ccn/salishan2003/pdf/camp.pdf>

The above link is for a 77-slide presentation by the customers, not by
Cray. ~3.6 meg download. It covers (justifies) the design choices
made.

I have a bone to pick with slide 59, the one that describes a "full
crossbar" as some folks' Holy Grail. I call a 10,368 by 10,368
crossbar a "bottomless money pit". When you fill the pit with money,
you have a really well-connected MPU. But the pit is bottomless...

And no one attempts such a thing (at least not yet). If you need to
connect thousands of CPU's with current technology, you use a
hierarchy of switches. There's a trade between the fatness of the
cross-bar and the depth of the hierarchy.

Red Storm is a packet-switched network. Last time I checked, our
national telephone system was circuit-switched, which is why you can
still get an "all circuits are busy" message. That's an awful lot of
telephones that run through a cross-bar somewhere.

RM
 
Back
Top