The Top500 list - an analytical and critical view

Munich 31 July 2001 Since 1973 the top team Jack Dongarra, Hans Meuer and Erich Strohmaier publish the top 500 list twice a year, the odd number in June at Meuer's Supercomputer Conference, now in Heidelberg located, and the even lists at the IEEE SC conference. In the beginning it was quite interesting to analyse the list. But during the years it developed a life of its own, driven by the "marketers" of the HPC vendors. This view highlights two sides, the political and the analytical aspects. It is a personal view and summarises some discussions and a talk given by me at the Hewlett-Packard HPC users group meeting in Barcelona last year.

The history

Professor Hans Werner Meuer started in June 1986 his - at that time - German based Supercomputer Conference in the halls of University in Mannheim - by the way earlier than the IEEE Supercomputer. He presented the 15 German Supercomputer Centres with their machines. They fitted without problems on one slide. During the years this list grew steadily. Then the question raised up, what is a supercomputer and what characterises it? At that time Jack Dongarra's Linpack benchmark was well known and accepted. Every computer vendor used it and published the result in Jack's report. It started with a matrix size of the linear equation of 100x100. But soon Jack extended the dimension to 1000x1000. Rumours say that the idea of the Top500 grew, when Hans Meuer and Jack Dongarra drank beer on one of these conferences. The computers should be measured following a maximal possible linear equation size and then the first 500 ranked, based on the Linpack performance. In June 1993 they published in Mannheim the first Top500 list.

The first and the 17th list

This year Hans Werner Meuer presented the 17th list in Heidelberg and supplied the reader with an important contribution to the history of supercomputing since 1993. In the first June list we find the Thinking Machines CM5 with 1024 processors and a Linpack performance, named Rmax in the table, of 59.7 GFlop/s (billions of floating point operations per second) and a peak performance (Rpeak) of 131 GFlop/s. In the fall list the numerical wind tunnel from Fujitsu with 140 (vector) processors, an Rmax of 124.5 GFlop/s and an Rpeak of 236 GFlop/s. This computer demonstrates the optimisation possibilities of the Linpack Program. In 1993 Rmax of the Fujitsu achieved 53% of the peak performance. In 1996 the computer has been extended to 167 processors and gained 281 GFlop/s Rpeak and 229 GFlop/s Rmax, 81% efficiency. Number 500 in June 1993 was a Fujitsu Siemens computer VP-200, in 1985 at IABG in Ottobrunn and 1990 outsourced to debis Systemhaus with 0.422 GFlop/s Rmax und .533 GFlop/s Rpeak. At the time of its installation it was the first Japanese vector supercomputer in Europe and one of the fastest with 64 MB main memory. By the way, the author took care for the industrial users of this machine for seven years, advisory service, vectorisation, optimisation, education, marketing and sales.

Today IBM's ASCI White is ahead with 7. 226 TeraFlop/s Rmax - a factor of 121 faster than the CM5 from 1993. It uses 8193 Power3 Processors with 375 MHz and has an Rpeak of 12 TeraFlop/s, 60% efficiency. Last November the same computer only achieved 4.9 TeraFlop/s, only 41% efficiency. IBM had done a good job in optimising the code.

The Top500 Table

The Top500 table in HTML-format contains the rank, the vendor, the computer, the Linpack performance (Rmax), the installation site - mostly with web addresses, the country, the year of installation or upgrade, the number of processors, the peak performance (Rpeak) - this is the performance vendors guarantee not to exceed, NMAX und N1/2. The last two figures are for experts or specialists but have a high entropy. NMAX is the dimension of the solved linear equation and N1/2 is the dimension of the system, at which the computer achieves half of the peak performance. Both figures are optional. As an example, I look at the Hitachi SR8000 at Leibnizrechenzentrums of the Bavarian Academy of Sciences in München. With a matrix size NMAX of 120 000 the computer gets 1035 GFlop/s (Rmax), with a matrix size of only 15160 (N1/2) he achieves 672 GFlop/s. Thus one can conclude that this machine has a good performance with "minor" problems. Last November IBM's ASCI White did not got half of the peak performance, with a dimension of 430 000 it reached "only" 4938 GFlop/s. But this year, as mentioned earlier, the IBM team optimised the Linpack. Now, with a dimension of 518 000 it gets 7226 GFlop/s and with a dimension of 179 000 (N1/2) should get 6144 GFlop/s.

Technically the list is very interesting and contains a lot of information. One example is the efficiency, the relationship of Rmax and Rpeak in the Linpack benchmark. Looking for the first systems one gets this list:
Computer Rmax*100/Rpeak (%)
IBM ASCI White 59%
Intel ASCI Red 74%
Hitachi SR8000 82%
SGI Blue Mountain 52%
NEC SX-5 93%
Cray T3E 1200 49%
Fujitsu VPP5000 92%
In the November 2000 list, IBM ASCI White gained only 40% - not reaching N1/2. This demonstrates that the optimal implementation, the compiler power and intelligence is very important. As the vendors know this and they run the benchmark themselves, the figures are realistic and comparable.

An other important aspect is the scaling. Does the computers in the list scale with higher processor numbers. The Rmax results show whether the increasing processor number results in a similar increase in the performance.

Top500 and the science politics

The Top500 list is extremely easy to understand, as the computers are ranked following the Rmax value. Only one number is important, the rank! This pleases the scientific boards or the Father of the Country and the science politics. This happened for example, when Germany got rank 5 in the world list with the Leibniz Computer Center of the Bavarian Academy of Sciences in Munich one year ago. The Wissenschaftsrat in Germany, Scientific Council, regularly makes use of this list in his recommendations and papers. Thus he mentioned in his recommendations from 1995 that in Germany the top systems are missing, as Germany had no computer in the first fifty ranks. On the other hand, politicians are happy to mention that their system is highly ranked.

Top500 and the vendors

The computer vendors send their Linpack results to the Triumvirat, Dongarra, Meuer, Strohmaier. In the meantime - within these eight years - they recognised the huge marketing potential. All computers are presented, which get the performance and which are allowed by their customers to be mentioned. There will never run a Linpack on most of the commercial computers. An example are the computers in the banking scene, where they run risk analyses or SAP R/3 problems. An other field is the Telecommunication industry. Thus the Top500 list developed its own way of life, never expected and thought by the authors.

On the other hand, the integration of commercial systems in this list underpins the importance of those computers for scientific high-performance computing. The clusters and parallel computers are now used in both arenas.

The Top500 list over time

An important information delivers the time. Eight years are a long time, where one can see the different developments in computer architecture and processor structure. In the beginning life was so easy, most of the systems have been vector supercomputers, 340 out of 500. About 100 have to be counted to MPP systems like CM5 or paragon, nearly 40 belong to the SIMD architecture, today gone like the dinosaurs. This year there are only 47 vector computers and 453 MPPs. The 500 computers divide into 314 MPPs, 113 Systems based on constellation nodes, 40 SMPs and 33 clusters.

Other topics of this historic view are the vendors - a lot of them have gone, they are no longer with us. Others, not so active in the supercomputer arena - like IBM - have become the leader in the Top500 list. Interesting is the look to the applications, the move from engineering to database and other commercial usage.

The Top500 Web Site

The website have been redesigned some time ago and now allows sublists, the user can specify. The access to old lists and analyses are possible. The new site delivers a lot of information and the interested reader can sit hours at the computer and view all the possible information.

Linpack Benchmark

The program solves a dense linear equation. For the Top 500 the vendors can scale the problem to the maximum memory size and optimise the software to gain the best performance. The method must be identical in the number of floating point operations as the LU factorisation: number of floating point operations: = 2/3 N exp 3 + O (N exp 2). This excludes the usage of fast matrix multipliers like the Strassen algorithm. As the vendors like top Rmax, some used the Strassen algorithm, which needs less floating point operations. Then they divided the results by the number of LU operations. This lead to the curious situation that the supercomputer exceeded in the Linpack the peak performance. When I received the list some years ago before publishing, I noted this fact. The vendor then revised the results that then lay below Rpeak.

http://www.top500.org


Uwe Harms

[News on Advanced IT][Calendar][Analysis][IT in Medicine]