home *** CD-ROM | disk | FTP | other *** search
- Newsgroups: comp.parallel
- Path: sparky!uunet!gatech!hubcap!fpst
- From: kaminsky-david@CS.YALE.EDU (David Kaminsky)
- Subject: Re: Linda / The Parform
- Message-ID: <1992Dec21.205113.23801@hubcap.clemson.edu>
- Sender: fpst@hubcap.clemson.edu (Steve Stevenson)
- Organization: Yale University Computer Science Dept., New Haven, CT 06520-2158
- References: <1992Dec21.132725.23905@hubcap.clemson.edu>
- Date: 21 Dec 1992 15:42:31 -0500
- Approved: parallel@hubcap.clemson.edu
- Lines: 38
-
- In article <1992Dec21.132725.23905@hubcap.clemson.edu> Steven Ericsson Zenith <z
- enith@kai.com> writes:
- >
- >I find these numbers difficult to believe.
- > ...
- >Firstly, bells start ringing for the single processor case. Why?
- >Because they are all the same and to my knowledge these systems don't
- >all use the same compiler. I expect to see some variation.
- >No indication is given of what the 1 processor time means - is this the
- >sequential execution time under the respective system compiler? It
- >should be. The above numbers can only begin to make sense if the base
- >compiler *is* the same in all cases - otherwise we do not know what we
- >are comparing.
- >
-
- I think I can answer some of Steve's concerns.
-
- When evaluating parallel systems, honest speedup is measured
- against an optimized sequential code. I believe that is what they
- have done. (I worked with them in producing the Linda measurements.)
-
- >Since this problem is obviously more than embarrassingly parallel I'd
- >like to know how much, if any, of the interaction mechanism was used
- >during computation. If the answer is, as I suspect, that after the data
- >and work distribution, insignificant interaction took place then the
- >above tells us something about the parallel decomposition of the problem
- >but sweet Fanny Adams about any of the systems tested.
- >
-
- The problem examined was an iterative heat diffusion code.
- The matrix is decomposed into blocks. At each iteration, boundary
- data is exchanged between pairs processors holding adjacent blocks.
- The time measured was wallclock time. For more information on the
- algorithm used, I'd recommend reading the paper.
-
- David
-
-
-