NetNews Usenet Archive 1992 #31

home *** CD-ROM | disk | FTP | other *** search

/ NetNews Usenet Archive 1992 #31 / NN_1992_31.iso / spool / comp / parallel / 2790 < prev next >

Wrap

Text File | 1992-12-22 | 2.1 KB | 51 lines

Newsgroups: comp.parallel Path: sparky!uunet!gatech!hubcap!fpst From: kaminsky-david@CS.YALE.EDU (David Kaminsky) Subject: Re: Linda / The Parform Message-ID: <1992Dec21.205113.23801@hubcap.clemson.edu> Sender: fpst@hubcap.clemson.edu (Steve Stevenson) Organization: Yale University Computer Science Dept., New Haven, CT 06520-2158 References: <1992Dec21.132725.23905@hubcap.clemson.edu> Date: 21 Dec 1992 15:42:31 -0500 Approved: parallel@hubcap.clemson.edu Lines: 38 In article <1992Dec21.132725.23905@hubcap.clemson.edu> Steven Ericsson Zenith <z enith@kai.com> writes: > >I find these numbers difficult to believe. > ... >Firstly, bells start ringing for the single processor case. Why? >Because they are all the same and to my knowledge these systems don't >all use the same compiler. I expect to see some variation. >No indication is given of what the 1 processor time means - is this the >sequential execution time under the respective system compiler? It >should be. The above numbers can only begin to make sense if the base >compiler *is* the same in all cases - otherwise we do not know what we >are comparing. > I think I can answer some of Steve's concerns. When evaluating parallel systems, honest speedup is measured against an optimized sequential code. I believe that is what they have done. (I worked with them in producing the Linda measurements.) >Since this problem is obviously more than embarrassingly parallel I'd >like to know how much, if any, of the interaction mechanism was used >during computation. If the answer is, as I suspect, that after the data >and work distribution, insignificant interaction took place then the >above tells us something about the parallel decomposition of the problem >but sweet Fanny Adams about any of the systems tested. > The problem examined was an iterative heat diffusion code. The matrix is decomposed into blocks. At each iteration, boundary data is exchanged between pairs processors holding adjacent blocks. The time measured was wallclock time. For more information on the algorithm used, I'd recommend reading the paper. David