Context Navigation

← Previous Change
Wiki History
Next Change →

Changes between Version 9 and Version 10 of ticket/0677_mpp_rep

Timestamp:: 2010-07-19T00:30:33+02:00 (14 years ago)
Author:: rblod
Comment:: --

Legend:

: Unmodified
: Added
: Removed
: Modified

ticket/0677_mpp_rep

-                      v9
+                      v10
 '''ticket''' : #677
+'''Branch''' : [https://forge.ipsl.jussieu.fr/nemo/browser/branches/DEV_r1879_mpp_rep  DEV_r1879_mpp_rep ]
+'''Branch''' : [https://forge.ipsl.jussieu.fr/nemo/browser/branches/DEV_r1879_mpp_rep DEV_r1879_mpp_rep]
 ----
 === Description ===
 Implementation of both methods to get mpp reproducibility, one from ECMWF (key_mpp_rep1) and the other from DFO (key_mpp_rep2). The target is to choose one, thanks to my reviewer's advices, but athis time (7th of June), I made an intensive use of cpp keys to delimit clearly the both methods.
 Both are based on the Idea of self compensated summation, see the paper "Using Accurate Arithmetics to Improve Numerical Reproducibility and Stability in parallel applications, Yun He and Chris Ding, Journal of supercomputing, Vol 18, Number 3, pages 259-277, doi 10.1023/A1008153532043.
+Both (or at least rep2, rep1 as far as I understand)) are based on the Idea of self compensated summation, see the paper "Using Accurate Arithmetics to Improve Numerical Reproducibility and Stability in parallel applications, Yun He and Chris Ding, Journal of supercomputing, Vol 18, Number 3, pages 259-277, doi 10.1023/A1008153532043.
 We have:
+We have(,Knuth's trick(The Art of Computer Programming’,  Vol 2, p. 203),
+sum = a+b
+Let u and v be the two sp-numbers.
+error = b + (a-sum)
+ Compute u’=(u+v)-v, v’=(u+v)-u  and v”=(u+v)-v’
+In the next addition, the error is first added back :
+Under very general conditions (concerning the reliability of rounding  procedures) the following theorem holds:
 (sum,error) = SCS(a,b)
+Double_prec_sum(u,v) = (u + v)  +  ( (u-u’) + (v-v”) )
+(sum1,error1) = SCS(sum,c+error)
+                                  |                                  |
+                  most significant                      least significant
+                             part of result       part
+where ‘+’ and ‘-’ mean the usual single-precision addition and subtraction. So we keep track of the truncation error and add it.
 These methods have been implemented in a new module lib_fortran.F90 with a few additions in lib_mpp.F90. In the sake of simplicity, I implemented a glob_sum function which is either a standard one( SUM + CALL mpp_sum), either one of the otw methods and the switch is done in lib_fortran.
 …
 Performance: tested on IBM Pwer6 with ORCA025 :
+|| \    ||               STD            ||                   REP1          ||                  REP2             ||
+||186||695.845 , 543.695 ||  690.451 , 560.091  ||  714.916  ,  566.557 ||
+||216||709.906 , 564.650 ||  729.994 , 583.716  ||  710.971  ,  568.351 ||
+||\||STD||REP1||REP2||
+||186||695.845 , 543.695||690.451 , 560.091||714.916  ,  566.557||
+||216||709.906 , 564.650||729.994 , 583.716||710.971  ,  568.351||
 average  Elapsed Time (s),CPU Time (s)
 === Testing ===

New URL for NEMO forge! http://forge.nemo-ocean.eu

Context Navigation

Changes between Version 9 and Version 10 of ticket/0677_mpp_rep

Legend:

ticket/0677_mpp_rep