New URL for NEMO forge!   http://forge.nemo-ocean.eu

Since March 2022 along with NEMO 4.2 release, the code development moved to a self-hosted GitLab.
This present forge is now archived and remained online for history.
Chap_MISC.tex in trunk/DOC/TexFiles/Chapters – NEMO

source: trunk/DOC/TexFiles/Chapters/Chap_MISC.tex @ 999

Last change on this file since 999 was 998, checked in by gm, 16 years ago

trunk - DOC - update .tex due to the change in position of NEMO_book

  • Property svn:executable set to *
File size: 38.9 KB
Line 
1% ================================================================
2% Chapter Ñ Miscellaneous Topics
3% ================================================================
4\chapter{Miscellaneous Topics (xxx)}
5\label{MISC}
6\minitoc
7
8% ================================================================
9% Representation of Unresolved Straits
10% ================================================================
11\section{Representation of Unresolved Straits}
12\label{MISC_strait}
13
14% In climate modeling, it is often necessary to allow water masses that are separated by land to exchange properties. This situation arises in models when the grid mesh is too coarse to resolve narrow passageways that in reality provide crucial connections between water masses. For example, coarse grid spacing typically closes off the Mediterranean from the Atlantic at the Straits of Gibraltar. In this case, it is important for climate models to include the effects of salty water entering the Atlantic from the Mediterranean. Likewise, it is important for the Mediterranean to replenish its supply of water from the Atlantic to balance the net evaporation occurring over the Mediterranean region.
15% Same problem occurs as soon as important straits are not properly resolved by the mesh. For example it arises in ORCA 1/4\r{} in several straits of the Indonesian archipelago (Ombai, Lombok...).
16% We describe here the three methods that can be used in \NEMO to handle such improperly resolved straits.
17%The problem is resolved here by allowing the mixing of tracers and mass/volume between non-adjacent water columns at nominated regions within the model. Momentum is not mixed. The scheme conserves total tracer content, and total volume (the latter in $z*$- or $s*$-coordinate), and maintains compatibility between the tracer and mass/volume budgets. 
18
19% Note: such changes are so specific to a given configuration that no attempt has been made to set them in a generic way. Nevertheless, an example of how they can be set up is given in the ORCA2 configuration (\key{ORCA_R2}).
20
21% -------------------------------------------------------------------------------------------------------------
22%       Hand made geometry changes
23% -------------------------------------------------------------------------------------------------------------
24\subsection{Hand made geometry changes}
25\label{MISC_strait_hand}
26
27$\bullet$ reduced scale factor, also called partially open face (Hallberg, personnal communication 2006)
28%also called partially open cell  or partial closed cells
29
30$\bullet$ increase of the viscous boundary layer thickness by local increase of the fmask value at the coast
31
32\colorbox{yellow}{Add a short description of scale factor changes staff and fmask increase}
33
34
35%>>>>>>>>>>>>>>>>>>>>>>>>>>>>
36\begin{figure}[!tbp] \label{Fig_MISC_strait_hand}  \begin{center}
37\includegraphics[width=0.80\textwidth]{./TexFiles/Figures/Fig_Gibraltar.pdf}
38\includegraphics[width=0.80\textwidth]{./TexFiles/Figures/Fig_Gibraltar2.pdf}
39\caption {Example of the Gibraltar strait defined in a 1\r{} x 1\r{} mesh.
40\textit{Top}: using partially open cells. The meridional scale factor at $v$-point
41is reduced on both sides of the strait to account for the real width of the strait
42(about 20 km). Note that the scale factors of the strait $T$-point remains unchanged. \textit{Bottom}: using viscous boundary layers. The four fmask parameters
43along the strait coastlines are set to a value larger than 4, $i.e.$ "strong" no-slip
44case (see Fig.\ref{Fig_LBC_shlat}) creating a large viscous boundary layer
45that allows a reduced transport through the strait.}
46\end{center}   \end{figure}
47%>>>>>>>>>>>>>>>>>>>>>>>>>>>>
48
49% -------------------------------------------------------------------------------------------------------------
50% Cross Land Advection
51% -------------------------------------------------------------------------------------------------------------
52\subsection{Cross Land Advection (\textit{tracla} module)}
53\label{MISC_strait_cla}
54
55%--------------------------------------------namcro--------------------------------------------------------
56\namdisplay{namcro} 
57%--------------------------------------------------------------------------------------------------------------
58
59\colorbox{yellow}{Add a short description of CLA staff here or in lateral boundary condition chapter?}
60
61% ================================================================
62% Closed seas
63% ================================================================
64\section{Closed seas}
65\label{MISC_closea}
66
67
68% ================================================================
69% Sub-Domain Functionality (\textit{nizoom, njzoom}, namelist parameters)
70% ================================================================
71\section{Sub-Domain Functionality (\jp{jpizoom}, \jp{jpjzoom})}
72\label{MISC_zoom}
73
74The sub-domain functionality, also improperly called the zoom option
75(improperly because it is not associated with a change in model resolution)
76is a quite simple function that allows a simulation over a sub-domain of an
77already defined configuration ($i.e.$ without defining a new mesh, initial
78state and forcings). This option can be useful for testing the user settings
79of surface boundary conditions, or the initial ocean state of a huge ocean
80model configuration while having a small computer memory requirement.
81It can also be used to easily test specific physics in a sub-domain (for example,
82see \citep{Madec1996} for a test of the coupling used in the global ocean
83version of OPA between sea-ice and ocean model over the Arctic or Antarctic
84ocean, using a sub-domain). In the standard model, this option does not
85include any specific treatment for the ocean boundaries of the sub-domain:
86they are considered as artificial vertical walls. Nevertheless, it is quite easy
87to add a restoring term toward a climatology in the vicinity of such boundaries
88(see \S\ref{TRA_dmp}).
89
90In order to easily define a sub-domain over which the computation can be
91performed, the dimension of all input arrays (ocean mesh, bathymetry,
92forcing, initial state, ...) are defined as \jp{jpidta}, \jp{jpjdta} and \jp{jpkdta} 
93(\mdl{par\_oce} module), while the computational domain is defined through
94\jp{jpiglo}, \jp{jpjglo} and \jp{jpk} (\mdl{par\_oce} module). When running the
95model over the whole domain, the user sets \jp{jpiglo}=\jp{jpidta} \jp{jpjglo}=\jp{jpjdta} 
96and \jp{jpk}=\jp{jpkdta}. When running the model over a sub-domain, the user
97has to provide the size of the sub-domain, (\jp{jpiglo}, \jp{jpjglo}, \jp{jpkglo}),
98and the indices of the south western corner as \jp{jpizoom} and \jp{jpjzoom} in
99the \mdl{par\_oce} module (Fig.~\ref{Fig_LBC_zoom}).
100
101Note that a third set of dimensions exist, \jp{jpi}, \jp{jpj} and \jp{jpk} which is
102actually used to perform the computation. It is set by default to \jp{jpi}=\jp{jpjglo} 
103and \jp{jpj}=\jp{jpjglo}, except for massively parallel computing where the
104computational domain is laid out on local processor memories following a 2D
105horizontal splitting. % (see {\S}IV.2-c) ref to the section to be updated
106
107%>>>>>>>>>>>>>>>>>>>>>>>>>>>>
108\begin{figure}[!ht] \label{Fig_LBC_zoom}  \begin{center}
109\includegraphics[width=0.90\textwidth]{./TexFiles/Figures/Fig_LBC_zoom.pdf}
110\caption {Position of a model domain compared to the data input domain when the zoom functionality is used.}
111\end{center}   \end{figure}
112%>>>>>>>>>>>>>>>>>>>>>>>>>>>>
113
114
115% ================================================================
116% 1D model functionality
117% ================================================================
118\section{Water column model: 1D model (\key{cfg\_1d})}
119\label{MISC_1D}
120
121The 1D model option simulates a stand alone water column within the 3D \NEMO 
122system. It can be applied to the ocean alone or to the ocean-ice system and can
123include passive tracers or a biogeochemical model. It is set up by defining the
124\key{cfg\_1d} CPP key. This 1D model is a very useful tool  \textit{(a)} to learn
125about the physics and numerical treatment of vertical mixing processes ; \textit{(b)} 
126to investigate suitable parameterizations of unresolved turbulence (wind steering,
127langmuir circulation, skin layers) ; \textit{(c)} to compare the behaviour of different
128vertical mixing schemes  ; \textit{(d)} to perform sensitivity studies on the vertical
129diffusion at a particular point of the ocean global domain ; \textit{(d)} to produce
130extra diagnostics, without the large memory requirement of the full 3D model.
131
132The methodology is based on the use of the zoom functionality (see
133\S\ref{MISC_zoom}), with some extra routines. There is no need to define a new
134mesh, bathymetry, initial state or forcing, since the 1D model will use those of the
135configuration it is a zoom of.
136
137% ================================================================
138% Accelerating the Convergence
139% ================================================================
140\section{Accelerating the Convergence (\np{nn\_acc} = 1)}
141\label{MISC_acc}
142%--------------------------------------------namdom-------------------------------------------------------
143\namdisplay{namdom} 
144%--------------------------------------------------------------------------------------------------------------
145
146% Steven update just bellow...   not sure it  is what I vant to say
147%Searching an equilibrium state with an ocean model requires repeated very long time
148%integrations (each of a few thousand years for a global model). Due to the size of
149Searching an equilibrium state with an ocean model requires very long time
150integration (a few thousand years for a global model). Due to the size of
151the time step required for numerical stability (less than a
152few hours), this usually requires a large elapsed time. In order to overcome
153this problem, \citet{Bryan1984} introduces a technique that is intended to accelerate
154the spin up to equilibrium. It uses a larger time step in
155the thermodynamic evolution equations than in the dynamic evolution
156equations. It does not affect the equilibrium solution but modifies the
157trajectory to reach it.
158
159The acceleration of convergence option is used when \np{nn\_acc}=1. In that case,
160$\Delta t=rdt$ is the time step of dynamics while $\widetilde{\Delta t}=rdttra$ is the
161tracer time-step. Their values are set from the \np{rdt} and \np{rdttra}  namelist
162parameters. The set of prognostic equations to solve becomes:
163\begin{equation} \label{Eq_acc}
164\begin{split}
165\frac{\partial \textbf{U}_h }{\partial t} 
166   &\equiv \frac{\textbf{U}_h ^{t+1}-\textbf{U}_h^{t-1} }{2\Delta t} = \ldots \\ 
167\frac{\partial T}{\partial t} &\equiv \frac{T^{t+1}-T^{t-1}}{2 \widetilde{\Delta t}} = \ldots \\ 
168\frac{\partial S}{\partial t} &\equiv \frac{S^{t+1} -S^{t-1}}{2 \widetilde{\Delta t}} = \ldots \\ 
169\end{split}
170\end{equation}
171
172\citet{Bryan1984} has examined the consequences of this distorted physics.
173Free waves have a slower phase speed, their meridional structure is slightly
174modified, and the growth rate of baroclinically unstable waves is reduced
175but with a wider range of instability. This technique is efficient for
176searching for an equilibrium state in coarse resolution models. However its
177application is not suitable for many oceanic problems: it cannot be used for
178transient or time evolving problems (in particular, it is very questionable
179to use this technique when there is a seasonal cycle in the forcing fields),
180and it cannot be used in high-resolution models where baroclinically
181unstable processes are important. Moreover, the vertical variation of
182$\widetilde{ \Delta t}$ implies that the heat and salt contents are no longer
183conserved due to the vertical coupling of the ocean level through both
184advection and diffusion.
185% first mention of depth varying tilda-delta-t!   and namelist parameter associated to that are to be described
186
187
188% ================================================================
189% Model optimisation, Control Print and Benchmark
190% ================================================================
191\section{Model Optimisation, Control Print and Benchmark}
192\label{MISC_opt}
193%--------------------------------------------namdom-------------------------------------------------------
194\namdisplay{nam_ctl} 
195%--------------------------------------------------------------------------------------------------------------
196
197% \gmcomment{why not make these bullets into subsections?}
198
199Three issues to be described here:
200
201$\bullet$ Vector and memory optimisation:
202
203\key{vectopt\_loop} enables internal loop collapse, a very efficient way to increase
204the length of vector calculations and thus speed up the model on vector computers.
205 
206% Add here also one word on NPROMA technique that has been found useless, since compiler have made significant progress during the last decade.
207 
208% Add also one word on NEC specific optimisation (Novercheck option for example)
209
210\key{vectopt\_memory} has been introduced in order to reduce the memory
211requirement of the model. This is obviously done at the cost of increasing the
212CPU time requirement, since it suppress intermediate computations which would
213have been saved in in-core memory. This feature has not been intensively used.
214In fact, currently it is only implemented for the TKE physics, in which,  when
215\key{vectopt\_memory} is defined, the coefficients used for horizontal smoothing
216of $A_v^T$ and $A_v^m$ are no longer computed once and for all. This reduces
217the memory requirement by three 2D arrays.
218
219 
220$\bullet$ Control print %: describe here 4 things:
221
2221- \np{ln\_ctl} : compute and print the trends averaged over the interior domain
223in all TRA, DYN, LDF and ZDF modules. This option is very helpful when
224diagnosing the origin of an undesired change in model results.
225
2262- also \np{ln\_ctl} but using the nictl and njctl namelist parameters to check
227the source of differences between mono and multi processor runs.
228
2293- \key{esopa} (to be rename key\_nemo) : which is another option for model
230management. When defined, this key forces the activation of all options and
231CPP keys. For example, all tracer and momentum advection schemes are called!
232There is therefore no physical meaning associated with the model results.
233However, this option forces both the compiler and the model to run through
234all the \textsc{Fortran} lines of the model. This allows the user to check for obvious
235compilation or execution errors with all CPP options, and errors in namelist options.
236
2373- \key{esopa} (to be rename key\_nemo) : which is another option for model
238management. When defined, this key forces the activation of all options and CPP keys.
239For example, all tracer and momentum advection schemes are called! There is
240therefore no physical meaning associated with the model results. However, this option
241forces both the compiler and the model to run through all the Fortran lines of the model.
242This allows the user to check for obvious compilation or execution errors with all CPP
243options, and errors in namelist options.
244
2454- last digit comparison (\np{nbit\_cmp}). In an MPP simulation, the computation of
246a sum over the whole domain is performed as the summation over all processors of
247each of their sums over their interior domains. This double sum never gives exactly
248the same result as a single sum over the whole domain, due to truncation differences.
249The "bit comparison" option has been introduced in order to be able to check that
250mono-processor and multi-processor runs give exactly the same results.
251
252$\bullet$  Benchmark (\np{nbench}). This option defines a benchmark run based on
253a GYRE configuration in which the resolution remains the same whatever the domain
254size. This allows a very large model domain to be used, just by changing the domain
255size (\jp{jpiglo}, \jp{jpjglo}) and without adjusting either the time-step or the physical
256parameterizations.
257
258
259% ================================================================
260% Elliptic solvers (SOL)
261% ================================================================
262\section{Elliptic solvers (SOL directory)}
263\label{MISC_sol}
264%--------------------------------------------namdom-------------------------------------------------------
265\namdisplay{namsol} 
266%--------------------------------------------------------------------------------------------------------------
267
268The computation of the surface pressure gradient with a rigid lid assumption
269requires the computation of $\partial_t \psi$, the time evolution of the
270barotropic streamfunction. $\partial_t \psi$ is the solution of an elliptic
271equation \eqref{Eq_PE_psi} for which three solvers are available: a
272Successive-Over-Relaxation scheme (SOR), a preconditioned conjugate gradient
273scheme(PCG) \citep{Madec1988, Madec1990} and a Finite Elements Tearing and
274Interconnecting scheme (FETI) \citep{Guyon_al_EP99, Guyon_al_CalPar99}.
275The PCG is a very efficient method for solving elliptic equations on vector computers.
276It is a fast and rather easy method to use; which are attractive features for a large
277number of ocean situations (variable bottom topography, complex coastal geometry,
278variable grid spacing, islands, open or cyclic boundaries, etc ...). It does not require
279a search for an optimal parameter as in the SOR method. However, the SOR has
280been retained because it is a linear solver, which is a very useful property when
281using the adjoint model of \NEMO. The FETI solver is a very efficient method on
282massively parallel computers. However, it has never been used since OPA 8.0.
283The current version in \NEMO should not even successfully go through the
284compilation phase.
285
286The surface pressure gradient is computed in \mdl{dynspg}. The solver used
287(PCG or SOR) depending on the value of \np{nsolv} (namelist parameter).
288At each time step the time derivative of the barotropic streamfunction is
289the solution of \eqref{Eq_psi_total}. Introducing the following coefficients:
290
291\begin{equation} 
292\begin{aligned}
293&C_{i,j}^{NS} &&= \frac{e_{2v}(i,j)}{\left( H_v (i,j) e_{1v} (i,j)\right)}\\
294&C_{i,j}^{EW} &&= \frac{e_{1u}(i,j)}{\left( H_u (i,j) e_{2u} (i,j)\right)}\\
295&B_{i,j} &&= \delta_i \left( e_{2v}M_v \right) - \delta_j \left( e_{1u}M_u \right)\\
296\end{aligned}
297\end{equation}
298
299the five-point finite difference equation \eqref{Eq_psi_total} can be rewritten as:
300
301\begin{multline} \label{Eq_solmat}
302C_{i+1,j}^{NS} {   \left\frac{\partial \psi}{\partial t } \right)    }_{i+1,j} + \;
303C_{i,j+1}^{EW}{   \left\frac{\partial \psi}{\partial t } \right)    }_{i,j+1} + \;
304C_{i,j}    ^{NS} {   \left\frac{\partial \psi}{\partial t } \right)    }_{i-1,j} + \;
305C_{i,j}    ^{EW}{   \left\frac{\partial \psi}{\partial t } \right)    }_{i,j-1}\\
306-\left(C_{i+1,j}^{NS} + C_{i,j+1}^{EW} + C_{i,j}^{NS} + C_{i,j}^{EW} \right)   {   \left\frac{\partial \psi}{\partial t } \right)    }_{i,j}  =  B_{i,j}
307\end{multline}
308\eqref{Eq_solmat} is a linear symmetric system of equations. All the elements of
309the corresponding matrix \textbf{A} vanish except those of five diagonals. With
310the natural ordering of the grid points (i.e. from west to east and from
311south to north), the structure of \textbf{A} is block-tridiagonal with
312tridiagonal or diagonal blocks. \textbf{A} is a positive-definite symmetric
313matrix of size $(jpi \cdot jpj)^2$, and \textbf{B}, the right hand side of
314\eqref{Eq_solmat}, is a vector.
315
316% -------------------------------------------------------------------------------------------------------------
317%       Successive Over Relaxation
318% -------------------------------------------------------------------------------------------------------------
319\subsection{Successive Over Relaxation \np{nsolv}=2}
320\label{MISC_solsor}
321
322Let us introduce the four cardinal coefficients: $A_{i,j}^S = C_{i,j}^{NS}/D_{i,j}\,$,
323$A_{i,j}^W = C_{i,j}^{EW}/D_{i,j}\,$, $A_{i,j}^E = C_{i,j+1}^{EW}/D_{i,j}\,$ and $A_{i,j}^N = C_{i+1,j}^{NS}/D_{i,j}\,$, and define
324$\tilde B_{i,j} = B_{i,j}/D_{i,j}\,$, where $D_{i,j} = C_{i,j}^{NS}+ C_{i+1,j}^{NS} + C_{i,j}^{EW} + C_{i,j+1}^{EW} $ (i.e. the diagonal of
325\textbf{A}). (VII.5.1) can be rewritten as:
326
327\begin{multline} \label{Eq_solmat_p}
328    A_{i,j}^{N}  {   \left\frac{\partial \psi}{\partial t } \right)    }_{i+1,j   } 
329+\,A_{i,j}^{E}  {   \left\frac{\partial \psi}{\partial t } \right)    }_{i    ,j+1}       \\
330+\,A_{i,j}^{S}  {   \left\frac{\partial \psi}{\partial t } \right)    }_{i-1 ,j    } 
331+\,A_{i,j}^{W} {   \left\frac{\partial \psi}{\partial t } \right)    }_{i    ,j-1} 
332               -  {   \left\frac{\partial \psi}{\partial t } \right)    }_{i,j} = \tilde B_{i,j}
333\end{multline}
334
335The SOR method used is an iterative method. Its algorithm can be summarised
336as follows [see \citet{Haltiner1980} for a further discussion]:
337
338initialisation (evaluate a first guess from previous time step computations)
339\begin{equation}
340\left\frac{\partial \psi}{\partial t } \right)_{i,j}^0 = 2{\left\frac{\partial \psi}{\partial t } \right)_{i,j}^t} - {\left\frac{\partial \psi}{\partial t } \right)_{i,j}^{t-1}}
341\end{equation}
342iteration $n$, from $n=0$ until convergence, do :
343\begin{multline} 
344R_{i,j}^n
345= A_{i,j}^{N}  {   \left\frac{\partial \psi}{\partial t } \right)    }_{i+1,j}^n
346+\,A_{i,j}^{E}  {   \left\frac{\partial \psi}{\partial t } \right)    }_{i,j+1} ^n     \\
347+\,A_{i,j}^{S}  {   \left\frac{\partial \psi}{\partial t } \right)    }_{i-1,j} ^{n+1}
348+\,A_{i,j}^{W} {   \left\frac{\partial \psi}{\partial t } \right)    }_{i,j-1} ^{n+1} 
349                   -  {   \left\frac{\partial \psi}{\partial t } \right)    }_{i,j}^n - \tilde B_{i,j}
350\end{multline}
351\begin{equation} 
352     {   \left\frac{\partial \psi}{\partial t } \right)    }_{i,j} ^{n+1}
353  = {   \left\frac{\partial \psi}{\partial t } \right)    }_{i,j} ^{n}
354  + \omega \;R_{i,j}^n
355\end{equation}
356where \textit{$\omega $ }satisfies $1\leq \omega \leq 2$. An optimal value exists for
357\textit{$\omega$} which significantly accelerates the convergence, but it has to be
358adjusted empirically for each model domain (except for a uniform grid where an
359analytical expression for \textit{$\omega$} can be found \citep{Richtmyer1967}).
360The value of $\omega$ is set using \textit{sor}, a \textbf{namelist} parameter.
361The convergence test is of the form:
362\begin{equation}
363\delta = \frac{\sum\limits_{i,j}{R_{i,j}^n}{R_{i,j}^n}}{\sum\limits_{i,j}{ \tilde B_{i,j}^n}{\tilde B_{i,j}^n}} \leq \epsilon
364\end{equation}
365where $\epsilon$ is the absolute precision that is required. It is recommended
366that a value smaller or equal to $10^{-3}$ is used for $\epsilon$ since larger
367values may lead to numerically induced basin scale barotropic oscillations. In fact,
368for an eddy resolving configuration or in a filtered free surface case, a value three
369orders of magnitude smaller than this should be used. The precision is specified by
370setting \textit{eps} (\textbf{namelist parameter}). In addition, two other tests are
371used to halt the iterative algorithm. They involve the number of iterations and the
372modulus of the right hand side. If the former exceeds a specified value, \textit{nmax} 
373(\textbf{namelist parameter}), or the latter is greater than $10^{15}$, the whole
374model computation is stopped and the last computed time step fields are saved
375in the standard output file. In both cases, this usually indicates that there is something
376wrong in the model configuration (an error in the mesh, the initial state, the input forcing,
377or the magnitude of the time step or of the mixing coefficients). A typical value of
378$nmax$ is a few hundred when $\epsilon = 10^{-6}$, increasing to a few
379thousand when $\epsilon = 10^{-12}$.
380The vectorization of the SOR algorithm is not straightforward. The scheme
381contains two linear recurrences on $i$ and $j$. This inhibits the vectorisation.
382Therefore it has been rewritten as:
383
384\begin{multline} 
385R_{i,j}^n
386= A_{i,j}^{N}  {   \left\frac{\partial \psi}{\partial t } \right)    }_{i+1,j}^n
387+\,A_{i,j}^{E}  {   \left\frac{\partial \psi}{\partial t } \right)    }_{i,j+1} ^n     \\
388+\,A_{i,j}^{S}  {   \left\frac{\partial \psi}{\partial t } \right)    }_{i-1,j} ^{n}
389+\,A_{i,j}^{W} {   \left\frac{\partial \psi}{\partial t } \right)    }_{i,j-1} ^{n} 
390                   -  {   \left\frac{\partial \psi}{\partial t } \right)    }_{i,j}^n - \tilde B_{i,j}
391\end{multline}
392
393\begin{equation}
394R_{i,j}^n = R_{i,j}^n - \omega \;A_{i,j}^{S}\; R_{i,j-1}^n
395\end{equation}
396
397\begin{equation}
398R_{i,j}^n = R_{i,j}^n - \omega \;A_{i,j}^{W}\; R_{i-1,j}^n
399\end{equation}
400
401The SOR method is very flexible and can be used under a wide
402range of conditions, including irregular boundaries, interior boundary
403points, etc. Proofs of convergence, etc. may be found in the standard
404numerical methods texts for partial differential equations.
405
406% -------------------------------------------------------------------------------------------------------------
407%       Preconditioned Conjugate Gradient
408% -------------------------------------------------------------------------------------------------------------
409\subsection{Preconditioned Conjugate Gradient}
410\label{MISC_solpcg}
411
412\begin{flushright}
413(\textit{nbsfs=1}, \textbf{namelist parameter})
414\end{flushright}
415
416\textbf{A} is a definite positive symmetric matrix, thus solving the linear
417system \eqref{Eq_solmat} is equivalent to the minimisation of a quadratic
418functional:
419\begin{equation*}
420\textbf{Ax} = \textbf{b} \leftrightarrow \textbf{x} =\text{inf}_{y} \,\phi (\textbf{y})
421\quad , \qquad
422\phi (\textbf{y}) = 1/2 \langle \textbf{Ay},\textbf{y}\rangle - \langle \textbf{b},\textbf{y} \rangle 
423\end{equation*}
424where $\langle , \rangle$ is the canonical dot product. The idea of the
425conjugate gradient method is to search for the solution in the following
426iterative way: assuming that $\textbf{x}^n$ has been obtained, $\textbf{x}^{n+1}$ 
427is found from $\textbf {x}^{n+1}={\textbf {x}}^n+\alpha^n{\textbf {d}}^n$ which satisfies:
428\begin{equation*}
429{\textbf{ x}}^{n+1}=\text{inf} _{{\textbf{ y}}={\textbf{ x}}^n+\alpha^n \,{\textbf{ d}}^n} \,\phi ({\textbf{ y}})\;\;\Leftrightarrow \;\;\frac{d\phi }{d\alpha}=0
430\end{equation*}
431and expressing $\phi (\textbf{y})$ as a function of \textit{$\alpha $}, we obtain the value that minimises the functional:
432\begin{equation*}
433\alpha ^n = \langle{ \textbf{r}^n , \textbf{r}^n} \rangle  / \langle {\textbf{ A d}^n, \textbf{d}^n} \rangle
434\end{equation*}
435where $\textbf{r}^n = \textbf{b}-\textbf{A x}^n = \textbf{A} (\textbf{x}-\textbf{x}^n)$ 
436is the error at rank $n$. The descent vector $\textbf{d}^n$ s chosen to be dependent
437on the error: $\textbf{d}^n = \textbf{r}^n + \beta^n \,\textbf{d}^{n-1}$. $\beta ^n$ 
438is searched such that the descent vectors form an orthogonal basis for the dot
439product linked to \textbf{A}. Expressing the condition
440$\langle \textbf{A d}^n, \textbf{d}^{n-1} \rangle = 0$ the value of $\beta ^n$ is found:
441 $\beta ^n = \langle{ \textbf{r}^n , \textbf{r}^n} \rangle  / \langle {\textbf{r}^{n-1}, \textbf{r}^{n-1}} \rangle$. As a result, the errors $ \textbf{r}^n$ form an orthogonal
442base for the canonic dot product while the descent vectors $\textbf{d}^n$ form
443an orthogonal base for the dot product linked to \textbf{A}. The resulting
444algorithm is thus the following one:
445
446initialisation :
447
448\begin{equation*} 
449\textbf{x}^0 = \left\frac{\partial \psi}{\partial t } \right)_{i,j}^0 = 2{\left\frac{\partial \psi}{\partial t } \right)_{i,j}^t} - {\left\frac{\partial \psi}{\partial t } \right)_{i,j}^{t-1}}
450, \text{the initial guess }
451\end{equation*}
452
453\begin{equation*}
454\textbf{r}^0 = \textbf{d}^0 = \textbf{b} - \textbf{A x}^0
455\end{equation*}
456\begin{equation*}
457\gamma_0 = \langle{ \textbf{r}^0 , \textbf{r}^0} \rangle
458\end{equation*}
459iteration $n,$ from $n=0$ until convergence, do :
460
461\begin{equation} 
462\begin{split}
463\text{z}^n& = \textbf{A d}^n \\
464\alpha_n &= \gamma_\langle{ \textbf{z}^n , \textbf{d}^n} \rangle \\
465\textbf{x}^{n+1} &= \textbf{x}^n + \alpha_n \,\textbf{d}^n \\
466\textbf{r}^{n+1} &= \textbf{r}^n - \alpha_n \,\textbf{z}^n \\
467\gamma_{n+1} &= \langle{ \textbf{r}^{n+1} , \textbf{r}^{n+1}} \rangle \\
468\beta_{n+1} &= \gamma_{n+1}/\gamma_{n}  \\
469\textbf{d}^{n+1} &= \textbf{r}^{n+1} + \beta_{n+1}\; \textbf{d}^{n}\\
470\end{split}
471\end{equation}
472
473
474The convergence test is:
475
476\begin{equation}
477\delta = \gamma_{n}\; / \langle{ \textbf{b} , \textbf{b}} \rangle \leq \epsilon
478\end{equation}
479where $\epsilon $ is the absolute precision that is required. As for the SOR algorithm,
480the whole model computation is stopped when the number of iterations, $nmax$, or
481the modulus of the right hand side of the convergence equation exceeds a
482specified value (see \S\ref{MISC_solsor} for a further discussion). The required
483precision and the maximum number of iterations allowed are specified by setting
484$eps$ and $nmax$ (\textbf{namelist parameters}).
485
486It can be demonstrated that the above algorithm is optimal, provides the exact
487solution in a number of iterations equal to the size of the matrix, and that
488the convergence rate is faster as the matrix is closer to the identity matrix,
489$i.e.$ its eigenvalues are closer to 1. Therefore, it is more efficient to solve a
490better conditioned system which has the same solution. For that purpose, we
491introduce a preconditioning matrix \textbf{Q} which is an approximation of
492\textbf{A} but much easier to invert than \textbf{A}, and solve the system:
493\begin{equation} \label{Eq_pmat}
494\textbf{Q}^{-1} \textbf{A x} = \textbf{Q}^{-1} \textbf{b}
495\end{equation}
496
497The same algorithm can be used to solve \eqref{Eq_pmat} if instead of the
498canonical dot product the following one is used:
499${\langle{ \textbf{a} , \textbf{b}} \rangle}_Q = \langle{ \textbf{a} , \textbf{Q b}} \rangle$,
500and if $\textbf{\~{b}} = \textbf{Q}^{-1}\;\textbf{b}$ and $\textbf{\~{A}} = \textbf{Q}^{-1}\;\textbf{A}$ are substituted to \textbf{b} and \textbf{A} \citep{Madec1988}.
501In \NEMO, \textbf{Q} is chosen as the diagonal of \textbf{ A}, i.e. the simplest form for \textbf{Q} so that it can be easily inverted. In this case, the discrete formulation of
502\eqref{Eq_pmat} is in fact given by \eqref{Eq_solmat_p} and thus the matrix and
503right hand side are computed independently from the solver used.
504
505% -------------------------------------------------------------------------------------------------------------
506%       FETI
507% -------------------------------------------------------------------------------------------------------------
508\subsection{FETI}
509\label{MISC_solfeti}
510
511FETI is a powerful solver that was developed by Marc Guyon  \citep{Guyon_al_EP99,
512Guyon_al_CalPar99}. It has been converted from Fortan 77 to 90, but never
513successfully tested after that.
514
515Its main advantage is to save a lot of CPU time when compared to the SOR or PCG
516algorithms. However, its main drawback is that the solution is dependent on the
517domain decomposition chosen. Using a different number of processors, the solution
518is the same at the precision required, but not the same at the computer precision.
519This make it hard to debug. 
520
521% -------------------------------------------------------------------------------------------------------------
522%       Boundary Conditions --- Islands
523% -------------------------------------------------------------------------------------------------------------
524\subsection{Boundary Conditions --- Islands (\key{islands})}
525\label{MISC_solisl}
526
527The boundary condition used for both recommended solvers is that the time
528derivative of the barotropic streamfunction is zero along all the coastlines.
529When islands are present in the model domain, additional computations must
530be performed to determine the barotropic streamfunction with the correct boundary
531conditions. This is detailed below.
532
533The model does not have specialised code for islands. They must instead be
534identified to the solvers by the user via bathymetry information, i.e. the $mbathy$ 
535array should equal $-1$ over the first island, $-2$ over the second, ... , $-N$ over
536the $N^{th}$ island. The model determines the position of island grid-points and
537defines a closed contour around each island which is used to compute the circulation
538around each one. The closed contour is formed from the ocean grid-points which
539are the closest to the island.
540
541First, the island barotropic streamfunctions $\psi_n$ are computed using the SOR
542or PCG scheme (they are solutions of \eqref{Eq_solmat} with the right-hand side
543equal to zero and with $\psi_n = 1$ along the island $n$ and $\psi_n = 0$ along
544the other coastlines) (Note that specifying $1$ as boundary condition on an island
545for $\psi$ is equivalent to defining a specific right hand side for \eqref{Eq_solmat}).
546The requested precision of this computation can be very high since it is only
547performed once. The absolute precision, $epsisl$, and the maximum number of
548iterations allowed, $nmisl$, are the \textbf{namelist parameters} used for this
549computation. Their typical values are $epsisl = 10^{-10}$ and $nmisl = 4000$.
550Then the island matrix A is computed from \eqref{Eq_PE_psi_isl_matrix} and
551inverted. At each time step, $\psi_0$, the solution of \eqref{Eq_solmat} with $\psi = 0$ 
552along all coastlines, is computed using either SOR or PCG. (It should
553be noted that the first guess of this computation remains as usual except that
554$\partial_t \psi_0$ is used, instead of $\partial_t \psi$. Indeed we are
555computing $\partial_t \psi_0$ which is usually very different from $\partial_t \psi$.)
556Then, it is easy to find the time evolution of the barotropic streamfunction on each
557island and to deduce $\partial_t \psi$, and to use it to compute the surface pressure
558gradient. Note that the value of the barotropic streamfunction itself is also computed
559as the time integration of $\partial_t \psi$ for further diagnostics.
560
561% ================================================================
562% Diagnostics
563% ================================================================
564\section{Diagnostics }
565\label{MISC_diag}
566
567% -------------------------------------------------------------------------------------------------------------
568%       Standard Model Output
569% -------------------------------------------------------------------------------------------------------------
570\subsection{Standard Model Output (default option or \key{dimg})}
571\label{MISC_iom}
572
573%to be updated with Seb documentation on the IO
574\amtcomment{ 
575The model outputs are of three types: the restart file, the output listing,
576and the output file(s). The restart file is used internally by the code when
577the user wants to start the model with initial conditions defined by a
578previous simulation. It contains all the information that is necessary in
579order for there to be no changes in the model results (even at the computer
580precision) between a run performed with several restarts and the same run
581performed in one step. It should be noted that this requires that the restart file
582contain two consecutive time steps for all the prognostic variables, and
583that it is saved in the same binary format as the one used by the computer
584that is to read it (in particular, 32 bits binary IEEE format must not be used for
585this file). The output listing and file(s) are predefined but should be checked
586and eventually adapted to the user's needs. The output listing is stored in
587the $ocean.output$ file. The information is printed from within the code on the
588logical unit $numout$. To locate these prints, use the UNIX command
589"$grep -i numout^*$" in the source code directory.
590
591In the standard configuration, the user will find the model results in two
592output files containing values for every time-step where output is demanded:
593a \textbf{VO} file containing all the three dimensional fields in logical unit
594$numwvo$, and a \textbf{SO} file containing all the two dimensional (horizontal)
595fields in logical unit $numwso$. These outputs are defined in the $diawri.F$ 
596routine. The default and user-selectable diagnostics are described in {\S}III-12-c.
597
598The default output (for all output files apart from the restart file) is 32 bits binary
599IEEE format, compatible with the Vairmer software (see the Climate Modelling
600and global Change team WEB server at CERFACS: http://www.cerfacs.fr).
601The model's reference directory also contains a visualisation tool based on
602\textbf{NCAR Graphics} (http://ngwww.ucar.edu). If a user has access to the
603NCAR software, he or she can copy the \textbf{LODMODEL/UTILS/OPADRA} 
604directory from the reference and, following the \textbf{README}, create
605graphical outputs from the model's results.
606}
607
608% -------------------------------------------------------------------------------------------------------------
609%       Tracer/Dynamics Trends
610% -------------------------------------------------------------------------------------------------------------
611\subsection{Tracer/Dynamics Trends (\key{trdlmd}, \key{diatrdtra}, \key{diatrddyn})}
612\label{MISC_tratrd}
613
614%to be udated this corresponds to OPA8
615When \key{diatrddyn} and/or \key{diatrddyn} cpp variables are defined, each
616trend of the dynamics and/or temperature and salinity time evolution equations
617is stored in three-dimensional arrays just after their computation ($i.e.$ at the end
618of each $dyn\cdots .F90$ and/or $tra\cdots .F90$ routine). These trends are then
619used in diagnostic routines $diadyn.F90$ and $diatra.F90$ respectively.
620In the standard model, these routines check the basin averaged properties of
621the momentum and tracer equations every \textit{ntrd } time-steps (\textbf{namelist
622parameter}). These routines are supplied as an example; they must be adapted by
623the user to his/her requirements.
624
625These two options imply the creation of several extra arrays in the in-core
626memory, increasing quite seriously the code memory requirements.
627
628% -------------------------------------------------------------------------------------------------------------
629%       On-line Floats trajectories
630% -------------------------------------------------------------------------------------------------------------
631\subsection{On-line Floats trajectories}
632\label{FLO}
633%--------------------------------------------namflo-------------------------------------------------------
634\namdisplay{namflo} 
635%--------------------------------------------------------------------------------------------------------------
636
637The on-line computation of floats adevected either by the three dimensional velocity
638field or constraint to remain at a given depth ($w = 0$ in the computation) have been introduced in the system during the CLIPPER project. The algorithm used is based on
639the work of \cite{Blanke_Raynaud_JPO97}. (see also the web site describing the off-line
640use of this marvellous diagnostic tool (http://stockage.univ-brest.fr/~grima/Ariane/).
641
642% -------------------------------------------------------------------------------------------------------------
643%       Other Diagnostics
644% -------------------------------------------------------------------------------------------------------------
645\subsection{Other Diagnostics}
646\label{MISC_diag_others}
647
648%To be updated  this mainly corresponds to OPA 8
649
650Aside from the standard model variables, other diagnostics can be computed
651on-line or can be added to the model. The available ready-to-add diagnostics
652routines can be found in directory DIA. Among the available diagnostics are:
653
654- the mixed layer depth (based on a density criterion) (\mdl{diamxl})
655
656- the turbocline depth (based on a turbulent mixing coefficient criterion) (\mdl{diamxl})
657
658- the depth of the 20\r{}C isotherm (\mdl{diahth})
659
660- the depth of the thermocline (maximum of the vertical temperature gradient) (\mdl{diahth})
661
662- the meridional heat and salt transports and their decomposition (\mdl{diamfl})
663
664- the surface pressure (\mdl{diaspr})
Note: See TracBrowser for help on using the repository browser.