source: trunk/libIGCM/AA_pack_output @ 1206

Last change on this file since 1206 was 1206, checked in by sdipsl, 9 years ago
  • Remove IGCM_debug_Verif_Exit_Post. Only IGCM_debug_Verif_Exit will manage exit cases.
  • Property licence set to
    The following licence information concerns ONLY the libIGCM tools
    ==================================================================

    Copyright © Centre National de la Recherche Scientifique CNRS
    Commissariat à l'Énergie Atomique CEA

    libIGCM : Library for Portable Models Computation of IGCM Group.

    IGCM Group is the french IPSL Global Climate Model Group.

    This library is a set of shell scripts and functions whose purpose is
    the management of the initialization, the launch, the transfer of
    output files, the post-processing and the monitoring of datas produce
    by any numerical program on any plateforme.

    This software is governed by the CeCILL license under French law and
    abiding by the rules of distribution of free software. You can use,
    modify and/ or redistribute the software under the terms of the CeCILL
    license as circulated by CEA, CNRS and INRIA at the following URL
    "http://www.cecill.info".

    As a counterpart to the access to the source code and rights to copy,
    modify and redistribute granted by the license, users are provided only
    with a limited warranty and the software's author, the holder of the
    economic rights, and the successive licensors have only limited
    liability.

    In this respect, the user's attention is drawn to the risks associated
    with loading, using, modifying and/or developing or reproducing the
    software by the user in light of its specific status of free software,
    that may mean that it is complicated to manipulate, and that also
    therefore means that it is reserved for developers and experienced
    professionals having in-depth computer knowledge. Users are therefore
    encouraged to load and test the software's suitability as regards their
    requirements in conditions enabling the security of their systems and/or
    data to be ensured and, more generally, to use and operate it in the
    same conditions as regards security.

    The fact that you are presently reading this means that you have had
    knowledge of the CeCILL license and that you accept its terms.
  • Property svn:keywords set to Revision Author Date
File size: 12.4 KB
Line 
1#-Q- curie ######################
2#-Q- curie ## CURIE   TGCC/CEA ##
3#-Q- curie ######################
4#-Q- curie #MSUB -r PACKOUTPUT     # Nom du job
5#-Q- curie #MSUB -eo
6#-Q- curie #MSUB -n 1              # Reservation du processus
7#-Q- curie #MSUB -T 36000          # Limite de temps elapsed du job
8#-Q- curie #MSUB -q ::default_node::
9#-Q- curie #MSUB -Q normal
10#-Q- curie #MSUB -A ::default_project::
11#-Q- curie set +x
12#-Q- ada #!/bin/ksh
13#-Q- ada #######################
14#-Q- ada ## ADA         IDRIS ##
15#-Q- ada #######################
16#-Q- ada # @ job_type = serial
17#-Q- ada # @ requirements = (Feature == "prepost")
18#-Q- ada # Temps Elapsed max. d'une requete hh:mm:ss
19#-Q- ada # @ wall_clock_limit = 10:00:00
20#-Q- ada # Nom du travail LoadLeveler
21#-Q- ada # @ job_name   = PACKOUTPUT
22#-Q- ada # Fichier de sortie standard du travail
23#-Q- ada # @ output     = $(job_name).$(jobid)
24#-Q- ada # Fichier de sortie d'erreur du travail
25#-Q- ada # @ error      =  $(job_name).$(jobid)
26#-Q- ada # pour recevoir un mail en cas de depassement du temps Elapsed (ou autre pb.)
27#-Q- ada # @ notification = error
28#-Q- ada # @ environment  = $DEBUG_debug ; $BigBrother ; $MODIPSL ; $libIGCM ; $libIGCM_SX ; $POST_DIR ; $Script_Post_Output ; $SUBMIT_DIR ; $DateBegin ; $DateEnd ; $PeriodPack ; $StandAlone ; $MASTER ; wall_clock_limit=$(wall_clock_limit)
29#-Q- ada # @ queue
30#-Q- lxiv8 ######################
31#-Q- lxiv8 ## OBELIX      LSCE ##
32#-Q- lxiv8 ######################
33#-Q- lxiv8 #PBS -N PACKOUTPUT
34#-Q- lxiv8 #PBS -m a
35#-Q- lxiv8 #PBS -j oe
36#-Q- lxiv8 #PBS -q medium
37#-Q- lxiv8 #PBS -o PACKOUTPUT.$$
38#-Q- lxiv8 #PBS -S /bin/ksh
39#-Q- ifort_CICLAD ######################
40#-Q- ifort_CICLAD ##   CICLAD    IPSL ##
41#-Q- ifort_CICLAD ######################
42#-Q- ifort_CICLAD #PBS -N PACKOUTPUT
43#-Q- ifort_CICLAD #PBS -m a
44#-Q- ifort_CICLAD #PBS -j oe
45#-Q- ifort_CICLAD #PBS -q std
46#-Q- ifort_CICLAD #PBS -S /bin/ksh
47#-Q- default #!/bin/ksh
48#-Q- default ##################
49#-Q- default ## DEFAULT HOST ##
50#-Q- default ##################
51
52#**************************************************************
53# Author: Sebastien Denvil
54# Contact: Sebastien.Denvil__at__ipsl.jussieu.fr
55# $Revision::                                          $ Revision of last commit
56# $Author::                                            $ Author of last commit
57# $Date::                                              $ Date of last commit
58# IPSL (2006)
59#  This software is governed by the CeCILL licence see libIGCM/libIGCM_CeCILL.LIC
60#
61#**************************************************************
62
63#set -eu
64#set -vx
65
66date
67
68#D- Task type (computing or post-processing)
69TaskType=post-processing
70
71########################################################################
72
73#D- Flag to determine if this job in a standalone mode
74#D- Default : value from AA_job if any
75StandAlone=${StandAlone:=true}
76
77#D- Path to libIGCM
78#D- Default : value from AA_job if any
79# WARNING For StandAlone use : To run this script on some machine (ulam and cesium)
80# WARNING you must check MirrorlibIGCM variable in sys library.
81# WARNING If this variable is true, you must use libIGCM_POST path instead
82# WARNING of your running libIGCM directory.
83libIGCM=${libIGCM:=::modipsl::/libIGCM}
84
85#-D- $hostname of the MASTER job when SUBMIT_DIR is not visible on postprocessing computer.
86MASTER=${MASTER:=ada|curie}
87
88#D- Flag to determine begin date for restart pack
89#D- Default : value from AA_job if any
90DateBegin=${DateBegin:=20000101}
91
92#D- Flag to determine end date for restart pack
93#D- Default : value from AA_job if any
94DateEnd=${DateEnd:=20691231}
95
96#D- Flag to determine pack period
97#D- Default : value from AA_job if any
98PeriodPack=${PeriodPack:=10Y}
99
100#D- Uncomment to run interactively
101#D- For testing purpose, will be remove
102#SUBMIT_DIR=${PWD}
103#RUN_DIR_PATH=${SCRATCHDIR}/Pack_Test
104
105#D- Increased verbosity (1, 2, 3)
106#D- Default : value from AA_job if any
107Verbosity=${Verbosity:=3}
108
109#D- Low level debug : to bypass lib test checks and stack construction
110#D- Default : value from AA_job if any
111DEBUG_debug=${DEBUG_debug:=false}
112
113########################################################################
114
115. ${libIGCM}/libIGCM_debug/libIGCM_debug.ksh
116. ${libIGCM}/libIGCM_card/libIGCM_card.ksh
117. ${libIGCM}/libIGCM_date/libIGCM_date.ksh
118#-------
119. ${libIGCM}/libIGCM_sys/libIGCM_sys.ksh
120. ${libIGCM}/libIGCM_config/libIGCM_config.ksh
121. ${libIGCM}/libIGCM_post/libIGCM_post.ksh
122#-------
123RUN_DIR=${RUN_DIR_PATH}
124IGCM_sys_MkdirWork ${RUN_DIR}
125IGCM_sys_Cd ${RUN_DIR}
126#-------
127( ${DEBUG_debug} ) && IGCM_debug_Check
128( ${DEBUG_debug} ) && IGCM_card_Check
129( ${DEBUG_debug} ) && IGCM_date_Check
130
131########################################################################
132
133#set -vx
134
135# ------------------------------------------------------------------
136# Test if all was right before proceeding further
137# ------------------------------------------------------------------
138IGCM_debug_Verif_Exit
139
140if [ ${StandAlone} = true ] ; then
141    CARD_DIR=${SUBMIT_DIR}
142else
143    CARD_DIR=${RUN_DIR_PATH}
144    IGCM_sys_Get_Master ${SUBMIT_DIR}/config.card ${RUN_DIR_PATH}
145    IGCM_sys_Get_Master ${SUBMIT_DIR}/run.card    ${RUN_DIR_PATH}
146    IGCM_sys_Get_Master ${SUBMIT_DIR}/COMP        ${RUN_DIR_PATH}
147    IGCM_sys_Get_Master ${SUBMIT_DIR}/POST        ${RUN_DIR_PATH}
148fi
149
150#==================================
151# First of all
152#
153# Read libIGCM compatibility version in config.card
154# Read UserChoices section
155# Read Ensemble section
156# Read Post section
157# Define all netcdf output directories
158#==================================
159IGCM_config_CommonConfiguration ${CARD_DIR}/config.card
160
161# ------------------------------------------------------------------
162# Activate BigBrother so as to supervise this job
163# ------------------------------------------------------------------
164IGCM_debug_BigBro_Initialize
165
166#==================================
167# Read ListOfComponents section
168# to drive the loop over find
169IGCM_card_DefineArrayFromSection ${CARD_DIR}/config.card ListOfComponents
170
171#==================================
172# Test and set up directories
173#==================================
174IGCM_sys_TestDirArchive ${R_SAVE}
175[ $? != 0 ] && IGCM_debug_Exit "IGCM_sys_TestDirArchive"
176
177# Where to store used file list /!\ TEMPORARY /!\
178STORE_DEBUG=${R_SAVE}/DEBUG
179
180# Switch to script variables meaning (try to be compatible with ipsl_pack TGCC moving procedure)
181JobName=${config_UserChoices_JobName}
182echo $JobName $DateBegin $DateEnd
183
184# ------------------------------------------------------------------
185# Test if all was right before proceeding further
186# ------------------------------------------------------------------
187IGCM_debug_Verif_Exit
188
189IGCM_debug_Print 1 "Check coherence between PackFrequency and PeriodLength"
190IGCM_post_CheckModuloFrequency PeriodPack config_UserChoices_PeriodLength NbPeriodPerFrequency
191# ------------------------------------------------------------------
192# Test if all was right before proceeding further
193# ------------------------------------------------------------------
194IGCM_debug_Verif_Exit
195
196IGCM_debug_Print 1 "We must process ${NbPeriodPerFrequency} files for each pack"
197
198# Init loop
199date_begin_pack=${DateBegin}
200date_end_simulation=${DateEnd}
201number_pack=1
202
203IGCM_debug_PrintVariables 3 date_begin_pack
204IGCM_debug_PrintVariables 3 date_end_simulation
205
206while [ ${date_begin_pack} -le ${date_end_simulation} ] ; do
207
208  IGCM_debug_PrintVariables 3 number_pack
209  DaysTemp=$( IGCM_date_DaysInCurrentPeriod ${date_begin_pack} ${PeriodPack} )
210  date_end_pack=$( IGCM_date_AddDaysToGregorianDate ${date_begin_pack} $(( ${DaysTemp} - 1 )) )
211
212  for comp in ${config_ListOfComponents[*]} ; do
213    dirList=$( find ${R_BUFR}/${comp}/Output -maxdepth 1 -mindepth 1 -type d )
214    for dir in ${dirList} ; do
215      # dirID is like ATM.Output.MO
216      dirID=$( echo $dir | sed "s:${R_BUFR}/::" | sed "s:/:.:g" )
217      # Sort what's in the directory
218      find ${dir} -type f -name "${JobName}*.nc" -ls | sort -k 11 > liste_files.${dirID}.txt
219      # How much file type. Example : 1M_histmthCOSP.nc, 1M_histmth.nc, 1M_histmthNMC.nc, 1M_paramLMDZ_phy.nc
220      # /!\ fileType include the .nc extension /!\
221      fileType=$( gawk '{print $11}' liste_files.${dirID}.txt | gawk -F$dir/ '{print $2}' | sed "s:${JobName}_[0-9]\{8,9\}_[0-9]\{8,9\}_::g" | sort | uniq )
222      # Loop over the file type and pack them when in between date_begin_pack and date_end_pack
223      for myType in ${fileType} ; do
224        grep ${myType} liste_files.${dirID}.txt > liste_files.${dirID}.${myType}.txt
225        nbfile=0
226        for file in $( gawk '{print $11}' liste_files.${dirID}.${myType}.txt ); do
227          extract_date_file=$( echo ${file}  | sed -e "s/.*${JobName}_[0-9]*_//" )
228          date_file=$( echo ${extract_date_file} | sed 's/\([0-9]\{8\}\)_.*$/\1/g' )
229          # echo pack n°${number_pack}  ${date_file} ${date_begin_pack} ${date_end_pack}
230          if [ ${date_file} -le ${date_end_pack} ] && [ ${date_file} -ge ${date_begin_pack} ] ; then
231            echo ${file} >> liste_pack_${myType}_${date_begin_pack}_${date_end_pack}.txt
232            ncdump -h ${file} | grep -E 'float|double' | cut -f 1 -d '(' | cut -f 2 -d ' ' >> liste_variables_${myType}_${date_begin_pack}_${date_end_pack}.txt
233            (( nbfile = nbfile + 1 ))
234          fi
235        done
236
237        if [ ${nbfile} = 0 ] ; then
238          IGCM_debug_Print 1 "We found no file to process"
239          IGCM_debug_Print 1 "We should have found ${NbPeriodPerFrequency} files"
240          IGCM_debug_Print 1 "As some files can be produced only for some selected period we consider we can move to the next file type"
241          continue
242        fi
243
244        # Select list of variables to work with
245        list_var=$( cat liste_variables_${myType}_${date_begin_pack}_${date_end_pack}.txt | sort | uniq -c | awk -v nbfile=$nbfile '{if ($1 != nbfile) {print $2}}' | paste -s -d ',' )
246        liste_file_tmp=$( for i in $( cat liste_pack_${myType}_${date_begin_pack}_${date_end_pack}.txt ) ; do basename $i ; done )
247        # Create packed files
248        IGCM_debug_Print 1 "Ncrcat ongoing for ${dir} and ${myType}"
249        if [ ! ${nbfile} = ${NbPeriodPerFrequency} ] ; then
250          IGCM_debug_Print 1 "Number of files to process is not equal to what it should be"
251          IGCM_debug_Print 1 "We found ${nbfile} files and it should have been ${NbPeriodPerFrequency} files"
252          IGCM_debug_Exit "ERROR in number of files to process. STOP HERE INCLUDING THE COMPUTING JOB"
253          IGCM_debug_Verif_Exit
254        fi
255        output=${JobName}_${date_begin_pack}_${date_end_pack}_${myType}
256        #cat liste_pack_${myType}_${date_begin_pack}_${date_end_pack}.txt | xargs ncrcat -v ${list_var} -o ${output}
257        if [ X${list_var} = X ] ; then
258          IGCM_sys_ncrcat -p ${dir} ${liste_file_tmp} --output ${output}
259        else
260          IGCM_sys_ncrcat -x -v ${list_var} -p ${dir} ${liste_file_tmp} --output ${output}
261        fi
262        # ------------------------------------------------------------------
263        # Test if all was right before proceeding further
264        # ------------------------------------------------------------------
265        IGCM_debug_Verif_Exit
266        # Save it
267        IGCM_sys_Put_Out ${output} ${R_SAVE}/$( echo $dir | sed "s:${R_BUFR}/::" )/${output}
268        # Clean file produced by ncrcat
269        IGCM_sys_Rm ${output}
270        # ------------------------------------------------------------------
271        # Test if all was right before proceeding further
272        # ------------------------------------------------------------------
273        IGCM_debug_Verif_Exit
274        # Clean files used by ncrcat
275        cat liste_pack_${myType}_${date_begin_pack}_${date_end_pack}.txt | xargs rm
276        # Save the list of files that has been pack (ncrcat)
277        #mv liste_pack_${myType}_${date_begin_pack}_${date_end_pack}.txt ${STORE_DEBUG}
278        IGCM_debug_Print 1 "Ncrcat and cleaning done for ${dir} and ${myType}"
279        echo
280      done
281    done
282  done
283  (( number_pack = number_pack + 1 ))
284  # Add 1 day to date_end_pack to have the new date_begin_pack
285  date_begin_pack=$( IGCM_date_AddDaysToGregorianDate ${date_end_pack} 1 )
286done
287
288# Flush post-processing submission
289if [ -f ${R_BUFR}/FlushPost_${DateEnd}.ksh ] ; then
290  . ${R_BUFR}/FlushPost_${DateEnd}.ksh
291  IGCM_FlushPost
292  #IGCM_sys_Rm -f ${R_BUFR}/FlushPost_${DateEnd}.ksh
293fi
294
295# Clean RUN_DIR_PATH (necessary for cesium and titane only)
296IGCM_sys_RmRunDir -Rf ${RUN_DIR_PATH}
297
298# ------------------------------------------------------------------
299# Finalize BigBrother to inform that the jobs end
300# ------------------------------------------------------------------
301IGCM_debug_BigBro_Finalize
302
303date
Note: See TracBrowser for help on using the repository browser.