source: trunk/libIGCM/AA_pack_output @ 1451

Last change on this file since 1451 was 1448, checked in by jgipsl, 6 years ago

Change in headers for irene: temporarily remove option -m and use a workaround given by the TGCC. This avoids problems with resubmission from within a job when the main job is submitted from the workdir.

  • Property licence set to
    The following licence information concerns ONLY the libIGCM tools
    ==================================================================

    Copyright © Centre National de la Recherche Scientifique CNRS
    Commissariat à l'Énergie Atomique CEA

    libIGCM : Library for Portable Models Computation of IGCM Group.

    IGCM Group is the French IPSL Global Climate Model Group.

    This library is a set of shell scripts and functions whose purpose is
    the management of the initialization, the launch, the transfer of
    output files, the post-processing and the monitoring of data produced
    by any numerical program on any platform.

    This software is governed by the CeCILL license under French law and
    abiding by the rules of distribution of free software. You can use,
    modify and/ or redistribute the software under the terms of the CeCILL
    license as circulated by CEA, CNRS and INRIA at the following URL
    "http://www.cecill.info".

    As a counterpart to the access to the source code and rights to copy,
    modify and redistribute granted by the license, users are provided only
    with a limited warranty and the software's author, the holder of the
    economic rights, and the successive licensors have only limited
    liability.

    In this respect, the user's attention is drawn to the risks associated
    with loading, using, modifying and/or developing or reproducing the
    software by the user in light of its specific status of free software,
    that may mean that it is complicated to manipulate, and that also
    therefore means that it is reserved for developers and experienced
    professionals having in-depth computer knowledge. Users are therefore
    encouraged to load and test the software's suitability as regards their
    requirements in conditions enabling the security of their systems and/or
    data to be ensured and, more generally, to use and operate it in the
    same conditions as regards security.

    The fact that you are presently reading this means that you have had
    knowledge of the CeCILL license and that you accept its terms.
  • Property svn:keywords set to Revision Author Date
File size: 13.0 KB
#-Q- curie ######################
#-Q- curie ## CURIE   TGCC/CEA ##
#-Q- curie ######################
#-Q- curie #MSUB -r PACKOUTPUT     # Job name
#-Q- curie #MSUB -eo
#-Q- curie #MSUB -n 1              # Process reservation
#-Q- curie #MSUB -T 36000          # Elapsed time limit of the job
#-Q- curie #MSUB -q ::default_node::
#-Q- curie #MSUB -c ::default_core::
#-Q- curie #MSUB -Q normal
#-Q- curie #MSUB -A ::default_project::
#-Q- curie set +x
#-Q- irene ######################
#-Q- irene ## IRENE   TGCC/CEA ##
#-Q- irene ######################
#-Q- irene #MSUB -r PACKOUTPUT     # Job name
#-Q- irene #MSUB -eo
#-Q- irene #MSUB -n 1              # Number of cores
#-Q- irene #MSUB -T 36000          # Maximum elapsed time
#-Q- irene #MSUB -q skylake
#-Q- irene #MSUB -c 4
#-Q- irene #MSUB -Q normal
#-Q- irene #MSUB -A ::default_project::
#-Q- irene ###MSUB -m store,work,scratch
#-Q- irene #MSUB -E '--licenses=fs_unshare,fs_work,fs_store,fs_scratch'
#-Q- irene set +x
#-Q- ada #!/bin/ksh
#-Q- ada #######################
#-Q- ada ## ADA         IDRIS ##
#-Q- ada #######################
#-Q- ada # @ job_type = mpich
#-Q- ada # @ requirements = (Feature == "prepost")
#-Q- ada # Maximum elapsed time of a request hh:mm:ss
#-Q- ada # @ wall_clock_limit = 10:00:00
#-Q- ada # Memory required for ncrcat
#-Q- ada # @ as_limit = 30Gb
#-Q- ada # LoadLeveler job name
#-Q- ada # @ job_name   = PACKOUTPUT
#-Q- ada # Standard output file of the job
#-Q- ada # @ output     = $(job_name).$(jobid)
#-Q- ada # Standard error file of the job
#-Q- ada # @ error      =  $(job_name).$(jobid)
#-Q- ada # to receive an e-mail if the elapsed time limit is exceeded (or on any other problem)
#-Q- ada # @ notification = error
#-Q- ada # @ environment  = $DEBUG_debug ; $BigBrother ; $postProcessingStopLevel ; $MODIPSL ; $libIGCM ; $libIGCM_SX ; $POST_DIR ; $Script_Post_Output ; $SUBMIT_DIR ; $DateBegin ; $DateEnd ; $PeriodPack ; $StandAlone ; $MASTER ; wall_clock_limit=$(wall_clock_limit)
#-Q- ada # @ queue
#-Q- lxiv8 ######################
#-Q- lxiv8 ## OBELIX      LSCE ##
#-Q- lxiv8 ######################
#-Q- lxiv8 #PBS -N PACKOUTPUT
#-Q- lxiv8 #PBS -m a
#-Q- lxiv8 #PBS -j oe
#-Q- lxiv8 #PBS -q medium
#-Q- lxiv8 #PBS -o PACKOUTPUT.$$
#-Q- lxiv8 #PBS -S /bin/ksh
#-Q- ifort_CICLAD ######################
#-Q- ifort_CICLAD ##   CICLAD    IPSL ##
#-Q- ifort_CICLAD ######################
#-Q- ifort_CICLAD #PBS -N PACKOUTPUT
#-Q- ifort_CICLAD #PBS -m a
#-Q- ifort_CICLAD #PBS -j oe
#-Q- ifort_CICLAD #PBS -q std
#-Q- ifort_CICLAD #PBS -S /bin/ksh
#-Q- default #!/bin/ksh
#-Q- default ##################
#-Q- default ## DEFAULT HOST ##
#-Q- default ##################

#**************************************************************
# Author: Sebastien Denvil
# Contact: Sebastien.Denvil__at__ipsl.jussieu.fr
# $Revision::                                          $ Revision of last commit
# $Author::                                            $ Author of last commit
# $Date::                                              $ Date of last commit
# IPSL (2006)
#  This software is governed by the CeCILL licence see libIGCM/libIGCM_CeCILL.LIC
#
#**************************************************************

#set -eu
#set -vx

date

#D- Task type DO NOT CHANGE (computing, post-processing or checking)
TaskType=post-processing

########################################################################

#D- Flag to determine if this job runs in standalone mode
#D- Default : value from AA_job if any
StandAlone=${StandAlone:=true}

#D- Path to libIGCM
#D- Default : value from AA_job if any
# WARNING For StandAlone use : To run this script on some machines (ulam and cesium)
# WARNING you must check the MirrorlibIGCM variable in the sys library.
# WARNING If this variable is true, you must use the libIGCM_POST path instead
# WARNING of your running libIGCM directory.
libIGCM=${libIGCM:=::modipsl::/libIGCM}

#-D- $hostname of the MASTER job when SUBMIT_DIR is not visible on the post-processing computer.
MASTER=${MASTER:=ada|curie}

#D- Flag to determine begin date for restart pack
#D- Default : value from AA_job if any
DateBegin=${DateBegin:=20000101}

#D- Flag to determine end date for restart pack
#D- Default : value from AA_job if any
DateEnd=${DateEnd:=20691231}

#D- Flag to determine pack period
#D- Default : value from AA_job if any
PeriodPack=${PeriodPack:=10Y}

#D- Uncomment to run interactively
#D- For testing purposes, will be removed
#SUBMIT_DIR=${PWD}
#RUN_DIR_PATH=${SCRATCHDIR}/Pack_Test

#D- Increased verbosity (1, 2, 3)
#D- Default : value from AA_job if any
Verbosity=${Verbosity:=3}

#D- Low level debug : to bypass lib test checks and stack construction
#D- Default : value from AA_job if any
DEBUG_debug=${DEBUG_debug:=false}

########################################################################

. ${libIGCM}/libIGCM_debug/libIGCM_debug.ksh
. ${libIGCM}/libIGCM_card/libIGCM_card.ksh
. ${libIGCM}/libIGCM_date/libIGCM_date.ksh
#-------
. ${libIGCM}/libIGCM_sys/libIGCM_sys.ksh
. ${libIGCM}/libIGCM_config/libIGCM_config.ksh
. ${libIGCM}/libIGCM_post/libIGCM_post.ksh
#-------
RUN_DIR=${RUN_DIR_PATH}
IGCM_sys_MkdirWork ${RUN_DIR}
IGCM_sys_Cd ${RUN_DIR}
#-------
( ${DEBUG_debug} ) && IGCM_debug_Check
( ${DEBUG_debug} ) && IGCM_card_Check
( ${DEBUG_debug} ) && IGCM_date_Check

########################################################################

#set -vx

# ------------------------------------------------------------------
# Check that everything went well before proceeding further
# ------------------------------------------------------------------
IGCM_debug_Verif_Exit

if [ ${StandAlone} = true ] ; then
    CARD_DIR=${SUBMIT_DIR}
else
    CARD_DIR=${RUN_DIR_PATH}
    IGCM_sys_Get_Master ${SUBMIT_DIR}/config.card ${RUN_DIR_PATH}
    IGCM_sys_Get_Master ${SUBMIT_DIR}/run.card    ${RUN_DIR_PATH}
    IGCM_sys_Get_Master ${SUBMIT_DIR}/COMP        ${RUN_DIR_PATH}
    IGCM_sys_Get_Master ${SUBMIT_DIR}/POST        ${RUN_DIR_PATH}
fi

#==================================
# First of all
#
# Read libIGCM compatibility version in config.card
# Read UserChoices section
# Read Ensemble section
# Read Post section
# Define all netcdf output directories
#==================================
IGCM_config_CommonConfiguration ${CARD_DIR}/config.card

# ------------------------------------------------------------------
# Activate BigBrother so as to supervise this job
# ------------------------------------------------------------------
IGCM_debug_BigBro_Initialize

#==================================
# Read ListOfComponents section
# to drive the loop over find
IGCM_card_DefineArrayFromSection ${CARD_DIR}/config.card ListOfComponents

#==================================
# Test and set up directories
#==================================
IGCM_sys_TestDirArchive ${R_SAVE}
[ $? != 0 ] && IGCM_debug_Exit "IGCM_sys_TestDirArchive"

# Where to store the list of used files /!\ TEMPORARY /!\
STORE_DEBUG=${R_SAVE}/DEBUG

# Switch to the script's variable names (try to be compatible with the ipsl_pack TGCC moving procedure)
JobName=${config_UserChoices_JobName}
echo $JobName $DateBegin $DateEnd

# ------------------------------------------------------------------
# Check that everything went well before proceeding further
# ------------------------------------------------------------------
IGCM_debug_Verif_Exit

IGCM_debug_Print 1 "Check coherence between PackFrequency and PeriodLength"
IGCM_post_CheckModuloFrequency PeriodPack config_UserChoices_PeriodLength NbPeriodPerFrequency
# ------------------------------------------------------------------
# Check that everything went well before proceeding further
# ------------------------------------------------------------------
IGCM_debug_Verif_Exit

IGCM_debug_Print 1 "We must process ${NbPeriodPerFrequency} files for each pack"

# Init loop
date_begin_pack=${DateBegin}
date_end_simulation=${DateEnd}
number_pack=1

IGCM_debug_PrintVariables 3 date_begin_pack
IGCM_debug_PrintVariables 3 date_end_simulation

while [ ${date_begin_pack} -le ${date_end_simulation} ] ; do

  IGCM_debug_PrintVariables 3 number_pack
  # The pack covers PeriodPack starting at date_begin_pack (both bounds included)
  DaysTemp=$( IGCM_date_DaysInCurrentPeriod ${date_begin_pack} ${PeriodPack} )
  date_end_pack=$( IGCM_date_AddDaysToGregorianDate ${date_begin_pack} $(( ${DaysTemp} - 1 )) )

  for comp in ${config_ListOfComponents[*]} ; do
    dirList=$( find ${R_BUFR}/${comp}/Output -maxdepth 1 -mindepth 1 -type d )
    for dir in ${dirList} ; do
      # dirID is like ATM.Output.MO
      dirID=$( echo $dir | sed "s:${R_BUFR}/::" | sed "s:/:.:g" )
      # Sort what's in the directory (field 11 of the "find -ls" output is the file path)
      find ${dir} -type f -name "${JobName}*.nc" -ls | sort -k 11 > liste_files.${dirID}.txt
      # How many file types. Example : 1M_histmthCOSP.nc, 1M_histmth.nc, 1M_histmthNMC.nc, 1M_paramLMDZ_phy.nc
      # /!\ fileType includes the .nc extension /!\
      fileType=$( gawk '{print $11}' liste_files.${dirID}.txt | gawk -F$dir/ '{print $2}' | sed "s:${JobName}_[0-9]\{8,9\}_[0-9]\{8,9\}_::g" | sort | uniq )
      # Loop over the file types and pack the files whose date lies between date_begin_pack and date_end_pack
      for myType in ${fileType} ; do
        # -F : match the file type literally (it contains dots)
        grep -F "${myType}" liste_files.${dirID}.txt > liste_files.${dirID}.${myType}.txt
        nbfile=0
        for file in $( gawk '{print $11}' liste_files.${dirID}.${myType}.txt ); do
          # Strip everything up to and including ${JobName}_<begin date>_
          extract_date_file=$( echo ${file}  | sed -e "s/.*${JobName}_[0-9]*_//" )
          # Keep the leading 8 digits : the end date of the period covered by the file
          date_file=$( echo ${extract_date_file} | sed 's/\([0-9]\{8\}\)_.*$/\1/g' )
          # echo pack n°${number_pack}  ${date_file} ${date_begin_pack} ${date_end_pack}
          if [ ${date_file} -le ${date_end_pack} ] && [ ${date_file} -ge ${date_begin_pack} ] ; then
            echo ${file} >> liste_pack_${myType}_${date_begin_pack}_${date_end_pack}.txt
            ncdump -h ${file} | grep -E 'float|double' | cut -f 1 -d '(' | cut -f 2 -d ' ' >> liste_variables_${myType}_${date_begin_pack}_${date_end_pack}.txt
            (( nbfile = nbfile + 1 ))
          fi
        done

        if [ ${nbfile} = 0 ] ; then
          IGCM_debug_Print 1 "We found no file to process"
          IGCM_debug_Print 1 "We should have found ${NbPeriodPerFrequency} files"
          IGCM_debug_Print 1 "As some files can be produced only for some selected period we consider we can move to the next file type"
          continue
        fi

        # Select the list of variables to work with :
        # variables that are not present in every file of the pack (count differs from nbfile) will be excluded
        list_var=$( cat liste_variables_${myType}_${date_begin_pack}_${date_end_pack}.txt | sort | uniq -c | awk -v nbfile=$nbfile '{if ($1 != nbfile) {print $2}}' | paste -s -d ',' )
        liste_file_tmp=$( for i in $( cat liste_pack_${myType}_${date_begin_pack}_${date_end_pack}.txt ) ; do basename $i ; done )
        # Create packed files
        IGCM_debug_Print 1 "Ncrcat ongoing for ${dir} and ${myType}"
        if [ ! ${nbfile} = ${NbPeriodPerFrequency} ] ; then
          IGCM_debug_Print 1 "Number of files to process is not equal to what it should be"
          IGCM_debug_Print 1 "We found ${nbfile} files and it should have been ${NbPeriodPerFrequency} files"
          IGCM_debug_Exit "ERROR in number of files to process. STOP HERE INCLUDING THE COMPUTING JOB"
          IGCM_debug_Verif_Exit
        fi
        output=${JobName}_${date_begin_pack}_${date_end_pack}_${myType}
        #cat liste_pack_${myType}_${date_begin_pack}_${date_end_pack}.txt | xargs ncrcat -v ${list_var} -o ${output}
        if [ X${list_var} = X ] ; then
          # No variable to exclude : concatenate everything
          IGCM_sys_ncrcat -p ${dir} ${liste_file_tmp} --output ${output}
        else
          # Exclude (-x) the variables listed with -v
          IGCM_sys_ncrcat -x -v ${list_var} -p ${dir} ${liste_file_tmp} --output ${output}
        fi
        # ------------------------------------------------------------------
        # Check that everything went well before proceeding further
        # ------------------------------------------------------------------
        IGCM_debug_Verif_Exit
        # Save it
        IGCM_sys_Put_Out ${output} ${R_SAVE}/$( echo $dir | sed "s:${R_BUFR}/::" )/${output}
        # Clean file produced by ncrcat
        IGCM_sys_Rm ${output}
        # ------------------------------------------------------------------
        # Check that everything went well before proceeding further
        # ------------------------------------------------------------------
        IGCM_debug_Verif_Exit
        # Clean files used by ncrcat
        cat liste_pack_${myType}_${date_begin_pack}_${date_end_pack}.txt | xargs rm
        # Save the list of files that have been packed (ncrcat)
        #mv liste_pack_${myType}_${date_begin_pack}_${date_end_pack}.txt ${STORE_DEBUG}
        IGCM_debug_Print 1 "Ncrcat and cleaning done for ${dir} and ${myType}"
        echo
      done
    done
  done
  (( number_pack = number_pack + 1 ))
  # Add 1 day to date_end_pack to have the new date_begin_pack
  date_begin_pack=$( IGCM_date_AddDaysToGregorianDate ${date_end_pack} 1 )
done

# Flush post-processing submission
if [ -f ${R_BUFR}/FlushPost_${DateEnd}.ksh ] ; then
  . ${R_BUFR}/FlushPost_${DateEnd}.ksh
  IGCM_FlushPost
  #IGCM_sys_Rm -f ${R_BUFR}/FlushPost_${DateEnd}.ksh
fi

# Clean RUN_DIR_PATH (necessary for cesium and titane only)
IGCM_sys_RmRunDir -Rf ${RUN_DIR_PATH}

# ------------------------------------------------------------------
# Finalize BigBrother to report that the job has ended
# ------------------------------------------------------------------
IGCM_debug_BigBro_Finalize

date
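
The pack loop in the script above relies on two small text-processing steps: reading the end date out of a file name of the form <JobName>_<begin>_<end>_<type>.nc and testing it against the pack window, and building the comma-separated list of variables that are missing from at least one file. The following is a minimal standalone sketch of those two steps, written in plain sh so it can be tried without libIGCM. The function names (end_date_of, in_pack, vars_to_exclude), the job name "MyJob" and the sample file and variable names are assumptions made for illustration only; they are not part of libIGCM.

```shell
#!/bin/sh
# Sketch only: end_date_of, in_pack, vars_to_exclude and "MyJob" are
# hypothetical names used for illustration, not libIGCM functions.

# Print the end date (YYYYMMDD) encoded in <JobName>_<begin>_<end>_<type>.nc
# $1 = job name, $2 = file name or path
end_date_of() {
  echo "$2" | sed -e "s/.*${1}_[0-9]*_//" -e 's/\([0-9]\{8\}\)_.*$/\1/'
}

# Succeed when date $1 lies in the inclusive window [$2, $3]
in_pack() {
  [ "$1" -ge "$2" ] && [ "$1" -le "$3" ]
}

# Read one variable name per line on stdin and print, comma-separated,
# the names whose number of occurrences differs from $1 (the file count)
vars_to_exclude() {
  sort | uniq -c | awk -v n="$1" '$1 != n { print $2 }' | paste -s -d ',' -
}

# Classify three sample files against a ten-year pack window
for f in MyJob_20000101_20000131_1M_histmth.nc \
         MyJob_20091201_20091231_1M_histmth.nc \
         MyJob_20100101_20100131_1M_histmth.nc ; do
  d=$( end_date_of MyJob "${f}" )
  if in_pack "${d}" 20000101 20091231 ; then
    echo "${f} -> pack 20000101_20091231"
  else
    echo "${f} -> outside the pack"
  fi
done

# Three files: tas is in all of them, pr in two, clt in one
printf '%s\n' tas pr tas pr tas clt | vars_to_exclude 3
```

With the sample names, the first two files fall inside the window and the third does not, and the last command lists pr and clt as the variables to exclude. Comparing dates as plain integers works here because YYYYMMDD values sort numerically in calendar order.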