source: trunk/libIGCM/AA_pack_output @ 1389

Last change on this file since 1389 was 1356, checked in by sdipsl, 8 years ago
  • Property licence set to
    The following licence information concerns ONLY the libIGCM tools
    ==================================================================

    Copyright © Centre National de la Recherche Scientifique CNRS
    Commissariat à l'Énergie Atomique CEA

    libIGCM : Library for Portable Models Computation of IGCM Group.

    IGCM Group is the french IPSL Global Climate Model Group.

    This library is a set of shell scripts and functions whose purpose is
    the management of the initialization, the launch, the transfer of
    output files, the post-processing and the monitoring of datas produce
    by any numerical program on any plateforme.

    This software is governed by the CeCILL license under French law and
    abiding by the rules of distribution of free software. You can use,
    modify and/ or redistribute the software under the terms of the CeCILL
    license as circulated by CEA, CNRS and INRIA at the following URL
    "http://www.cecill.info".

    As a counterpart to the access to the source code and rights to copy,
    modify and redistribute granted by the license, users are provided only
    with a limited warranty and the software's author, the holder of the
    economic rights, and the successive licensors have only limited
    liability.

    In this respect, the user's attention is drawn to the risks associated
    with loading, using, modifying and/or developing or reproducing the
    software by the user in light of its specific status of free software,
    that may mean that it is complicated to manipulate, and that also
    therefore means that it is reserved for developers and experienced
    professionals having in-depth computer knowledge. Users are therefore
    encouraged to load and test the software's suitability as regards their
    requirements in conditions enabling the security of their systems and/or
    data to be ensured and, more generally, to use and operate it in the
    same conditions as regards security.

    The fact that you are presently reading this means that you have had
    knowledge of the CeCILL license and that you accept its terms.
  • Property svn:keywords set to Revision Author Date
File size: 12.5 KB
Line 
1#-Q- curie ######################
2#-Q- curie ## CURIE   TGCC/CEA ##
3#-Q- curie ######################
4#-Q- curie #MSUB -r PACKOUTPUT     # Nom du job
5#-Q- curie #MSUB -eo
6#-Q- curie #MSUB -n 1              # Reservation du processus
7#-Q- curie #MSUB -T 36000          # Limite de temps elapsed du job
8#-Q- curie #MSUB -q ::default_node::
9#-Q- curie #MSUB -c ::default_core::
10#-Q- curie #MSUB -Q normal
11#-Q- curie #MSUB -A ::default_project::
12#-Q- curie set +x
13#-Q- ada #!/bin/ksh
14#-Q- ada #######################
15#-Q- ada ## ADA         IDRIS ##
16#-Q- ada #######################
17#-Q- ada # @ job_type = serial
18#-Q- ada # @ requirements = (Feature == "prepost")
19#-Q- ada # Temps Elapsed max. d'une requete hh:mm:ss
20#-Q- ada # @ wall_clock_limit = 10:00:00
21#-Q- ada # Memory required for ncrcat
22#-Q- ada # @ as_limit = 30Gb
23#-Q- ada # Nom du travail LoadLeveler
24#-Q- ada # @ job_name   = PACKOUTPUT
25#-Q- ada # Fichier de sortie standard du travail
26#-Q- ada # @ output     = $(job_name).$(jobid)
27#-Q- ada # Fichier de sortie d'erreur du travail
28#-Q- ada # @ error      =  $(job_name).$(jobid)
29#-Q- ada # pour recevoir un mail en cas de depassement du temps Elapsed (ou autre pb.)
30#-Q- ada # @ notification = error
31#-Q- ada # @ environment  = $DEBUG_debug ; $BigBrother ; $postProcessingStopLevel ; $MODIPSL ; $libIGCM ; $libIGCM_SX ; $POST_DIR ; $Script_Post_Output ; $SUBMIT_DIR ; $DateBegin ; $DateEnd ; $PeriodPack ; $StandAlone ; $MASTER ; wall_clock_limit=$(wall_clock_limit)
32#-Q- ada # @ queue
33#-Q- lxiv8 ######################
34#-Q- lxiv8 ## OBELIX      LSCE ##
35#-Q- lxiv8 ######################
36#-Q- lxiv8 #PBS -N PACKOUTPUT
37#-Q- lxiv8 #PBS -m a
38#-Q- lxiv8 #PBS -j oe
39#-Q- lxiv8 #PBS -q medium
40#-Q- lxiv8 #PBS -o PACKOUTPUT.$$
41#-Q- lxiv8 #PBS -S /bin/ksh
42#-Q- ifort_CICLAD ######################
43#-Q- ifort_CICLAD ##   CICLAD    IPSL ##
44#-Q- ifort_CICLAD ######################
45#-Q- ifort_CICLAD #PBS -N PACKOUTPUT
46#-Q- ifort_CICLAD #PBS -m a
47#-Q- ifort_CICLAD #PBS -j oe
48#-Q- ifort_CICLAD #PBS -q std
49#-Q- ifort_CICLAD #PBS -S /bin/ksh
50#-Q- default #!/bin/ksh
51#-Q- default ##################
52#-Q- default ## DEFAULT HOST ##
53#-Q- default ##################
54
55#**************************************************************
56# Author: Sebastien Denvil
57# Contact: Sebastien.Denvil__at__ipsl.jussieu.fr
58# $Revision::                                          $ Revision of last commit
59# $Author::                                            $ Author of last commit
60# $Date::                                              $ Date of last commit
61# IPSL (2006)
62#  This software is governed by the CeCILL licence see libIGCM/libIGCM_CeCILL.LIC
63#
64#**************************************************************
65
66#set -eu
67#set -vx
68
69date
70
71#D- Task type DO NOT CHANGE (computing, post-processing or checking)
72TaskType=post-processing
73
74########################################################################
75
76#D- Flag to determine if this job in a standalone mode
77#D- Default : value from AA_job if any
78StandAlone=${StandAlone:=true}
79
80#D- Path to libIGCM
81#D- Default : value from AA_job if any
82# WARNING For StandAlone use : To run this script on some machine (ulam and cesium)
83# WARNING you must check MirrorlibIGCM variable in sys library.
84# WARNING If this variable is true, you must use libIGCM_POST path instead
85# WARNING of your running libIGCM directory.
86libIGCM=${libIGCM:=::modipsl::/libIGCM}
87
88#-D- $hostname of the MASTER job when SUBMIT_DIR is not visible on postprocessing computer.
89MASTER=${MASTER:=ada|curie}
90
91#D- Flag to determine begin date for restart pack
92#D- Default : value from AA_job if any
93DateBegin=${DateBegin:=20000101}
94
95#D- Flag to determine end date for restart pack
96#D- Default : value from AA_job if any
97DateEnd=${DateEnd:=20691231}
98
99#D- Flag to determine pack period
100#D- Default : value from AA_job if any
101PeriodPack=${PeriodPack:=10Y}
102
103#D- Uncomment to run interactively
104#D- For testing purpose, will be remove
105#SUBMIT_DIR=${PWD}
106#RUN_DIR_PATH=${SCRATCHDIR}/Pack_Test
107
108#D- Increased verbosity (1, 2, 3)
109#D- Default : value from AA_job if any
110Verbosity=${Verbosity:=3}
111
112#D- Low level debug : to bypass lib test checks and stack construction
113#D- Default : value from AA_job if any
114DEBUG_debug=${DEBUG_debug:=false}
115
116########################################################################
117
118. ${libIGCM}/libIGCM_debug/libIGCM_debug.ksh
119. ${libIGCM}/libIGCM_card/libIGCM_card.ksh
120. ${libIGCM}/libIGCM_date/libIGCM_date.ksh
121#-------
122. ${libIGCM}/libIGCM_sys/libIGCM_sys.ksh
123. ${libIGCM}/libIGCM_config/libIGCM_config.ksh
124. ${libIGCM}/libIGCM_post/libIGCM_post.ksh
125#-------
126RUN_DIR=${RUN_DIR_PATH}
127IGCM_sys_MkdirWork ${RUN_DIR}
128IGCM_sys_Cd ${RUN_DIR}
129#-------
130( ${DEBUG_debug} ) && IGCM_debug_Check
131( ${DEBUG_debug} ) && IGCM_card_Check
132( ${DEBUG_debug} ) && IGCM_date_Check
133
134########################################################################
135
136#set -vx
137
138# ------------------------------------------------------------------
139# Test if all was right before proceeding further
140# ------------------------------------------------------------------
141IGCM_debug_Verif_Exit
142
143if [ ${StandAlone} = true ] ; then
144    CARD_DIR=${SUBMIT_DIR}
145else
146    CARD_DIR=${RUN_DIR_PATH}
147    IGCM_sys_Get_Master ${SUBMIT_DIR}/config.card ${RUN_DIR_PATH}
148    IGCM_sys_Get_Master ${SUBMIT_DIR}/run.card    ${RUN_DIR_PATH}
149    IGCM_sys_Get_Master ${SUBMIT_DIR}/COMP        ${RUN_DIR_PATH}
150    IGCM_sys_Get_Master ${SUBMIT_DIR}/POST        ${RUN_DIR_PATH}
151fi
152
153#==================================
154# First of all
155#
156# Read libIGCM compatibility version in config.card
157# Read UserChoices section
158# Read Ensemble section
159# Read Post section
160# Define all netcdf output directories
161#==================================
162IGCM_config_CommonConfiguration ${CARD_DIR}/config.card
163
164# ------------------------------------------------------------------
165# Activate BigBrother so as to supervise this job
166# ------------------------------------------------------------------
167IGCM_debug_BigBro_Initialize
168
169#==================================
170# Read ListOfComponents section
171# to drive the loop over find
172IGCM_card_DefineArrayFromSection ${CARD_DIR}/config.card ListOfComponents
173
174#==================================
175# Test and set up directories
176#==================================
177IGCM_sys_TestDirArchive ${R_SAVE}
178[ $? != 0 ] && IGCM_debug_Exit "IGCM_sys_TestDirArchive"
179
180# Where to store used file list /!\ TEMPORARY /!\
181STORE_DEBUG=${R_SAVE}/DEBUG
182
183# Switch to script variables meaning (try to be compatible with ipsl_pack TGCC moving procedure)
184JobName=${config_UserChoices_JobName}
185echo $JobName $DateBegin $DateEnd
186
187# ------------------------------------------------------------------
188# Test if all was right before proceeding further
189# ------------------------------------------------------------------
190IGCM_debug_Verif_Exit
191
192IGCM_debug_Print 1 "Check coherence between PackFrequency and PeriodLength"
193IGCM_post_CheckModuloFrequency PeriodPack config_UserChoices_PeriodLength NbPeriodPerFrequency
194# ------------------------------------------------------------------
195# Test if all was right before proceeding further
196# ------------------------------------------------------------------
197IGCM_debug_Verif_Exit
198
199IGCM_debug_Print 1 "We must process ${NbPeriodPerFrequency} files for each pack"
200
201# Init loop
202date_begin_pack=${DateBegin}
203date_end_simulation=${DateEnd}
204number_pack=1
205
206IGCM_debug_PrintVariables 3 date_begin_pack
207IGCM_debug_PrintVariables 3 date_end_simulation
208
209while [ ${date_begin_pack} -le ${date_end_simulation} ] ; do
210
211  IGCM_debug_PrintVariables 3 number_pack
212  DaysTemp=$( IGCM_date_DaysInCurrentPeriod ${date_begin_pack} ${PeriodPack} )
213  date_end_pack=$( IGCM_date_AddDaysToGregorianDate ${date_begin_pack} $(( ${DaysTemp} - 1 )) )
214
215  for comp in ${config_ListOfComponents[*]} ; do
216    dirList=$( find ${R_BUFR}/${comp}/Output -maxdepth 1 -mindepth 1 -type d )
217    for dir in ${dirList} ; do
218      # dirID is like ATM.Output.MO
219      dirID=$( echo $dir | sed "s:${R_BUFR}/::" | sed "s:/:.:g" )
220      # Sort what's in the directory
221      find ${dir} -type f -name "${JobName}*.nc" -ls | sort -k 11 > liste_files.${dirID}.txt
222      # How much file type. Example : 1M_histmthCOSP.nc, 1M_histmth.nc, 1M_histmthNMC.nc, 1M_paramLMDZ_phy.nc
223      # /!\ fileType include the .nc extension /!\
224      fileType=$( gawk '{print $11}' liste_files.${dirID}.txt | gawk -F$dir/ '{print $2}' | sed "s:${JobName}_[0-9]\{8,9\}_[0-9]\{8,9\}_::g" | sort | uniq )
225      # Loop over the file type and pack them when in between date_begin_pack and date_end_pack
226      for myType in ${fileType} ; do
227        grep ${myType} liste_files.${dirID}.txt > liste_files.${dirID}.${myType}.txt
228        nbfile=0
229        for file in $( gawk '{print $11}' liste_files.${dirID}.${myType}.txt ); do
230          extract_date_file=$( echo ${file}  | sed -e "s/.*${JobName}_[0-9]*_//" )
231          date_file=$( echo ${extract_date_file} | sed 's/\([0-9]\{8\}\)_.*$/\1/g' )
232          # echo pack n°${number_pack}  ${date_file} ${date_begin_pack} ${date_end_pack}
233          if [ ${date_file} -le ${date_end_pack} ] && [ ${date_file} -ge ${date_begin_pack} ] ; then
234            echo ${file} >> liste_pack_${myType}_${date_begin_pack}_${date_end_pack}.txt
235            ncdump -h ${file} | grep -E 'float|double' | cut -f 1 -d '(' | cut -f 2 -d ' ' >> liste_variables_${myType}_${date_begin_pack}_${date_end_pack}.txt
236            (( nbfile = nbfile + 1 ))
237          fi
238        done
239
240        if [ ${nbfile} = 0 ] ; then
241          IGCM_debug_Print 1 "We found no file to process"
242          IGCM_debug_Print 1 "We should have found ${NbPeriodPerFrequency} files"
243          IGCM_debug_Print 1 "As some files can be produced only for some selected period we consider we can move to the next file type"
244          continue
245        fi
246
247        # Select list of variables to work with
248        list_var=$( cat liste_variables_${myType}_${date_begin_pack}_${date_end_pack}.txt | sort | uniq -c | awk -v nbfile=$nbfile '{if ($1 != nbfile) {print $2}}' | paste -s -d ',' )
249        liste_file_tmp=$( for i in $( cat liste_pack_${myType}_${date_begin_pack}_${date_end_pack}.txt ) ; do basename $i ; done )
250        # Create packed files
251        IGCM_debug_Print 1 "Ncrcat ongoing for ${dir} and ${myType}"
252        if [ ! ${nbfile} = ${NbPeriodPerFrequency} ] ; then
253          IGCM_debug_Print 1 "Number of files to process is not equal to what it should be"
254          IGCM_debug_Print 1 "We found ${nbfile} files and it should have been ${NbPeriodPerFrequency} files"
255          IGCM_debug_Exit "ERROR in number of files to process. STOP HERE INCLUDING THE COMPUTING JOB"
256          IGCM_debug_Verif_Exit
257        fi
258        output=${JobName}_${date_begin_pack}_${date_end_pack}_${myType}
259        #cat liste_pack_${myType}_${date_begin_pack}_${date_end_pack}.txt | xargs ncrcat -v ${list_var} -o ${output}
260        if [ X${list_var} = X ] ; then
261          IGCM_sys_ncrcat -p ${dir} ${liste_file_tmp} --output ${output}
262        else
263          IGCM_sys_ncrcat -x -v ${list_var} -p ${dir} ${liste_file_tmp} --output ${output}
264        fi
265        # ------------------------------------------------------------------
266        # Test if all was right before proceeding further
267        # ------------------------------------------------------------------
268        IGCM_debug_Verif_Exit
269        # Save it
270        IGCM_sys_Put_Out ${output} ${R_SAVE}/$( echo $dir | sed "s:${R_BUFR}/::" )/${output}
271        # Clean file produced by ncrcat
272        IGCM_sys_Rm ${output}
273        # ------------------------------------------------------------------
274        # Test if all was right before proceeding further
275        # ------------------------------------------------------------------
276        IGCM_debug_Verif_Exit
277        # Clean files used by ncrcat
278        cat liste_pack_${myType}_${date_begin_pack}_${date_end_pack}.txt | xargs rm
279        # Save the list of files that has been pack (ncrcat)
280        #mv liste_pack_${myType}_${date_begin_pack}_${date_end_pack}.txt ${STORE_DEBUG}
281        IGCM_debug_Print 1 "Ncrcat and cleaning done for ${dir} and ${myType}"
282        echo
283      done
284    done
285  done
286  (( number_pack = number_pack + 1 ))
287  # Add 1 day to date_end_pack to have the new date_begin_pack
288  date_begin_pack=$( IGCM_date_AddDaysToGregorianDate ${date_end_pack} 1 )
289done
290
291# Flush post-processing submission
292if [ -f ${R_BUFR}/FlushPost_${DateEnd}.ksh ] ; then
293  . ${R_BUFR}/FlushPost_${DateEnd}.ksh
294  IGCM_FlushPost
295  #IGCM_sys_Rm -f ${R_BUFR}/FlushPost_${DateEnd}.ksh
296fi
297
298# Clean RUN_DIR_PATH (necessary for cesium and titane only)
299IGCM_sys_RmRunDir -Rf ${RUN_DIR_PATH}
300
301# ------------------------------------------------------------------
302# Finalize BigBrother to inform that the jobs end
303# ------------------------------------------------------------------
304IGCM_debug_BigBro_Finalize
305
306date
Note: See TracBrowser for help on using the repository browser.