Changes between Version 4 and Version 5 of 2020WP/HPC-10_mcastril_HPDAonline DiagGPU


Timestamp: 2020-01-21T12:29:10+01:00
Author: mcastril

  • 2020WP/HPC-10_mcastril_HPDAonline DiagGPU

    v4 v5  
    3030As a first step, the portability of NEMO diagnostic calculations to GPUs has been analyzed, exploring how to adapt these regions from the current MPI implementation to the CUDA paradigm. A toy model has been created to perform preliminary tests, which were done using the dia_hsb diagnostic. The code itself executes 50x faster than on a single CPU, but the data transfer to and from the GPU is the main bottleneck.  
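
To illustrate why the transfer can cancel out the kernel speedup, the following is a minimal sketch of the arithmetic, not a measurement. Only the 50x kernel speedup comes from the text above; the function name `effective_speedup` and all timings are hypothetical values chosen for illustration.

```python
def effective_speedup(cpu_time, kernel_speedup, transfer_time):
    """Speedup of an offloaded diagnostic once host<->device copies are counted.

    cpu_time       -- time of the diagnostic on a single CPU (seconds)
    kernel_speedup -- how much faster the GPU kernel alone runs (e.g. 50)
    transfer_time  -- total time spent copying data to and from the GPU (seconds)
    """
    gpu_time = cpu_time / kernel_speedup + transfer_time
    return cpu_time / gpu_time


if __name__ == "__main__":
    # Hypothetical timings: the CPU run takes 1 s, the transfer cost varies.
    cpu_time = 1.0
    for transfer_time in (0.0, 0.02, 0.2, 1.0):
        speedup = effective_speedup(cpu_time, 50.0, transfer_time)
        print(f"transfer = {transfer_time:4.2f} s -> effective speedup = {speedup:5.2f}x")
```

Under these assumed numbers, the offload stops paying off once the transfer takes about as long as the original CPU computation, regardless of how fast the kernel is.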
    3131 
    32 We are working on the asynchronous strategy in order to hide all communications among GPU/CPU. We also plan to increase the efficiency of the overall solution, by mitigating the impact of the offloaded data and extending our approach to the rest of the diagnostics.  
     32The plan for the next months is to remove the CPU/GPU communications from the critical path, either by making them asynchronous or by overlapping them with a computation phase. At the same time, the efficiency of the overall solution will be improved by mitigating the impact of the offloaded data. 
     33 
     34The last step will be to extend this approach to the rest of the diagnostics.  
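
The benefit of taking the communications off the critical path can be estimated with a simple pipeline model. The sketch below is an assumption-laden illustration, not the planned implementation: it assumes the data can be split into independent chunks and that host-to-device copies, kernels and device-to-host copies can each proceed concurrently with the other two stages (on real hardware this would roughly correspond to asynchronous copies and kernels issued on separate CUDA streams). The function names and per-chunk timings are hypothetical.

```python
def serial_time(n_chunks, h2d, compute, d2h):
    """Total time when every chunk is copied in, computed and copied out in sequence."""
    return n_chunks * (h2d + compute + d2h)


def overlapped_time(n_chunks, h2d, compute, d2h):
    """Total time when the three stages run as a pipeline over the chunks.

    Each stage handles one chunk at a time, and a chunk can enter a stage only
    after it has left the previous stage and the stage has finished the
    previous chunk.
    """
    h2d_end = compute_end = d2h_end = 0
    for _ in range(n_chunks):
        h2d_end = h2d_end + h2d
        compute_end = max(h2d_end, compute_end) + compute
        d2h_end = max(compute_end, d2h_end) + d2h
    return d2h_end


if __name__ == "__main__":
    # Hypothetical per-chunk costs (arbitrary time units), transfer-dominated.
    n_chunks, h2d, compute, d2h = 4, 2, 1, 1
    print("serial:    ", serial_time(n_chunks, h2d, compute, d2h))
    print("overlapped:", overlapped_time(n_chunks, h2d, compute, d2h))
```

In this model the overlapped time tends towards the cost of the slowest stage multiplied by the number of chunks, so when transfers dominate, overlapping hides the computation rather than the transfer. This is why reducing the volume of offloaded data remains important alongside the asynchronous strategy.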
    3335 
    3436=== Documentation updates