https://github.com/arcticsnow/topopyscale_examples

Set of examples on which to run TopoPyScale


Repository

Basic Info

Host: GitHub
Owner: ArcticSnow
License: mit
Language: Jupyter Notebook
Default Branch: main
Size: 89.1 MB

Statistics

Stars: 6
Watchers: 3
Forks: 1
Open Issues: 0
Releases: 1

Created over 4 years ago · Last pushed 7 months ago


TopoPyScale_examples

Set of examples on which to run TopoPyScale

The example ex1_norway_finse contains two Jupyter Notebooks available on

WARNINGS
1. You need a Python virtual environment set up with TopoPyScale installed.
2. Your cdsapi credentials must be ready.
3. These scripts will download the climate data from the Copernicus ERA5 repository.
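The first two warnings can be checked before launching anything. The sketch below is only an illustration, not part of the repository: the `preflight` helper is hypothetical, and it assumes cdsapi's default behaviour of reading credentials from `~/.cdsapirc` or from the `CDSAPI_URL` and `CDSAPI_KEY` environment variables.

```python
import importlib.util
import os
from pathlib import Path


def preflight(home=None, environ=None):
    """Return a list of problems; an empty list means the setup looks ready.

    Hypothetical helper: `home` and `environ` are only parameters so the
    check can be exercised without touching the real home directory.
    """
    home = Path(home) if home is not None else Path.home()
    environ = os.environ if environ is None else environ
    problems = []

    # Warning 1: TopoPyScale must be importable from the active environment.
    if importlib.util.find_spec("TopoPyScale") is None:
        problems.append("TopoPyScale is not installed in this environment")

    # Warning 2: cdsapi needs credentials, either in ~/.cdsapirc or in the
    # CDSAPI_URL / CDSAPI_KEY environment variables.
    has_rc_file = (home / ".cdsapirc").is_file()
    has_env_vars = bool(environ.get("CDSAPI_URL")) and bool(environ.get("CDSAPI_KEY"))
    if not (has_rc_file or has_env_vars):
        problems.append("no cdsapi credentials found")

    return problems


if __name__ == "__main__":
    for problem in preflight():
        print("WARNING:", problem)
```

Running it inside the virtual environment prints one line per missing prerequisite and nothing when both are in place.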

Contributors:

Simon Filhol
Joel Fiddes

Usage with TopoPyScale

Clone the repository.
Modify the project_dir in config.yml to fit your path.
Run the pipeline from within the relevant virtual environment: python pipeline.py

Now you may wait for the climate data to download. Downscaling will follow.

Example 1: Finse in Norway

Folder ex1_norway_finse

Example 2: Retezat Mountain Range in Romania

Folder ex2_romania_retezat

Example 3: Davos in Switzerland

Folder ex3_switzerland_davos

Example 4: SLURM cluster jobs

Folder ex4_slurm_cluster

This example is not intended to be run out of the box, but to serve as guidance for those using SLURM-based clusters. It will need to be adapted to your situation. If using FSM, the executable will need to be compiled on your cluster.

My use case is the WSL hyperion cluster, which uses a SLURM-based scheduling system. The issue is that the nodes are not connected, so they cannot easily make use of dask (unlike the UIO server, for example). There do seem to be ways around this, but I think it will need someone cleverer than me to figure that out:

https://jobqueue.dask.org/en/latest/generated/dask_jobqueue.SLURMCluster.html

My approach is to make it possible to distribute a big job (mine is most of the Tian Shan and some of the Pamir) by year to the cluster. This means a couple of things:

Climate data needs to be independent of the sim directory (i.e. a new climate path variable)
Two new Python scripts that can be handled by SLURM: (1) run_master.py and (2) run_worker.py

run_master.py

runs once
sets up the sim

like this:

```
from TopoPyScale import topoclass as tc

# Load the project configuration and prepare everything that is shared by all years
configfile = './config.yml'
mp = tc.Topoclass(configfile)
wdir = mp.config.project.directory
mp.compute_dem_param()
mp.extract_topo_param()
mp.toposub.write_landform()
mp.compute_horizon()
```

run_worker.py

runs many times in parallel using slurm scheduler
downscales a single year by accepting a year value as a single argument
generates a subdirectory for each sim, e.g. sim_2000 for year 2000
the example below also runs FSM sims

like this:

```
import sys
import numpy as np

# The SLURM array index (1-based) selects one year from the vector of years
startYearIndex = sys.argv[1]
myyears = np.arange(2000, 2023)
startYear = myyears[int(startYearIndex) - 1]
print(startYear)

import pandas as pd
from TopoPyScale import topoclass as tc
from TopoPyScale import topo_export as te
from TopoPyScale import topo_sim as sim
# from TopoPyScale import topo_da as da
# from TopoPyScale import topo_utils as ut
from datetime import datetime
import os
import shutil
from matplotlib import pyplot as plt

startTime = datetime.now()
configfile = './config.yml'
mp = tc.Topoclass(configfile)
mp.extract_topo_param()
wdir = mp.config.project.directory
newdir = wdir + "/sim_" + str(startYear) + "/"

# copy files
sourcedir = wdir + "/outputs"
destinationdir = newdir + "/outputs"
if os.path.exists(newdir):
    shutil.rmtree(newdir)

shutil.copytree(sourcedir, destinationdir)

# copy the FSM executable together with its permission bits
src = wdir + "/FSM"
dst = newdir + "/FSM"
shutil.copyfile(src, dst)
shutil.copymode(src, dst)

# update parameters: simulate 1 September of the previous year to 31 August
mp.config.project.directory = newdir
mp.config.project.start = mp.config.project.start.replace(year=startYear - 1, month=9, day=1)
mp.config.project.end = mp.config.project.end.replace(year=startYear, month=8, day=31)

if os.path.exists("outputs/ds_solar.nc"):
    os.remove("outputs/ds_solar.nc")
mp.compute_horizon()
mp.compute_solar_geometry()
mp.downscale_climate()

mp.to_fsm()

os.chdir(newdir)

# Simulate FSM: write a namelist for each cluster, then run it
for i in range(mp.config.sampling.toposub.n_clusters):
    nsim = "{:0>2}".format(i)
    sim.fsm_nlst(31, newdir + "/outputs/FSM_pt_" + nsim + ".txt", 24)
    sim.fsm_sim(newdir + "/fsm_sims/nlst_FSM_pt_" + nsim + ".txt", "./FSM")
```
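The "update parameters" step shifts the configured period to one hydrological year, from 1 September of the previous year to 31 August of the simulation year. The following minimal sketch isolates that date logic with plain `datetime` objects; `hydro_year_window` is a hypothetical name used for illustration only and is not part of TopoPyScale.

```python
from datetime import datetime


def hydro_year_window(start, end, year):
    """Return (start, end) moved to 1 Sep of year-1 and 31 Aug of year.

    Hypothetical helper. Year, month and day are replaced in one call so
    that no invalid intermediate date (e.g. 31 September) is ever built.
    """
    new_start = start.replace(year=year - 1, month=9, day=1)
    new_end = end.replace(year=year, month=8, day=31)
    return new_start, new_end


start, end = hydro_year_window(datetime(2000, 1, 1), datetime(2022, 12, 31), 2005)
print(start.date(), end.date())  # 2004-09-01 2005-08-31
```

Replacing the fields one at a time can raise a `ValueError` for some configured dates (for example a start on 31 January, which would briefly become 31 September), which is why the sketch changes all three fields together.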

Finally slurm.sh files handle the scheduling.

slurm_master.sh

Runs run_master.py as a single job, once. The simulation is now set up (climate, DEM, TopoSUB/points outputs, horizon).

sbatch slurm_master.sh

slurm_worker.sh

Runs a single job for every simulation year (solar calculation and downscaling, plus any simulation with FSM if desired) in parallel on the cluster. These jobs are embarrassingly parallel in that they are completely independent of each other. As we parallelise in time, even file access does not overlap (which can cause issues with netCDF climate files and multiple readers).

The following line of slurm_worker.sh must be edited to reflect the number of years in your downscaling job, in this case 23.

```
#SBATCH --array=1-23 # this is number of years in downscaling job (23)
```

This numeric ID is then passed as an argument to run_worker.py, where it simply indexes a vector of years.
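The mapping from array index to year can be sketched in isolation as follows. This is an illustration rather than the repository's code: `year_for_task` and `task_id_from_env` are hypothetical names, and the 2000 to 2022 range is taken from the worker example. `SLURM_ARRAY_TASK_ID` is the environment variable that Slurm sets for each task of an array job.

```python
import os

FIRST_YEAR, LAST_YEAR = 2000, 2022  # 23 years, matching --array=1-23


def year_for_task(task_id, first_year=FIRST_YEAR, last_year=LAST_YEAR):
    """Map a 1-based SLURM array index to a simulation year."""
    years = list(range(first_year, last_year + 1))
    index = int(task_id)
    # Reject out-of-range indices explicitly: with plain indexing, 0 would
    # silently wrap around to the last year.
    if not 1 <= index <= len(years):
        raise ValueError(f"array index {index} is outside 1-{len(years)}")
    return years[index - 1]


def task_id_from_env(environ=None):
    """Read the array index that Slurm exports for array jobs, if any."""
    environ = os.environ if environ is None else environ
    return environ.get("SLURM_ARRAY_TASK_ID")
```

With the default range, index 1 gives 2000 and index 23 gives 2022, so the upper bound of `--array` must equal the number of years.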

Then simply run: sbatch slurm_worker.sh
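Because the workers rely on the setup produced by the master job, the two submissions can also be chained so that the workers start only after the master finishes successfully. The sketch below is one possible way to do this from Python and is not part of the repository: the function names are hypothetical, while `--parsable`, `--dependency=afterok:<jobid>` and `--array` are standard sbatch options. An `--array` given on the command line takes precedence over the `#SBATCH` line in the script.

```python
import subprocess


def master_command(script="slurm_master.sh"):
    """Build the sbatch call for the one-off setup job."""
    # --parsable makes sbatch print only the job ID (optionally ";cluster")
    return ["sbatch", "--parsable", script]


def worker_command(master_job_id, n_years, script="slurm_worker.sh"):
    """Build the sbatch call for the per-year array job."""
    return [
        "sbatch",
        f"--dependency=afterok:{master_job_id}",  # wait for a successful master
        f"--array=1-{n_years}",                   # one task per simulation year
        script,
    ]


def submit_all(n_years):
    """Submit master then workers; only works where sbatch is available."""
    result = subprocess.run(
        master_command(), check=True, capture_output=True, text=True
    )
    job_id = result.stdout.strip().split(";")[0]
    subprocess.run(worker_command(job_id, n_years), check=True)
    return job_id
```

Calling `submit_all(23)` on a login node would replace the two manual `sbatch` calls for the 23-year example.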

Adding an example site

Add a folder with a DEM and a complete config.yml file. To avoid overloading the GitHub repo with data, add lines to the .gitignore file that exclude the climate data and outputs, such as:

```txt
ex2_romania_retezat/inputs/climate/*
ex2_romania_retezat/outputs/*

ex1_norway_finse/inputs/climate/*
ex1_norway_finse/outputs/*
```

Be mindful of the size of the input DEM. GitHub warns about files larger than 50 MB and blocks files larger than 100 MB.
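A quick way to spot oversized files before committing a new example is to scan its folder. This is a minimal sketch with a hypothetical helper name, `large_files`; the default threshold of 50 MB follows the limit mentioned above.

```python
from pathlib import Path

WARN_BYTES = 50 * 1024 * 1024  # 50 MB


def large_files(root, limit=WARN_BYTES):
    """Return (relative path, size in bytes) for files under root above limit."""
    root = Path(root)
    hits = []
    for path in sorted(root.rglob("*")):
        relative = path.relative_to(root)
        # Skip git's own object store
        if ".git" in relative.parts:
            continue
        if path.is_file() and path.stat().st_size > limit:
            hits.append((relative.as_posix(), path.stat().st_size))
    return hits


if __name__ == "__main__":
    for name, size in large_files("."):
        print(f"{name}: {size / 1e6:.1f} MB")
```

Run from the repository root, it prints each file above the threshold, for example a DEM that should be cropped or resampled before being added.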

Owner

Name: Simon Filhol
Login: ArcticSnow
Kind: user
Location: Norway
Company: University of Oslo

Repositories: 33
Profile: https://github.com/ArcticSnow
