Benchmark: dense GPU computing -> ~98% GPU time
- Llama-3.2-3B
- 50 training steps
- with FSDP
- with torch.compile
- Download model: Llama-3.2-3B-Instruct
- Download dataset: hieunguyenminh/roleplay
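Assuming the Hugging Face CLI is available (and, for the gated Llama model, that an access token is configured), both artifacts can be fetched ahead of time. This is a sketch, not part of the benchmark scripts; the `meta-llama/` org prefix is the usual repo id for this model and is an assumption here:

```shell
# Sketch: pre-download the model and dataset with huggingface-cli.
# Guarded so it only prints a hint when the CLI is not installed.
if command -v huggingface-cli >/dev/null 2>&1; then
  huggingface-cli download meta-llama/Llama-3.2-3B-Instruct
  huggingface-cli download hieunguyenminh/roleplay --repo-type dataset
  STATUS="downloaded"
else
  STATUS="huggingface-cli not found (pip install -U huggingface_hub)"
  echo "$STATUS"
fi
```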
- Run the benchmark with:
sbatch slurm/bench_h100_cap.slurm
sbatch slurm/bench_h100_nocap.slurm
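The two .slurm scripts are not reproduced here; as a rough sketch (every #SBATCH directive, the GPU count, and the `train.py` entry point are assumptions, not the actual script contents), a submission script for this setup typically looks like:

```shell
#!/bin/bash
#SBATCH --job-name=bench_h100    # hypothetical directives; see slurm/ for the real scripts
#SBATCH --gres=gpu:4             # assumed GPU count per node
#SBATCH --time=01:00:00

module purge
module load pytorch-gpu/py3/2.5.0    # the module named below

srun python train.py             # assumed entry point
```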
- using the
pytorch-gpu/py3/2.5.0
module
(see requirements.txt
for the module's library equivalence)
The list of imported libraries should be:
- torch==2.5.0
- transformers==4.46.0
- datasets==3.0.2
- idr_torch==2.2.0
- torchmetrics==1.5.1
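To check that the loaded module actually provides these pinned versions, a small helper can compare them against what is installed. This is a sketch; it assumes `python3` is on PATH and that each distribution name matches its import name (e.g. idr_torch):

```shell
# Sketch: report any package whose installed version differs from the pinned one.
check_versions() {
  for spec in "$@"; do
    pkg=${spec%%==*}     # e.g. torch
    want=${spec#*==}     # e.g. 2.5.0
    got=$(python3 -c "from importlib.metadata import version; print(version('${pkg}'))" 2>/dev/null || echo missing)
    [ "$got" = "$want" ] || echo "$pkg: expected $want, found $got"
  done
}
check_versions torch==2.5.0 transformers==4.46.0 datasets==3.0.2 \
               idr_torch==2.2.0 torchmetrics==1.5.1
```

Run it inside the loaded module environment; no output means every pin matches.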