Repository navigation
checkpointing
- Website
- Wikipedia
A GPipe implementation in PyTorch
An I/O benchmark for deep Learning applications
Extending DOLFINx with checkpointing functionality
Keras wrapper that autosaves what ModelCheckpoint cannot.
A python package for performing memory intensive computations in parallel using chunks and checkpointing.
A python package for checkpointing, saving, and loading objects.
A lightweight checkpointing program written in C.
Code and tutorial on integrating wandb sweeps with Slurm pre-emption
This FLINK project will consume streams from an azure event-hub and produce to a different event-hub ,and the config files for deploying the same in kubernetes
Hangman Game Word Predictor (Character-level attention)
A shared library to help test your code with failure-injection
This is a standalone flink producer using for testing the flink-consume-produce-ek repo contents
Robust distributed checkpointing and job management system for multi-GPU SLURM workloads
DMTCP scripts to get Python scripts working with SLURM.
A digital album face recognition manager, that isolates images of a specified person from a digital album.
Compile a torch model to a checkpointed model
🌀 data objects for Bash (attempt one).