Select Git revision
- Branches 20
- olruwase/trainable_parameters
- elasticity-v2
- master default protected
- rtd-staging
- staging-moe-zero-v3
- olruwase/callable_lr_scheduler
- reyazda/mp_inference
- staging-cl-v0
- jeffra/engine-xthru-v2
- big-science
- olruwase/global_gradient_norm
- olruwase/deepspeed_config_mpu
- grad-norm-query
- olruwase/mem_centric_tile_bug
- olruwase/align_rrg_rs_param_order
- cpu-adam/optional_CUDA-copy
- jeffra/pp-zero-gas-fix
- olruwase/docs
- jeffra/py3
- olruwase/zero3_broken_tracing
- Tags 20
- v0.5.0
- v0.4.5
- v0.4.4
- v0.4.3
- v0.4.2
- v0.4.1
- v0.4.0
- v0.3.16
- v0.3.15
- v0.3.14
- v0.3.13
- v0.3.12
- v0.3.11
- v0.3.10
- v0.3.9
- v0.3.8
- grad-norm-test
- v0.3.7
- v0.3.6
- v0.3.5
Compare
-
-
- Open in your IDE
- Download source code
Name | Last commit | Last update |
---|---|---|