Explore projects
-
-
Simple "Hello World" type program used to test the layout of resources on a Summit node using jsrun.
Updated -
Simple tester for multi-architecture domain decomposed particle transport
Updated -
This repository contains example OpenACC programs to test the OpenARC compiler.
Updated -
This project isolates an issue related to CUDA separable compilation that occurs in legacy CMake with CUDA as a TPL. The issue is fixed when using modern CMake with CUDA enabled as a language.
Updated -
Updated
-
Updated
-
ecpcitest / vtk-m
BSD 3-Clause "New" or "Revised" LicenseVTK-m mirror which runs in OLCF CI
UpdatedUpdated -
Celeritas / Celeritas
Apache License 2.0Updated -
ecpcitest / chm137 / spinifel
Lawrence Berkeley National Labs BSD variant licenseUpdated -
-
PyTorch-based large-scale ptychography for determining atom trajectories
Updated -
Strategies to distribute simplex-shaped workload across thousands of GPUs through mathematical mapping and dynamic scheduling
Updated -
Gounley, John / DeepSpeed
MIT LicenseDeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
Updated -
Simple test of a HIP implementation's ability of kernels to accept an unused object reference.
Updated -
candle / Megatron-LM
Apache License 2.0Ongoing research training transformer language models at scale, including: BERT & GPT-2
Updated -
Seer is an intelligent system for extreme heterogeneous architectures
Updated -
SCALE / Code / external / celeritas
Apache License 2.0Updated -
Kumar, Atul / GITR
GNU General Public License v2.0 or laterUpdated -
Kumar, Atul / IEADs with GITR
GNU General Public License v2.0 or laterUpdated