Repositories
2LL
llying-001/TransformerEngine-RocmFork
No description provided.
00Updated 10 months ago
LL
llying-001/TransformerEngineFork
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
00Updated 11 months ago