Repos
16
Stars
0
Forks
2
Top Language
C++
Loading contributions...
Repositories
16A dynamic binary instrumentation tool for tracing and analyzing CUDA kernel instructions.
Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.
Github mirror of trition-lang/triton repo.
Repository hosting code used to reproduce results in "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152, ICML'24).
Development repository for the Triton language and compiler
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
No description provided.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
A new high-level IR for clang.
This repo contains the .NET Core foundational libraries, called CoreFX. It includes classes for collections, file systems, console, XML, async and many others.
This repo contains the .NET Core runtime, called CoreCLR, and the base library, called mscorlib. It includes the garbage collector, JIT compiler, base .NET data types and many low-level classes.
Fork of the LLVM Compiler Infrastructure
This repo contains LLILC, an LLVM based compiler for .NET Core. It includes a set of cross-platform .NET code generation tools that enables compilation of MSIL byte code to LLVM supported platforms.
This repo contains CoreRT, a .NET Core runtime optimized for AOT (ahead of time compilation) scenarios, with the accompanying .NET Native compiler toolchain.