11 results for “topic:gelu”
Vision Transformers Need Registers. And Gated MLPs. And +20M params. Tiny modality gap ensues!
Implemented GPT from scratch
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models
Vision Transformers Need Registers. And Gated MLPs. And +20M params. Tiny modality gap ensues!
The 'Activation Functions' project repository contains implementations of various activation functions commonly used in neural networks.
A GPT Model To Generate Text
A PyTorch implementation that solves the heat equation, a partial differential equation, using the Deep Kolmogorov Method of Beck et al.
Annotated vanilla implementation in PyTorch of the Transformer model introduced in 'Attention Is All You Need'.
This repository contains the code and the report for the coursework of INFR11031 Advanced Vision, a postgraduate course offered at The University of Edinburgh. The task was to train on limited data and improve the accuracy of the ResNet-50 classifier on a small subset of the ImageNet dataset containing 50K training images and 50K test images. Achieved a mark of 74%.
PyTorch implementation of normalization-free LLMs investigating entropic behavior to find desirable activation functions
Microphone Array-Based Direction of Arrival of Gunshot Detection. Gun violence remains a critical concern. Identifying the precise location of a gunshot, or getting as close as humanly possible, is crucial for saving lives and ensuring public safety.