NielsRogge
ML @HuggingFace. Interested in deep learning, NLP. Contributed 40+ models to HuggingFace Transformers
Languages
Loading contributions...
Top Repositories
This repository contains demos I made with the Transformers library by HuggingFace.
This repository contains an overview of important follow-up works based on the original Vision Transformer (ViT) by Google.
A repository containing general tutorials I'd like to share with the world.
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
Short README about myself.
UniLM - Unified Language Model Pre-training / Pre-training for NLP and Beyond
Repositories
261A repository with various baselines for the agentic-document-ai project.
This repository contains demos I made with the Transformers library by HuggingFace.
Code & data for TaxCalcBench
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like RF-DETR, YOLO11, SAM 3, and Qwen3-VL.
This repository is an official implementation of the paper "LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection".
EfficientSAM3 compresses SAM3 into lightweight, edge-friendly models via progressive knowledge distillation for fast promptable concept segmentation and tracking.
No description provided.
[DEIMv2] Real Time Object Detection Meets DINOv3
Official code and models for Video Encoder-only Mask Transformer (VidEoMT).
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
This repository contains an overview of important follow-up works based on the original Vision Transformer (ViT) by Google.
veRL: Volcano Engine Reinforcement Learning for LLM
This repository is meant for parsing evaluation results from Hugging Face models, and opening pull requests on the hub to display them at leaderboards.
A repository containing general tutorials I'd like to share with the world.
No description provided.
⏰ AI conference deadline countdowns
:alarm_clock: AI conference deadline countdowns
Short README about myself.
SGLang is a fast serving framework for large language models and vision language models.
a state-of-the-art-level open visual language model
A lightweight, powerful framework for multi-agent workflows
Reference implementation of Mistral AI 7B v0.1 model.
UniLM - Unified Language Model Pre-training / Pre-training for NLP and Beyond
CVE cache of the official CVE List in CVE JSON 5 format
Utilities to use the Hugging Face Hub API
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
No description provided.
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.