162 results for “topic:huggingface-models”
⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary. FREE now, FREE forever.
A gopeed-extension for downloading models and datasets from huggingface, hf-mirror and modelscope. Huggingface download
Object storage for the AI age
This Next.js application generates videos based on client-provided queries. It is designed as a SaaS platform, allowing users to easily create engaging video content for various purposes such as marketing, education, or social media. The app leverages cutting-edge technologies to provide a smooth user experience and high-quality video output
[CVPR 2025] HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation
Mount remote repositories, models and datasets managed by Git LFS instantly.
huggingface-go : 高速下载 huggingface 的模型和数据集
Upload & Merge CSV or JSON Data with Images to Notion Database
Easy text classification for everyone : Bert based models via Huggingface transformers (KR / EN)
ScaleDP is an Open-Source extension of Apache Spark for Document Processing
ORLA is a web application that transforms text prompts into detailed 3D models using advanced AI technologies. With an intuitive interface and powerful backend, ORLA enables users to generate high-quality 3D assets quickly and easily.
Multimodal-OCR is an experimental, high-performance visual reasoning and optical character recognition suite designed to accurately extract text, analyze visual content, and parse complex document structures. Built upon a diverse ecosystem of cutting-edge vision-language models.
A 5-way embedding model for text, audio, image, video, and 3D point clouds.
Multimodal Document Processing RAG with LangChain
Neuromorphic Bird Classifier Desktop App (NeuroBCDA) bundled with Live Event Camera Simulator
Huggingface Backup - Jupyter, Colab and Python Script
Simple Generative AI enabled Streamlit web application that converts speech to-image.
A machine learning platform for market analysis and forecasting, specializing in stock volatility prediction, market trend forecasting, and portfolio optimization.
Multimodal-OCR3 is a highly capable, experimental optical character recognition and visual processing suite designed for precise text extraction, document parsing, and markdown generation. Leveraging a powerful selection of vision-language.
This repository contains code for generating blog content using the LLama 2 language model. It integrates with Streamlit for easy user interaction. Simply input your blog topic, desired word count, and writing style to generate engaging blog content.
一个从 Hugging Face 镜像站点快速下载模型和数据集的命令行工具。
This repository includes notebooks starting from data tokenization and fine-tuning of BERT models including ModernBERT, till upload models to the hub for Electrical Engineering NER task.
🤗 A Python script for efficiently downloading and reconstructing large Hugging Face model files by splitting them into manageable chunks
A library and Hugging Face model downloader for Ollama.
QIE-Bbox-Studio (Qwen Image Edit Bounding Box Studio) is an advanced AI-powered image editing interface built on top of the Qwen2.5-VL and Qwen-Image-Edit models. This application allows users to manipulate images with extreme precision by defining bounding boxes and providing natural language prompts.
🤖 Dermify.AI is an ⚡ AI powered Web Application which harnesses the power of image processing to offer cost-effective and accessible skin condition assessments worldwide
Demonstration for NVIDIA's Nemotron-Parse-v1.1 model, designed for advanced document parsing and OCR. Upload images of documents (e.g., papers, forms) to extract structured content: text, tables (LaTeX), figures, and titles. Outputs annotated images with colored bounding boxes and processed markdown/LaTeX text for easy integration.
VLM-Parsing is a Gradio-based web application for parsing documents and images into structured HTML and Markdown formats using advanced Vision Language Models (VLMs).
A URL summarizer, which summarizes the content of a URL with proper formatting. It uses 'sshleifer/distilbart-cnn-12-6', which is a distilled version of the BART model, specifically optimized for text summarization tasks, including CNN summarization.
A fine-tuned version of SmolLM2-360M-Instruct-bnb-4bit specialized for parsing unstructured calendar event requests into structured JSON data.