27 results for “topic:dataset-management”
A Simple Image Clustering Script using CLIP and Hierarchial Clustering
A free and opensource yolov8, yolo11 and yolo26 all in one training tool that automates file structure and yaml files, auto labeling with SAM2, brush system for uninterupted labeling, a strong modular augmentation system where anybody can write their own filters and training. Without having to open terminal.
A tool to streamline AI image captioning
code for generating a high-quality knowledge graph with metadata about datasets and links to publications
An all-in-one local GUI to visualize, analyze, merge, and rapidly improve your Computer Vision datasets.
A resource for biomedical students and researchers. Includes proteomics software tools like FragPipe, MaxQuant, PDV, SearchGUI, ThermoRawFileParser, and PeptideShaker. Offers a user-friendly interface, automated identification and quantification, comprehensive data analysis, and lightweight clone feature for optimized storage.
Pluk is a simple dataset management system which stores data in chunks and a virtual filesystem in DB. Also includes kdataset CLI tool
PixelPruner Gradio is a user-friendly image cropping & dataset management app. It supports PNG, JPG, JPEG, GIF, BMP, and TIFF formats. Easily crop, preview, and manage images with interactive previews, thumbnail views, and Zip packaging. Streamline your workflow and achieve perfect crops every time with PixelPruner.
Roboflow-lite alternative: a local-first, open-source MLOps toolkit for building and training computer-vision models.
Collection of examples and tools to start playing with computer vision and deep learning
a dataset management tool
This is the 'data.aykhan.net' repository, serving as a dedicated static data API. It offers structured endpoints for user profiles, product details, events, and more, simplifying data access for web and software projects. Explore and integrate reliable static data into your applications with ease.
🔧 Control stepper motors wirelessly with VAL3000, the quick and easy driver that requires no programming. Perfect for precise motor management.
A modular research framework engineered to benchmark CNN models across multiple sign language datasets. Featuring a scalable architecture (Factory Pattern), optimized HSV-based hand segmentation, and real-time inference capabilities for edge deployment.
Crush.js is a dataset utility library
Dataset management and caching for AI research benchmarks
HuggingFace Datasets for Elixir - A native Elixir port of the popular HuggingFace datasets library. Stream, load, and process ML datasets from the HuggingFace Hub with full BEAM/OTP integration. Supports Parquet streaming, dataset splitting, shuffling, and seamless integration with Nx tensors for machine learning workflows.
A fast PyQt‑based image annotation tool with customizable hotkeys, per‑folder labels, CSV export, and “next untagged” navigation — ideal for prepping ML training datasets.
Développer une application web interactive permettant à l’utilisateur de créer et gérer des datasets d’images (ex. « chat » ou « chien ») et de tester un modèle de prédiction simulé.
A Django-based data analytics platform providing a RESTful API for dataset management, automated data cleaning, smart statistical insights, and project tracking.
YOLO training toolkit with Claude Code skills — dataset management, experiment tracking, HP tuning via model.tune(), active learning with CVAT, ONNX export. Supports YOLO11 & YOLO26.
Simple project that extract, clean and process a dataset and import the data to a nosql database. Implementation of a simple app to work with.
Dataset Management for LLM Fine-Tuning. Import from files or live capture, organize samples, manage quality, and export for fine-tuning.
🚀 Accelerate GPU programming with cuTile Python, a powerful tool for efficient data processing on NVIDIA GPUs.
Sign language dataset recording and automatic annotation pipeline
Roboflow provides a platform for hosting, managing, and distributing computer vision datasets. This project uses the Roboflow API to automatically download emotion detection datasets in YOLOv11 format.
🚀 Learn efficient CUDA programming with cuTile through hands-on tutorials and benchmarks on key machine learning techniques.