"topic:real-time-inference" — Search

65 results for “topic:real-time-inference”

Human action classification system with pose-based (MediaPipe) and video-based (3D CNN) models. Features 100+ architectures for real-time pose classification and temporal models pretrained on UCF-101/HMDB51.

Python25566Updated 1 week ago

computer-vision convolutional-neural-networks deep-learning hmdb51 human-action-recognition mediapipe mobilenetv3 pose-classification pose-estimation pytorch r2plus1d r3d-18 real-time-inference timm transfer-learning ucf-101 video-analysis video-classification video-classification-models video-recognition

securade/sentinel

Securade.ai Sentinel - A monitoring and surveillance application that enables visual Q&A and video captioning for existing CCTV cameras.

Python277Updated 4 weeks ago

artificial-intelligence automated-surveillance blip cctv computer-vision generative-ai live-video-analytics machine-learning real-time-inference rtsp-stream safety-monitoring surveillance video-analytics video-monitoring vilt visual-captioning visual-large-language-models visual-question-answering vlm worker-safety

ReverendBayes/YOLO11m-Car-Damage-Detector

Custom YOLO11m model for detecting and classifying car body damage (99% shattered glass, 96% flat tire detection accuracy)—optimized for high-capacity inference and assistive use in inspection and service workflows like BMW pre-loaner inspections.

Jupyter Notebook273Updated 1 day ago

ai-in-automotive anomaly-detection car-inspection car-insurance cnn computer-vision convolutional-neural-networks deep-learning googlecolab image-segmentation insurance-claims object-detection oem-data-tool pretrained-model pytorch real-time-inference ultralytics yolo11 yolo11-detection

irfan112/yowov3-multistreaming-inferencing

Real-time multi-stream inference with YOWOv3 (spatio-temporal action detection) using the UCF101-24 dataset. The repo is an extension of https://github.com/Hope1337/YOWOv3, https://arxiv.org/pdf/2408.02623

Python251Updated 3 weeks ago

action-recognition human-action-recognition i3d live-streaming multistreaming real-time-inference spatiotemporal-detection video-understanding yowov3

irfan112/YOLO-TLP

YOLO-TLP: detects and classifies tiny objects with bounding box dimensions smaller than 15 pixels, outperforming other one-stage detectors. Maximum resolution for target observation in real-time applications.

Python211Updated 1 week ago

lightweight-cnn real-time-inference small-object-detection surveillance tiny-object-detection visdrone yolo yolo-tlp

aimaster-dev/Blood-Brain-Detect-System

This project uses YOLO for real-time leukemia detection in blood samples and CNNs for classifying brain hemorrhages in MRI scans. It aims to support faster, more accurate medical diagnostics through deep learning.

Python120Updated 6 months ago

blood-analysis classification cnn convolutional-neural-networks deep-learning diagnostics early-detection healthcare healthcare-automation hemorrhage-detection leukemia-research medical-ai medical-imaging mri object-detection python real-time-inference yolo

RizwanMunawar/visionusecases

Explore a wide range of computer vision projects and documentation covering everything from object detection, image segmentation, and tracking to pose estimation, object counting, and automated annotation. These resources highlight real-world AI applications built with modern models like Ultralytics YOLO, Meta SAM 2, and other VLMs.

122Updated 1 month ago

artificial-intelligence computer-vision deep-learning instance-segmentation object-detection object-tracking pose-estimation real-time-inference ultralytics

tinh2044/YOLO12-UnderWater

YOLOv12 Underwater Object Detection is an open-source suite for underwater object detection, built on YOLOv12. It offers an end-to-end pipeline with GPU-accelerated training, customizable data augmentations, real-time inference via Gradio, and support for model export (ONNX & PyTorch).

Python110Updated 1 month ago

attention-mechanisms brackish-water computer-vision data-augmentation deep-learning environmental-adaptation evaluation-metrics gpu-acceleration gradio image-processing mixed-precision-training model-export object-detection onnx pytorch real-time-inference training-pipeline underwater-imaging underwater-object-detection yolov12

dyneth02/MoveNet-Multipose-Detection-OpenCV

A real-time multi-person human pose estimation system using TensorFlow MoveNet Multipose (Lightning). Built with OpenCV for video and webcam inference, it detects and visualizes keypoints and skeletal connections with confidence-based filtering, optimized for speed and multi-person scenarios.

Jupyter Notebook80Updated 2 weeks ago

ai-engineering centernet computer-vision deep-learning feature-pyramid-network gpu-acceleration human-pose keypoint-detection ml-project mobilenetv2 movenet multiperson-tracking opencv pose-estimation python real-time-inference tensorflow tensorflow-hub video-processing webcam-inference

philtimmes/KeSSie

KeSSie HUGE Context Semantic recall for Large Language Models

Python60Updated 2 weeks ago

context-window cuda enterprise-ai gpu-optimization high-throughput inference-optimization kv-cache large-language-models linear-serialization llm-inference long-term-memory lossless-compression memory-efficiency real-time-inference rocm state-inference state-management transformer-architecture vllm vram-optimization

tomash-dev/Blood_Brain

Python61Updated 7 months ago

anhphan2705/im_pidnet

Improved PIDNet for real-time semantic segmentation. Work in progress.

Python61Updated 9 months ago

computer-vision inference mmseg mmsegmentation pidnet real-time real-time-inference real-time-semantic-segmentation semantic-segmentation

senthree3/THYX

THYX is an edge AI video analysis platform for AIoT, featuring behavior recognition, intelligent alerts, and real-time management. Modular, high-performance, and lightweight — ideal for smart security and industrial scenarios.

HTML60Updated 3 months ago

aibox aiot behavior-recognition computer-vision edge-ai object-detection real-time-inference smart-security video-analysis

GBR-RL/VisionSort-RPi

A conveyor belt sorting system powered by Raspberry Pi and YOLOv8 for real-time object detection.

Python52Updated 1 month ago

automation computer-vision diy-project hailo hailo-ai object-detection raspberry-pi real-time-inference yolov8

asigatchov/vball-net

Volleyball tracking - VballNet is a specialized deep learning framework designed for volleyball tracking, built upon the foundation of TrackNetV4. This repository includes two primary models, VballNetV1 and VballNetFastV1

Python51Updated 1 day ago

ball motion-detection real-time-inference sports-analytics tracking trackining tracknet volleyball-tracking

umitkacar/onnx-tensorrt-optimization

40x faster AI inference: ONNX to TensorRT optimization with FP16/INT8 quantization, multi-GPU support, and deployment

Python40Updated 1 month ago

cuda deep-learning edge-computing fp16 gpu-acceleration inference-acceleration int8 latency-optimization mlops model-deployment model-optimization nvidia-gpu onnx onnxruntime production-ai pytorch-to-onnx quantization real-time-inference tensorflow-to-onnx tensorrt

OleGustavDahlJohnsen/project-agora

Project Agora: MVP of the Concordia framework. An ethical, symbiotic AI designed to foster and protect human flourishing.

Python40Updated 3 months ago

ai-ethics ai-governance apple-intelligence casual-traceability context-aware-ai distributed-ai edge-ai ethical-artificial-intelligence human-ai-symbiosis multimodal-perception on-device-ml orchid-protocol real-time-inference sanctum-protocol secure-ai sensor-fusion sensormesh shofar2 symbiotic-ai trust-horizon

Nishant1998/PlantAi

PlantAi is a ResNet-based CNN model trained on the PlantVillage dataset to classify plant leaf images as healthy or diseased. This repository includes PyTorch training code, tools to convert the model to TensorFlow Lite (TFLite) for deployment, and an Android app integrating the model for real-time leaf disease detection from camera images.

Java42Updated 1 month ago

agriculture-ai android cnn computer-vision cpu-inference deep-learning deep-neural-networks image-classification java onnx pytoch real-time-inference resnet tflight

GrahamPellegrini/Machine-Learning-Noise-Cancellation

Bachelor Final Year Project exploring real-time speech denoising using machine learning. Compares classical methods (SS, WF, MMSE-LSA) with 5 deep models on spectrogram data, highlighting Conv-TasNet’s effectiveness. Features dataset bucketing, OOM mitigation, and batch evaluation.

TeX21Updated 1 month ago

classical-vs-deep-learning conv-tasnet denoising machine-learning oom-optimization real-time-inference spectrogram spectrograms speech-enhancement

beingdhruvv/ImageSharpening-KD-Restormer-UNet

This repository features an image sharpening pipeline using Knowledge Distillation. A high-capacity Restormer acts as the teacher model, while a lightweight Mini-UNet is trained as the student to mimic its performance.

Jupyter Notebook22Updated 4 months ago

computer-vision deep-learning div2k gpu-training image-processing image-restoration image-sharpening knowledge-distillation model-compression motion-deblurring perceptual-loss python3 real-time-inference restormer ssim training-pipeline unet-pytorch vgg-loss

KarthikSriramGit/Vision-Karts

AI-powered automated checkout system using YOLO11 object detection and face recognition to eliminate queues and reduce checkout times at retail facilities

Python20Updated 1 month ago

camera computer-vision deep-learning edge-ai face-recognition gpu-acceleration image-processing object-detection onnx opencv python pytorch qr-authentication real-time-inference smart-retail tensorrt ultralytics yolo yolo11

AdnanSattar/Spatial-RAG-Worldmodel

A Spatial Retrieval-Augmented Generation system for latent world models, designed for embodied spatial intelligence in robotics, autonomous navigation, and embodied AI. Features ROS2 integration, real-time inference @ 25Hz, and complete robot build guide.

Python21Updated 1 day ago

autonomous-robots computer-vision docker embodied-ai encoder-decoder fastapi latent-space nextjs pytorch raspberry-pi real-time-inference ros2-humble spatial-computing spatial-memory world-models

Muhd-Uwais/EchoID

Deep voice speaker recognition system built with Keras CNNs. Educational project featuring audio augmentation, mel-spectrogram processing, and real-time inference. Binary classification for beginners.

Jupyter Notebook11Updated 1 month ago

audio-classification audio-processing cnn deep-learning educational keras machine-learning mel-spectrogram python real-time-inference speaker-recognition

raghavrajsah/tethyr

Autonomous AI vision agent with contextual memory for real-time spatial computing | Multi-modal LLM reasoning + YOLO grounding + persistent AR overlays | Next-gen hands-free AI interface

Python10Updated 3 months ago

ai-agents computer-vision llm multimodal-ai real-time-inference spatial-computing

AksharKher-30/Real-time-face-mask-detection

Real-time Face Mask Detection using YOLOv8 — trained on custom dataset with live webcam inference.

Jupyter Notebook10Updated 8 months ago

computer-vision deep-learning face-mask-detection mps object-detection opencv real-time-inference transfer-learning ultralytics yolov8

Graiphic/Nest

Sovereign AI for smart, self-learning buildings • Edge-native • 25–35 % energy savings

10Updated 3 months ago

adaptive-systems autonomous-control building-automation digital-twin edge-computing embedded-ai energy-efficiency energy-optimization generative-ai green-tech hvac-control occupancy-modeling onnx-runtime predictive-modeling real-time-inference reinforcement-learning rl-control smart-building sustainability thermal-dynamics

markolivaic/SiteSafety-YOLO

Real-time HSE compliance monitoring system using YOLOv8. Designed for Edge AI deployment to automate industrial safety protocols and personnel detection.

Jupyter Notebook10Updated 2 months ago

computer-vision edge-ai python real-time-inference yolov8

muthusamir/YOLOE-Unified

YOLOE-Unified is a novel framework that integrates YOLOE with distilled CLIP, runtime SAM refinement, and TensorRT optimization for efficient open-vocabulary object detection and instance segmentation on edge devices (Jetson Orin, etc.).

Python10Updated 1 month ago

computer-vision edge-deployment instance-segmentation multimodal-fusion open-vocabulary-detection real-time-inference tensorrt-optimization zero-shot-learning

codaley/objectdet-tflitemm

Custom object detection project using TensorFlow Lite Model Maker with an EfficientDet-Lite2 backbone. Trained to detect rocks and bags, deployed to Android for real-time inference on a Pixel 7a. Focused on efficient edge-device performance and streamlined model integration.

Jupyter Notebook10Updated 3 weeks ago

android-app computer-vision dataset-preparation edge-ai efficientdet-lite2 machine-learning object-detection real-time-inference tensorflow-lite

SD7Campeon/Comment-Toxicity-Detection-and-Classification

LLM-inspired BiLSTM pipeline for real-time, multi-label toxicity inference across adversarial discourse modalities.

Jupyter Notebook10Updated 10 months ago

affective-computing bilstm contextual-nlp deep-sequential-model discourse-analysis keras-tensorflow llm multi-label-classification nlp nlp-pipeline real-time-inference sklearn subword-tokenization text-vectorization toxicity-analysis toxicity-classification toxicity-detection toxicity-prediction transformer
