22 results for “topic:batch-inference”
Supports YOLOv5 (v4.0/v5.0), YOLOR, YOLOX, YOLOv4, YOLOv3, CenterNet, CenterFace, RetinaFace, classification, and UNet. Converts Darknet/LibTorch/PyTorch/MXNet models to ONNX, then to TensorRT.
TorchServe serving a YOLOv5 model in Docker with GPU support and static batching, for production-ready, real-time inference.
Batch LLM Inference with Ray Data LLM: From Simple to Advanced
Analyze and generate unstructured data using LLMs, from quick experiments to billion-token jobs.
PipelineScheduler optimizes workload distribution between servers and edge devices, setting optimal batch sizes to maximize throughput and minimize latency amid content dynamics and network instability. It also addresses resource contention with spatiotemporal inference scheduling to reduce co-location interference.
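The batch-size tuning idea described above can be sketched in a few lines: model per-batch latency as a fixed overhead plus a per-item cost, then pick the largest batch whose latency stays under a service-level objective, since throughput grows with batch size until the latency budget is exhausted. The cost model and all numbers below are illustrative assumptions, not PipelineScheduler's actual scheduler.

```python
def best_batch_size(fixed_ms: float, per_item_ms: float,
                    slo_ms: float, max_batch: int = 64) -> int:
    """Maximize throughput (items/ms) subject to
    latency(b) = fixed_ms + per_item_ms * b <= slo_ms."""
    best_b, best_tput = 1, 0.0
    for b in range(1, max_batch + 1):
        latency = fixed_ms + per_item_ms * b
        if latency > slo_ms:
            break  # larger batches would violate the latency SLO
        tput = b / latency
        if tput > best_tput:
            best_b, best_tput = b, tput
    return best_b

# With 10 ms fixed overhead, 2 ms/item, and a 50 ms SLO,
# the largest admissible batch is 20 items.
print(best_batch_size(fixed_ms=10.0, per_item_ms=2.0, slo_ms=50.0))  # -> 20
```

Under this simple affine cost model throughput is monotonically increasing in batch size, so the SLO boundary is always optimal; real schedulers also account for queueing delay and co-location interference.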
Torchfusion: a highly opinionated PyTorch inference layer built on DataFusion.
Ray Saturday Dec 2022 edition
Serves PyTorch inference requests with Redis-backed batching for higher throughput.
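The batching pattern behind this kind of server can be sketched without any external services: drain whatever requests are pending (up to a cap) and run them through the model as one batch. An in-process `queue.Queue` stands in for Redis here, and `run_model` is a placeholder, so this is a minimal sketch of the pattern, not the repo's implementation.

```python
import queue

def drain_batch(q: "queue.Queue", max_batch: int = 8) -> list:
    """Collect up to max_batch pending requests without blocking."""
    batch = []
    while len(batch) < max_batch:
        try:
            batch.append(q.get_nowait())
        except queue.Empty:
            break  # no more pending requests; run what we have
    return batch

def run_model(batch: list) -> list:
    # Placeholder "model": uppercase each request payload.
    return [item.upper() for item in batch]

q = queue.Queue()
for payload in ["a", "b", "c"]:
    q.put(payload)
print(run_model(drain_batch(q)))  # -> ['A', 'B', 'C']
```

Batching amortizes per-call overhead (GPU kernel launches, model dispatch) across many requests, which is where the throughput gain comes from.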
Support batch inference of Grounding DINO. "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
LightGBM Inference on Datafusion
Simple Ollama JSONL batch inference tool.
This repository provides sample code showing how to use AutoML image classification and object detection in the Azure ML (AML) environment.
Neural network classifier with training, evaluation, calibration, and prediction using PyTorch.
Production-style batch ML pipeline for manufacturing claims anomaly detection with drift checks (synthetic data).
This repo simulates how an ML model moves to production in an industry setting. The goal is to build, deploy, monitor, and retrain a sentiment analysis model using Kubernetes (minikube) and FastAPI.
Batch inference for a lead-scoring task using PySpark.
MLOps project that recommends movies to watch implementing Data Engineering and MLOps best practices.
sdkgenai: Gen AI SDK covering model parameters, safety filters, multi-turn chat, content streaming, asynchronous requests, token counting, context caching, function calling, batch prediction, and text embeddings.
End-to-end retail sales forecasting using LightGBM with time-series features, SHAP explainability, FastAPI inference, Streamlit demo, and CI for production-ready ML workflows.
Production-grade customer segmentation pipeline built on Azure (Blob Storage, Data Factory, Azure ML, Batch Endpoint). Includes end-to-end data engineering, feature engineering, K-Means model training, and scalable batch inference.
Indian Data Club: Databricks 14-Days Challenge-2 is designed to help beginners build a strong foundation in Databricks through daily learning, hands-on practice, and problem solving.
🚀 Process JSON data in batches with `llm-batch`, leveraging sequential or parallel modes for efficient interaction with LLMs.
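Batch-processing JSONL for an LLM, as the tools above describe, reduces to splitting the records into chunks and submitting each chunk. The sketch below uses only the standard library; `call_llm` is a hypothetical stand-in for a real model call, not the `llm-batch` API.

```python
import json

def iter_batches(jsonl_text: str, batch_size: int = 2):
    """Parse JSONL and yield successive batches of records."""
    records = [json.loads(line)
               for line in jsonl_text.splitlines() if line.strip()]
    for i in range(0, len(records), batch_size):
        yield records[i:i + batch_size]

def call_llm(batch: list) -> list:
    # Hypothetical stand-in for an LLM call: report each prompt's length.
    return [len(r["prompt"]) for r in batch]

jsonl = '{"prompt": "hi"}\n{"prompt": "hello"}\n{"prompt": "hey"}'
results = [call_llm(b) for b in iter_batches(jsonl)]
print(results)  # -> [[2, 5], [3]]
```

In a parallel mode, each batch from `iter_batches` could be handed to a worker pool instead of processed sequentially.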