12 results for “topic:gpu-scheduling”
A control plane for concurrent LLM RL on shared GPUs
Fully Autonomous AI Research System with Self-Evolution, built natively on Claude Code
Tensor Fusion is a state-of-the-art GPU virtualization and pooling solution designed to maximize GPU cluster utilization.
A tool for examining GPU scheduling behavior.
PipelineScheduler optimizes workload distribution between servers and edge devices, selecting batch sizes that maximize throughput and minimize latency under changing content and unstable networks. It also mitigates resource contention through spatiotemporal inference scheduling that reduces co-location interference.
The GPU Optimizer for ML Models improves GPU performance for machine learning workloads. It provides advanced scheduling, real-time monitoring, and efficient resource management through a web interface and a robust API, and integrates big data technologies for data processing and model optimization. @nvidia
Topology-aware Kubernetes scheduler for multi-tenant, heterogeneous clusters
The **fraud-detection-service** detects fraudulent orders and user activity. Endpoints: `GET /health` (service status), `POST /fraud/check` (check an order for fraud), `GET /fraud/:orderId` (get fraud status for an order). The service reports telemetry for tracing.
A distributed CPU/GPU task scheduler for large-scale batch jobs across thousands of machines. Zero dependencies, sub-millisecond latency.
Transparent suspend/resume runtime enabling preemptible GPU workloads via memory snapshotting, UVM paging, and execution state orchestration.
HPC research toolkit infrastructure for interfacing with and analyzing LLMs (the kit comprises an API gateway service, a GPU scheduler, a model servicer, and a web interface).
Design of a dynamic GPU scheduling architecture for LLM inference tasks, based on KubeAI.