17 results for “topic:cpu-only”
Minimal CPU-only Ollama Docker Image
🦙 chat-o-llama: A lightweight, modern web interface for AI conversations with support for both Ollama and llama.cpp backends. Features persistent conversation management, real-time backend switching, intelligent context compression, and a clean responsive UI.
A high-performance Python library for extracting structured content from PDF documents with layout-aware text extraction. pdf_to_json preserves document structure including headings (H1-H6) and body text, outputting clean JSON format.
An LLM-based content moderator. Firefox extension that blocks webpages unrelated to work, based on page title and URL. Uses local LLMs via Ollama and LangChain so your browsing history never leaves your device, for complete privacy. Google Gemini is also supported.
CPU-only local audio transcription with a browser UI, powered by faster-whisper and run from a single Python script
No description provided.
Image classification with on-device inference, built with Flutter; the AI model runs on the mobile CPU
Ternsig Virtual Mainframe Runtime (TVMR) — extensible VM with 10 standard extensions (121 instructions), Signal ISA, mastery learning, hot-reload firmware, and thermogram persistence.
Face locking system built on ArcFace (ONNX) and 5-point alignment that recognizes a selected identity, locks onto it, tracks facial actions, and records behavior over time.
Probabilistic Signed Distance Fusion with View Planning on CPU
CPU-friendly experience-based reasoning framework combining meta-learning (MAML), state space models (SSM), and memory buffers for fast few-shot adaptation. Pure NumPy implementation for edge devices and low-compute environments.
Pre-built Llama-CPP Wheel for HF Spaces (Python 3.13)
CPU-optimized RAG pipeline reducing latency 2.7× (247 ms → 92 ms). Implements caching, filtering, and quantization for production. Complete with FastAPI, Docker, benchmarks, and investor materials. The engineering showcase that sells itself.
CPU-only RAG stack: PDFs→Docling→Ollama→pgvector. Windows/macOS/Linux. Docker Compose. Graph-aware code search + scanned PDF OCR.
Face detection service with super-fast inference using a nano model
A lightweight reproduction and analysis inspired by recent work on presentation-aware deepfake / spoofing detection, with a focus on codec-induced presentation mismatch (AMR) under CPU-only constraints.
Chat-O-Llama is a user-friendly web interface for managing conversations with Ollama, featuring persistent chat history. Easily set up and start your chat sessions with just a few commands. 🐙💻