61 results for “topic:llama-cpp-python”
A self-hosted web UI for 30+ generative AI models
Set up and run a local LLM and chatbot using consumer-grade hardware.
Gradio-based tool to run open-source LLMs directly from Hugging Face
Information on optimizing Python libraries for oobabooga to take advantage of Apple Silicon and the Accelerate framework.
An open-source, Gradio-based chatbot app that combines the best of retrieval-augmented generation and prompt engineering into an intelligent assistant for modern professionals.
Pre-built wheels for llama-cpp-python across platforms and CUDA versions
Local character AI chatbot with Chroma vector-store memory and some scripts to process documents for Chroma
GPU-accelerated LLaMA inference wrapper for legacy Vulkan-capable systems: a Pythonic way to run AI with knowledge (ilm) on fire (Vulkan).
Notolog Markdown Editor
Tool for testing different large language models without code.
Experimental interface environment for open-source LLMs, designed to democratize the use of AI. Powered by llama.cpp, llama-cpp-python, and Gradio.
A CUA (computer-use agent) system that uses the Qwen3-VL model (in GGUF format) on Ubuntu computers to perform tasks on your behalf with the keyboard and mouse in a local sandbox environment, based on the commands you provide.
Unofficial Gradio repo for the ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.
A financial chatbot powered by an LLM and retrieval-augmented generation.
A quick, optimized solution to manage llama-based GGUF quantized models: download GGUF files, retrieve message formatting, add more models from HF repos, and more. It's super easy to use and comes prepacked with well-preconfigured open-source models: Dolphin Phi-2 2.7B, Mistral 7B v0.2, Mixtral 8x7B v0.1, SOLAR 10.7B, and Zephyr 3B.
Simple chat interface for local AI using llama-cpp-python and llama-cpp-agent
TAO71 I4.0 is an AI created by TAO71 in Python.
SOLAIRIA is a free tool with minimal dependencies that lets you interact privately with text-generation LLMs of your choice by running offline on your own local hardware.
A comprehensive toolkit for training and running lightweight adapters for GGUF-based language models (ERNIE, Llama, Mistral, Phi-3, etc.) without modifying the base model.
YouTube API implementation with Meta's Llama 2 to analyze comments and sentiment
Clippy resurrected as an AI front end. 📃📎👀
Genshin Impact character chat models tuned via LoRA on LLMs
A Genshin Impact question-answering project powered by Qwen1.5-14B-Chat
Runpod-LLM provides ready-to-use container scripts for running large language models (LLMs) easily on RunPod.
A decoupled, adaptive RAG system ready for deployment and production
Multimodal prompt generator nodes for ComfyUI, designed to generate prompts for QwenImageEdit and Wan2.2. Supports local LLM / local GGUF models (Qwen3-VL, Qwen2.5-VL) and Qwen API for image and video prompt generation and enhancement.
ProfessorConnected is an API-powered tool that helps you discover professors with similar research interests by analyzing their arXiv publications using advanced NLP and vector search techniques.
EPSILON OME TRAINER is a desktop application designed to generate free, unlimited problems similar to those of the Spanish Mathematical Olympiads.
A simple web interface for GGUF-format LLMs run with llama-cpp-python (llama.cpp).
Presage is an API-driven fortune-telling application that uses FastAPI and large language models (LLMs) to analyze users' palm images. Leveraging llm-cpp-server and Lang-Segment-Anything (SAM-Lang), it processes hand images to generate insightful analyses.