93 results for “topic:on-device”
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
On-device wake word detection powered by deep learning
Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.
On-device Speech-to-Intent engine powered by deep learning
On-device voice assistant platform powered by deep learning
100M parameter lightweight conversational text-to-speech model with breaths, laughter, multi-speaker dialogue, voice cloning, and streaming. Llama-based, on-device.
On-device speech-to-text engine powered by deep learning
open-source healthcare ai
AubAI brings you on-device gen-AI capabilities, including offline text generation and more, directly within your app.
电子鹦鹉 / Toy Language Model
On-device voice activity detection (VAD) powered by deep learning
On-device CocoaLumberjack console with support for search, adjust levels, copying and more.
Precision genomics for everyone, everywhere. Powered by private AI.
The baseline project for inferencing various Pose Estimation tflite models with TFLiteSwift on iOS
Convmelspec: Convertible Melspectrograms via 1D Convolutions
Local speech-to-text for macOS — on-device AI, fully private, no cloud
On-device noise suppression powered by deep learning
On-device property graph database. Schema-as-code. One CLI → One Folder. No Server. Think: DuckDB for graphs.
Eris is a private AI chat application that runs entirely on your device using Apple's MLX framework. Named after the dwarf planet that challenged our understanding of the solar system, Eris challenges the notion that AI must live in the cloud.
On-device speaker diarization powered by deep learning
Personalized machine learning on the smartphone
Real-time speech enhancement mobile app using Nested U-Net
Official repo for Jesture AI SDK: Real-time On-device Hand Gesture Control
Custom implementation of Apple Intelligence features
Push-to-talk voice dictation for macOS using Whisper
Unity package for using Spark-TTS on-device models. This is a C# port of https://github.com/SparkAudio/Spark-TTS by SparkAudio team and uses converted ONNX models instead of the PyTorch models in the original repo
A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Open-source, local-first mobile app retention and engagement automation with agent-driven workflows.
LiveTalk is a unified, high-performance talking head generation system that combines the power of LivePortrait and MuseTalk open-source repositories. The PyTorch models from these projects have been ported to ONNX format and optimized for CoreML to enable efficient on-device inference in Unity.
Chat with on-device LLM's on iPhone