142 results for “topic:assemblyai”
The AssemblyAI JavaScript SDK provides an easy-to-use interface for interacting with the AssemblyAI API, which supports async and real-time transcription, audio intelligence models, as well as the latest LeMUR models.
Record, transcribe, and transform voice notes into structured insights. Leverage Whisper or AssemblyAI and ChatGPT to fill in gaps, generate summaries, and visualize ideas — all seamlessly integrated within Obsidian.
Build an Audio AI App with Python and AssemblyAI Course
QuickDigest AI facilitates seamless interaction with various data formats, real-time web search, and creative image generation for advertising
An interactive AI voice agent that can capture and transcribe speech in real-time, generate intelligent responses using the DeepSeek R1 (7B model) AI, and convert the responses back to natural speech for immediate playback. The agent maintains conversation context and supports cross-platform usage on macOS, Linux, and Windows.
The AssemblyAI Java SDK provides an easy-to-use interface for interacting with the AssemblyAI API, which supports async and real-time transcription, audio intelligence models, as well as the latest LeMUR models.
DiscordNPC lets you interact with ChatGPT through a Discord voice channel, enabling a natural conversation.
PACE (Podcast AI for Chapters and Episodes) is a semantic search engine that helps you find the information you need, inter- and intra-podcasts (Project for the AssemblyAI Winter 2022 Hackathon).
Audio transcription UI for OpenAI Whisper, GPT4o Transcribe and AssemblyAI APIs
AI-powered technical interview system with dynamic resume analysis, voice interaction, and automated evaluation reports.
The OpenAPI spec, AsyncAPI spec, and Postman collection for AssemblyAI's APIs
The AssemblyAI C# .NET SDK provides an easy-to-use interface for interacting with the AssemblyAI API, which supports async and real-time transcription, audio intelligence models, as well as the latest LeMUR models.
The AssemblyAI Ruby SDK provides an easy-to-use interface for interacting with the AssemblyAI API, which supports async and real-time transcription, audio intelligence models, as well as the latest LeMUR models.
Transcribe audio using AssemblyAI with Semantic Kernel plugins.
Using voice to keep a journal.
Automated AI-powered space video generator using Node.js, Remotion, Gemini, AWS Polly, AssemblyAI, and NASA APOD API — publishes daily to YouTube with full automation.
The AtlasVoice project aims to assist psychotherapist doctors by introducing a bot assistant and transcription generation. This initiative is designed to minimize the time spent on recorded sessions, allowing professionals to gain valuable insights into their patient interactions more efficiently.
Murf AI's 30 Days of AI Voice Agents Challenge
Transcribe audio on Cloudflare Workers with AssemblyAI, Node.js, and TypeScript
Record voice, transcribe a prompt, picturize the prompt, create variations, get description of a celebrity and upload, other use cases on KB
Your AI-powered smart companion for smarter learning
Python-based system designed to transcribe audio files, split the transcripts into manageable chunks, create text embeddings using HuggingFace models, and employ advanced question-answering models for retrieval-based QA.
Vākya AI — A real-time AI voice agent that listens, understands and talks back . Powered by FastAPI, WebSockets, AssemblyAI, Gemini, and Murf AI.
TagGPT: A simple ChatGPT based multimodal dialog generation engine that can "see/draw" and "hear/speak"
Composite voice agent SDK with no extra infra requirements. Supports browser-native STT/TTS features.
Transform podcast listening with our Podcast Summarizer Project! This innovative tool transcribes audio, extracts key content, and provides user-friendly summaries. The project utilizes AssemblyAI and Listen Notes APIs for transcription and episode details. Simply input an episode ID, click "Download Episode Summary," and experience podcast content
A sleek, user-friendly transcription platform powered by AssemblyAI. Enjoy features like speaker recognition, interactive time-syncing, and professional document exports—all at no cost.
Transcription and translation scripts for Lex Fridman podcast about DeepSeek, at 2025-02-03
HealthMate is an AI-powered voice agent that helps users get clear, reliable answers to health-related questions in real time. It combines FastAPI, AssemblyAI, and React with Retrieval-Augmented Generation (RAG) and VectorDB Cloud (Native) for fast and relevant responses.
Modern Wisdom AI RAG Pipeline