264 results for “topic:gemini-flash”
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
Deploy your private Gemini application for free with one click, supporting Gemini 1.5, Gemini 2.0 models.
A desktop application that extracts YouTube playlist transcripts and enhances them using Google's Gemini AI models. The output is a book in any language you want.
Vanilla JS web interface for Gemini 2.0 flash-exp Multimodal API with text, audio, camera, screen inputs and audio responses and function calling
Co-create PowerPoint slide decks with AI
Simplified Gemini for Claude Code.
🦀 A Pure Rust Framework For Building AGI (WIP).
A lightweight Python API wrapper and CLI for Google’s Gemini language models.
Autospec is an open-source AI agent that takes a web app URL and autonomously QAs it, and saves its passing specs as E2E test code
Transcribe audio and video files with speaker diarization and logically grouped timestamps using Gemini Flash
Gemini Pro: An AI-powered Telegram bot script for generating text and image-based responses using Gemini AI
更接近 Copilot ,您的智能生活和工作副驾 (o゜▽゜)o☆
A NetBeans plugin that allows Google Gemini to do all your work on the worlds best Java IDE. Get a wireless headset, lay on your bed and tell the model to turn on your tv and stream NetBeans onto it while you approve and deny diffs on the diff viewer. Try not to drink too much alcohol if the model goes way faster than your thoughts.
An AI Discord Bot leveraging Googles Gemini 1.5 Model & Prodia!
This project enables real-time streaming of audio (and optionally video or screen captures) from your local device to Google Gemini using the Live API. It allows you to interact with Gemini through both text and voice, supporting conversational AI responses.
AI agent for creating personalized digests of research papers
A Multi-Agent based application which provides a comphrehensive financial/market analysis of any company
🤯 Go VIRAL! This AI-powered YouTube Clipper AUTOMATICALLY finds & creates your next hit videos with smart face tracking & word-by-word captions. Stop wasting time, start trending! 🚀
This project provides a custom keyboard for iPhones, built using UIKit, and a companion app built with SwiftUI for customization options. The keyboard features a built-in search bar that integrates with the Gemini-1.5 flash API to quickly answer questions or generate content for commenting or replying.
Autonomous AI Agent for the JVM: shell, files, google search, runs any LLM generated java code on the jvm itself. JIT compilation and a child-first classloader with ANY classpath. Comes wit a swing GUI , can be "dropped in" into any existing java application. Easy tools easy pojos. IoT devices. Run it standalone for a pure-java AI assistant.
T20: Multi-Agent. Orchestrator-delegate model. TAS. Goal -> plan -> delegate. Agents: Gemini family. Autonomous, traceable. Logs sessions. CLI. Usage: `t20-cli "goal string"`. Artifacts of high value.
🚀 Transform plain Vietnamese into powerful SQL queries with AI. This tool allows you to interact with your PostgreSQL database using everyday language—no SQL expertise required.
This repository contains a transformer-based model for real-time American Sign Language (ASL) recognition. The model leverages transformer architecture to interpret ASL gestures and utilizes the Gemini-Pro LLM API for constructing sentences from recognized ASL signs.
AI-powered flashcard generator built with React and Google Gemini . Create and customize quiz content seamlessly for an interactive learning experience.
Google Gemini Voice/Vision Assistant with gemini-1.5-pro / gemini-1.5-flash modal ! #Gemini 1.5 Flash #Gemini 1.5 Pro
NeoExamShield
A Streamlit-based chatbot application using Gemini models for NLP. Features include light/dark mode toggle, model selection (Gemini 1.5 Flash, 1.5 Pro, 1.0 Pro), adjustable parameters (temperature, top-p, top-k, max tokens), secure API key input, and an interactive chat interface with history.
This repository provides a framework to integrate internet search capabilities with a Language Learning Model (LLM), specifically using Gemini 1.5 API. This allows the LLM to fetch and use real-time data from the internet to enhance its responses to user queries.
⚡Powered by Goggle TPUs and the latest (Aug 27, 2024) Gemini 1.5 Pro and Flash Models to generate high-quality engineered prompts, analyze text and images, and create datasets for fine-tuning AI models, helping you to become a prompt engineering pro
This repository contains the VS Code extension for the main project, GitPilot. You can find the main repository here: https://github.com/InflixOP/GitPilot