Repos
91
Stars
70
Forks
11
Top Language
Python
Top Repositories
BrainSurfCNN for individualized prediction of task contrasts from resting-state functional connectivity
Generating brain activation maps from free-form text query
Matlab implementation of photometric stereo
Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio quality • Convert text to Vietnamese speech • Vietnamese text to speech • Vietnamese TTS
Sharp Monocular View Synthesis in Less Than a Second
All notes and materials for the CS229: Machine Learning course by Stanford University
Repositories
91
Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio quality • Convert text to Vietnamese speech • Vietnamese text to speech • Vietnamese TTS
Generating brain activation maps from free-form text query
Sharp Monocular View Synthesis in Less Than a Second
All notes and materials for the CS229: Machine Learning course by Stanford University
No description provided.
Matlab implementation of photometric stereo
Display any CSV (comma separated values) file as a searchable, filterable, pretty HTML table
Convert PDF to markdown quickly with high accuracy
BrainSurfCNN for individualized prediction of task contrasts from resting-state functional connectivity
Port of OpenAI's Whisper model in C/C++
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downloading, vocal isolation (UVR5), Text-to-Speech (Edge-TTS), and multi-language translation. Perfect for content creators and developers.
No description provided.
No description provided.
A demo for Jupyter to Github pages action conversion.
Dumb downloader that scrapes the web
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.
A free & open tool for transcribing audio interviews
Real-time face swap and one-click video deepfake with only a single image
Deep Reinforcement Learning: Zero to Hero!
llama3 implementation one matrix multiplication at a time
No description provided.
An open-source academic paper management tool.
Instant voice cloning by MyShell.
Kolmogorov Arnold Networks
#1 Locally hosted web application that allows you to perform various operations on PDF files
PDF Layout Chunking for LLMs
No description provided.
⬛️ CLI tool for saving complete web pages as a single HTML file
This repo provides the server-side code that the llmsherpa API connects to. It includes parsers for various file formats.
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models