73 results for “topic:fine-tune”
Adapting Meta AI's Segment Anything to Downstream Tasks with Adapters and Prompts
使用预训练语言模型BERT做中文NER
Fine-tune SAM (Segment Anything Model) for computer vision tasks such as semantic segmentation, matting, detection ... in specific scenarios
Scale LLM Engine public repository
Code for finetuning AlexNet in TensorFlow >= 1.2rc0
使用预训练语言模型ALBERT做中文NER
ImageNet pre-trained models with batch normalization for the Caffe framework
Fine-tuning code for CLIP models
Enhancing LLMs with LoRA
A curated list of open source repositories for AI Engineers
[SOTA] [92% acc] 786M-8k-44L-32H multi-instrumental music transformer with true full MIDI instruments range, efficient encoding, octo-velocity and outro tokens
Various installation guides for Large Language Models
Vision Transformers Needs Registers. And Gated MLPs. And +20M params. Tiny modality gap ensues!
A scraper for Substack article text content
BERT based pretrained model using SQuAD 2.0 Dataset for Question-Answering
Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.
DelphiMistralAI wrapper brings Mistral’s text-vision-audio models and agentic Conversations to Delphi, with chat, embeddings, Codestral codegen, fine-tuning, batching, moderation, async/await helpers and live request monitoring.
TensorFlow Implementation of Manifold Regularized Convolutional Neural Networks.
Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.
Domain Randomization Shape Detection
🚂 Fine-tune OpenAI models for text classification, question answering, and more
Training and fine-tuning flan-t5-small model based on provided text
Official Implementation for the paper titled: "Counterfactual Disease Removal and Generation in Chest X-Rays Using Diffusion Models"
Fine tuning LLaMA-2 model on provided text data
Mistral model inference and fine-tune
Flan-t5 model fine tune LoRA and Langchain
Fine-tune wav2vec2-xls-r on data from low-resource-languages
VRAM calculator for Hugging Face models
In this we finetuned the Gemini model with our own medical NER dataset and used to recognize Name Entities
[Bachelor Graduation Project] Use Xception model for face anti-spoofing