406 results for “topic:qlora”
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
AirLLM: 70B inference with a single 4GB GPU
Accessible large language models via k-bit quantization for PyTorch.
Firefly: a large-model training tool supporting training of Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
Fine-tuning ChatGLM-6B with PEFT | Efficient ChatGLM fine-tuning based on PEFT
🦖 Learn about LLMs, LLMOps, and vector DBs for free by designing, training, and deploying a real-time financial advisor LLM system ~ source code + video & reading materials
ChatGLM-6B finetuning and Alpaca finetuning
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.
🐋 MindChat: a mental-health large language model — chat through life's journey and face its hardships with a smile
Easy and efficient finetuning of LLMs (supports Llama, Llama2, Llama3, Qwen, Baichuan, GLM, Falcon). Efficient quantized training and deployment of large models.
End-to-end generative AI industry projects on LLMs with deployment — awesome LLM projects
🌿 Sunsimiao: a safe, reliable, and accessible Chinese medical large language model
Firefly Chinese LLaMA-2 large model; supports continued pre-training of Baichuan2, Llama2, Llama, Falcon, Qwen, Baichuan, InternLM, Bloom, and other large models
AutoAudit: the LLM for cybersecurity
The official code for "Aurora: Activating Chinese Chat Capability for Mixtral-8x7B Sparse Mixture-of-Experts through Instruction-Tuning"
Finetuning of Falcon-7B LLM using QLoRA on Mental Health Conversational Dataset
Full-parameter fine-tuning, LoRA fine-tuning, and QLoRA fine-tuning of Llama3
Enhancing LLMs with LoRA
LongQLoRA: Extend Context Length of LLMs Efficiently
Qwen models finetuning
Large language model finetuning: BLOOM, OPT, GPT, GPT-2, LLaMA, LLaMA-2, CPM-Ant, and so on
Fine-tuning Falcon-7B and LLaMA-2 with QLoRA to create an advanced AI model with a deep understanding of the Indian legal context
Fine-tuning Chinese large language models with QLoRA, including ChatGLM, Chinese-LLaMA-Alpaca, and BELLE
VerifAI: an initiative to build an open-source, easy-to-deploy generative question-answering engine that can reference and verify answers for correctness (using an a posteriori model)
Use QLoRA to tune LLMs in PyTorch Lightning with Hugging Face + MLflow
Vision-language model finetuning notebooks and use cases (MedGemma, PaliGemma, Florence, …)
From Dialogue Data to a Training Closed Loop: Digital Twin + Model Distillation
Implements pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF) to train and fine-tune the LLaMA2 model to follow human instructions, similar to InstructGPT or ChatGPT, but at a much smaller scale.
Finetune any model on HF in less than 30 seconds
Open-source framework for turning expert knowledge into PII-free synthetic conversational data and production-ready LoRA adapters.
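The common thread through the results above is QLoRA's recipe: freeze the base model in 4-bit precision and train small LoRA adapters on top. As a rough illustration of the quantization half — a sketch, not the bitsandbytes implementation, with approximate level values — blockwise absmax quantization to 16 fixed "NF4" levels looks like this:

```python
# Illustrative QLoRA-style blockwise 4-bit quantization (not the real
# bitsandbytes kernel; the level table below is an approximation of the
# paper's NF4 quantiles). Each block of weights is divided by its absolute
# maximum, then each value is snapped to the nearest of 16 fixed levels;
# dequantization maps indices back to levels and undoes the scaling.

NF4_LEVELS = [
    -1.0, -0.6962, -0.5251, -0.3949, -0.2844, -0.1848, -0.0911, 0.0,
    0.0796, 0.1609, 0.2461, 0.3379, 0.4407, 0.5626, 0.7230, 1.0,
]  # approximate NF4 quantile levels in [-1, 1]

def quantize_block(weights, levels=NF4_LEVELS):
    """Quantize one block: return (4-bit level indices, absmax scale)."""
    scale = max(abs(w) for w in weights) or 1.0
    idx = [min(range(16), key=lambda i: abs(levels[i] - w / scale))
           for w in weights]
    return idx, scale

def dequantize_block(idx, scale, levels=NF4_LEVELS):
    """Map 4-bit indices back to floats and undo the absmax scaling."""
    return [levels[i] * scale for i in idx]

block = [0.12, -0.53, 0.91, 0.0]
idx, scale = quantize_block(block)       # 4 indices + one float scale
restored = dequantize_block(idx, scale)  # lossy reconstruction of block
```

Storing one index (4 bits) per weight plus one scale per block is what shrinks a 7B–70B base model enough to fine-tune on a single consumer GPU; the LoRA adapters themselves stay in higher precision.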