158 results for “topic:sft”
Use PEFT or full-parameter training to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...) (AAAI 2025).
BISHENG is an open LLM DevOps platform for next-generation enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, unified model management, evaluation, SFT, dataset management, enterprise-level system management, observability, and more.
Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!
https://adongwanai.github.io/AgentGuide | AI agent development guide | LangGraph in practice | Advanced RAG | Transitioning into LLM work | LLM interviews | Algorithm engineering | Interview question bank | Reinforcement learning | Data synthesis
A simple, performant, and scalable JAX LLM!
ChatGLM-6B fine-tuning and Alpaca fine-tuning
GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation
An open-source solution for full-parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to inference, as well as practical experiences and conclusions.
Cornucopia (聚宝盆): open-source, commercially usable Chinese financial LLMs, with an efficient, lightweight training framework for vertical-domain LLMs (pretraining, SFT, RLHF, quantization, etc.)
Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"
A deep learning NLP repository using TensorFlow, covering everything from text preprocessing to downstream tasks for recent models such as topic models, BERT, GPT, and LLMs.
Awesome-RAG: a collection of typical RAG papers and systems.
Ethereum Semi-Fungible Standard (ERC-1155)
Qwen3 Fine-tuning: Medical R1 Style Chat
ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory [COLM2025]
CosyVoice_DPO_NOTES: supercharge your CosyVoice model with cutting-edge DPO fine-tuning!
This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.
Insanely fast LLM pre-training and fine-tuning for modern NVIDIA GPUs. Enterprise-grade LLMOps.
ERC-3525 Reference Implementation
Build LLM from scratch
Fine-Tuning Dataset Auto-Generation for Graph Query Languages.
🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
[EMNLP 2024 Findings] SEA is an automated paper-review framework that generates comprehensive, high-quality, and highly consistent review feedback, helping researchers improve the quality of their work.
🎓 A systematic course on building large language models | 🛠️ Covers pretraining data engineering, tokenizers, Transformers, MoE, GPU programming (CUDA/Triton), distributed training, scaling laws, inference optimization, and alignment (SFT/RLHF/GRPO) | 🚀 6 progressive, code-driven assignments building a full-stack understanding of LLMs
AI agent simulation framework
Salesforce AI Research's open diffusion language model
[AIGC hands-on introductory notes — the AIGC skyscraper] Practical notes and experience on large language models (LLMs), efficient fine-tuning (SFT), retrieval-augmented generation (RAG), agents, automatic PPT generation, role-playing, text-to-image (Stable Diffusion), OCR, speech recognition (ASR), speech synthesis (TTS), portrait segmentation, multimodality (VLM), AI face swapping, text-to-video, image-to-video (SVD), AI motion transfer, AI virtual try-on, digital humans, omni-modal understanding, AI music generation, and more.
MOSS chat fine-tuning
Code repository dedicated to experimentation and research with tiny reasoning language models
SFT+RL boosts multimodal reasoning
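Several entries above (the CosyVoice DPO notes, the vision-LLM SFT/RLHF/DPO codebase, the SFT+RL work) revolve around DPO-style preference tuning. As a minimal sketch — not the implementation used by any repo listed here — the per-pair DPO loss can be written in plain Python from the log-probabilities of the chosen and rejected responses under the policy and a frozen reference model:

```python
import math

def dpo_loss(beta, policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp):
    """DPO loss for a single preference pair (illustrative sketch).

    Inputs are summed log-probabilities of the chosen/rejected responses
    under the trainable policy and the frozen reference model.
    """
    # Implicit reward margin: how much more the policy prefers the chosen
    # response over the rejected one, relative to the reference, scaled by beta.
    margin = beta * ((policy_chosen_logp - ref_chosen_logp)
                     - (policy_rejected_logp - ref_rejected_logp))
    # -log(sigmoid(margin)), written via the identity log1p(exp(-margin)).
    return math.log1p(math.exp(-margin))
```

At a margin of zero the loss is log 2 ≈ 0.693; training drives the margin up and the loss toward zero. Real trainers compute this batched over token-level log-probs on GPU, but the objective itself is just this one expression.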