"topic:deepspeed" — Search

104 results for “topic:deepspeed”

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

codellama, cuda-kernels, deepspeed, fastertransformer, internlm, llama, llama2, llama3, llm, llm-inference, turbomind

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

ai-safety, alpaca, beaver, datasets, deepspeed, gpt, large-language-models, llama, llm, llms, reinforcement-learning, reinforcement-learning-from-human-feedback, rlhf, safe-reinforcement-learning, safe-reinforcement-learning-from-human-feedback, safe-rlhf, safety, transformer, transformers, vicuna

zjunlp/KnowLM

An Open-sourced Knowledgeable Large Language Model Framework.

Python · 1.4k · 133 · Updated 2 days ago

bilingual, chinese, deep-learning, deepspeed, english, gpt-3, instructie, instruction-following, instruction-tuning, instructions, knowlm, language-model, large-language-models, llama, lora, models, pre-trained-language-models, pre-trained-model, pre-training, reasoning

Coobiw/MPP-LLaVA

Personal project: MPP-Qwen14B and MPP-Qwen-Next (multimodal pipeline parallelism based on Qwen-LM). Supports [video/image/multi-image] {sft/conversations}. Don't let poverty limit your imagination! Train your own 8B/14B LLaVA-style MLLM on a 24GB RTX 3090/4090.

Jupyter Notebook · 642 · 32 · Updated 21 hours ago

deepspeed, fine-tuning, mllm, model-parallel, multimodal-large-language-models, pipeline-parallelism, pretraining, qwen, video-language-model, video-large-language-models

LambdaLabsML/distributed-training-guide

Best practices and guides on how to write distributed PyTorch training code

Python · 589 · 67 · Updated 20 hours ago

cluster, cuda, deepspeed, distributed-training, fsdp, gpu, gpu-cluster, kuberentes, lambdalabs, mpi, nccl, pytorch, sharding, slurm

antgroup/glake (Archived)

GLake: optimizing GPU memory management and IO transmission.

Python · 498 · 45 · Updated 2 weeks ago

deepspeed, gpu, llm, memory, onnx, pytorch

shm007g/LLaMA-Cult-and-More

Large Language Models for All, 🦙 Cult and More. Stay in touch!

HTML · 452 · 26 · Updated 1 week ago

alpaca, chatgpt, deepspeed, ggml, gpt, gpt4, gptq, llama, llm, loralib, pytorch, tensorflow, transformers, vicuna

Xirider/finetune-gpt2xl

Guide: Fine-tune GPT2-XL (1.5 billion parameters) and GPT-Neo (2.7B) on a single GPU with Hugging Face Transformers using DeepSpeed

Python · 434 · 74 · Updated 2 weeks ago

deepspeed, finetuning, gpt-neo, gpt-neo-fine-tuning, gpt2, gpt3, huggingface, huggingface-transformers

OpenMOSS/CoLLiE

Collaborative Training of Large Language Models in an Efficient Way

Python · 419 · 58 · Updated 4 weeks ago

deep-learning, deepspeed, nlp, pytorch

openpsi-project/ReaLHF (Archived)

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

Python · 333 · 22 · Updated 3 days ago

deepspeed, distributed-computing, distributed-systems, large-language-models, large-scale-machine-learning, llm, llm-framework, llm-training, megatron-lm, reinforcement-learning, reinforcement-learning-from-human-feedback, transformers

sunzeyeah/RLHF

Implementation of Chinese ChatGPT

Python · 289 · 35 · Updated 3 days ago

chatgpt, deep-learning, deepspeed, glm, nlp, pangu, pytorch

stanleylsx/llms_tool

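A HuggingFace-based tool for training and testing large language models. Supports a web UI and terminal inference for each model, parameter-efficient and full-parameter training (pre-training, SFT, RM, PPO, DPO), and model merging and quantization.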

Python · 223 · 21 · Updated 1 month ago

aquila, aquila2, baichuan, baichuan2, bloom, chatglm, chatglm2, chatglm3, deepspeed, falcon, internlm, llama, llama2, mistral, moss, pytorch, qwen, xverse

bobo0810/LearnDeepSpeed

DeepSpeed tutorials, annotated examples, and study notes (efficient training of large models)

Python · 188 · 5 · Updated 3 weeks ago

deepspeed, examples, large-language-models

git-cloner/llama2-lora-fine-tuning

Llama 2 fine-tuning with DeepSpeed and LoRA

Python · 176 · 15 · Updated 2 months ago

deepspeed, finetuning, llama2, lora

jackaduma/ChatGLM-LoRA-RLHF-PyTorch

A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM

Python · 140 · 9 · Updated 4 months ago

chatglm, chatglm-6b, chatgpt, deepspeed, finetune, gpt, llama, llm, lora, peft, ppo, pytorch, reward-models, rlhf

HomebrewML/revlib

Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload

Python · 132 · 6 · Updated 1 month ago

deep-learning, deepspeed, momentumnet, pytorch, revnet, tpu, xla

CoinCheung/gdGPT

Train LLMs (bloom, llama, baichuan2-7b, chatglm3-6b) with DeepSpeed pipeline mode. Faster than ZeRO/ZeRO++/FSDP.

Python · 97 · 10 · Updated 1 week ago

baichuan2-7b, bloom, chatglm3-6b, deepspeed, flash-attention, full-finetune, llama2, llm, mixtral-8x7b, model-parallization, nlp, pipeline, pytorch

OpenCSGs/llm-inference

llm-inference is a platform for publishing and managing LLM inference. It provides a wide range of out-of-the-box features for model deployment, such as a UI, a RESTful API, auto-scaling, computing resource management, and monitoring.

Python · 92 · 16 · Updated 1 month ago

deepspeed, llama-cpp, llm-inference, ray, transformer, vllm

xyjigsaw/LLM-Pretrain-SFT

Scripts for LLM pre-training and fine-tuning (with or without LoRA and DeepSpeed)

Python · 87 · 16 · Updated 2 weeks ago

baichuan2, deepspeed, large-language-models, llama, lora, mistral

billvsme/train_law_llm

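✏️ A zero-cost starter project for LLM fine-tuning. ⚡️ Train a legal LLM step by step on Colab, based on microsoft/phi-1_5 and chatglm3, covering LoRA fine-tuning and full-parameter fine-tuning.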

Jupyter Notebook · 84 · 12 · Updated 1 month ago

ai, deepspeed, law, llama2, llm, lora, python

glb400/Toy-RecLM

A toy large model for recommender systems based on LLaMA2, SASRec, and Meta's generative recommenders. Also includes notes and experiments on the official implementation of Meta's generative recommenders.

Python · 69 · 6 · Updated 3 weeks ago

actions-speak-louder-than-words, deepspeed, large-language-models, llama2, recommender-system, sasrec

saforem2/l2hmc-qcd

Application of the L2HMC algorithm to simulations in lattice QCD.

Jupyter Notebook · 68 · 9 · Updated 1 month ago

deep-learning, deepspeed, gauge-theory, hamiltonian-monte-carlo, hmc, horovod, hydra, lattice, lattice-qcd, machine-learning, mcmc, monte-carlo, pytorch, tensorflow

jackaduma/Alpaca-LoRA-RLHF-PyTorch

A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Alpaca architecture. Basically ChatGPT but with Alpaca

Python · 61 · 6 · Updated 3 weeks ago

alpaca, chatgpt, deepspeed, finetune, gpt, llama, llm, lora, peft, ppo, pytorch, reward-models, rlhf

argonne-lcf/LLM-Inference-Bench

LLM-Inference-Bench

Jupyter Notebook · 60 · 7 · Updated 3 weeks ago

benchmark, deepspeed, inference, llamacpp, llm, tensorrt-llm, vllm

l294265421/my-llm

All about large language models

52 · 8 · Updated 2 weeks ago

chatgpt, deepspeed, distributed-training, large-language-models

liangyuwang/Tiny-DeepSpeed

Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library