28 results for “topic:decoder-only”
[ICCV 2025] DONUT: A Decoder-Only Model for Trajectory Prediction
Efficient encoder-decoder architecture for small language models (≤1B parameters) with cross-architecture knowledge distillation and vision-language capabilities
Time series forecasting with a decoder-only Transformer, including SwiGLU and RoPE (Rotary Positional Embedding); see the RoPE sketch after this list
Code for paper "Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs for Embodied AI"
🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment
Minimal decoder-only seq2seq pipeline with proper causal masking (see the causal-mask sketch after this list), teacher forcing, Ignite training loop, and checkpointed inference
SAMPO: Scale-wise Autoregression with Motion Prompt for Generative World Models
ViAG: A Novel Framework for Fine-tuning Answer Generation models utilizing Encoder-Decoder and Decoder-only Transformer architectures
A from-scratch implementation of a scaled-down GPT-2 model in PyTorch, trained on the Snappfood dataset for sentiment-controlled Persian text generation.
No description provided.
Developed and pre-trained a 20.39M-parameter Punjabi GPT-style base model from scratch, including corpus preparation, tokenizer training, benchmark evaluation, and text generation, using a cleaned Punjabi corpus and local Apple Silicon GPU acceleration.
Clean-room GPT-2/GPT-3 implementation: tokenizers, architecture blocks, training loop with AdamW + cosine decay, CLI scripts, inference tools, and pytest suite. Covers OpenWebText-10k & WikiText-103 workflows. Designed as an academic reference for understanding and scaling decoder-only transformers
A Chinese-to-English sequence transduction model: a rigorous from-scratch reproduction of the complete "Attention Is All You Need" pipeline for Chinese→English (Zh→En) machine translation.
A compilation of exercises for learning how to implement a transformer model
A mini version of GPT trained on Shakespeare using BPE
Decoder-only transformer, simplest character-level tokenization, training and text generation.
Autoregressive text generation application using a decoder-only transformer
A decoder-only approach to image reconstruction, inspired by adversarial machine learning, implemented in Keras/TensorFlow 2
A compact, readable GPT-style decoder-only Transformer implemented in pure PyTorch. The goal is to expose the essential architectural pieces with minimal scaffolding so you can train and tinker quickly.
Implementation of the GPT-2 architecture using PyTorch, trained on the TinyStories dataset. Features custom training pipelines on Modal (cloud computing) and integration with the Hugging Face ecosystem.
This project is my PyTorch reproduction of PaliGemma, a compact 3B vision–language model that integrates SigLIP vision features with a Gemma decoder. I implemented the full multimodal pipeline from vision encoding to autoregressive text generation to study modern VLM architectures from a research perspective.
🧸 A fully custom GPT-style language model built from scratch in PyTorch and trained on Winnie-the-Pooh! Explored the core mechanics of self-attention, autoregressive text generation, and modular model training, all without relying on any external libraries beyond PyTorch.
This study examines the effectiveness of transformer-based models for financial time series forecasting, specifically focusing on log returns derived from daily closing prices of the DAX40 index. We propose a decoder-only transformer model designed for immediate-term financial time series forecasting: The PatternDecoder.
This repository contains the implementation and experiments for comparing gradual growth methods, specifically the G_stack approach, with naive models trained from scratch. The project focuses on addressing catastrophic forgetting and improving model performance in continuous learning scenarios.
Decoder-only language model trained to generate texts that look like Shakespeare's plays
Decoder-only transformer model for answering short questions using causal self-attention.
Implement a decoder-only Transformer in PyTorch to reverse character sequences using causal masking and cross-entropy loss with Ignite training support
No description provided.
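Several entries above (the Ignite seq2seq pipeline, the character-level and Shakespeare GPTs, the sequence-reversal exercise) share the same core mechanism: causal self-attention. Below is a minimal PyTorch sketch of that pattern; the class name MiniCausalSelfAttention and all hyperparameters are illustrative and not drawn from any listed repository.

```python
# Minimal sketch of causal self-attention, the shared core of the
# decoder-only repositories above. All names/sizes are illustrative.
import math
import torch
import torch.nn as nn
import torch.nn.functional as F

class MiniCausalSelfAttention(nn.Module):
    def __init__(self, n_embd: int = 64, n_head: int = 4, block_size: int = 128):
        super().__init__()
        assert n_embd % n_head == 0
        self.n_head = n_head
        self.qkv = nn.Linear(n_embd, 3 * n_embd)   # fused Q, K, V projection
        self.proj = nn.Linear(n_embd, n_embd)      # output projection
        # Lower-triangular mask: position t may only attend to positions <= t.
        self.register_buffer(
            "mask",
            torch.tril(torch.ones(block_size, block_size)).view(1, 1, block_size, block_size),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        B, T, C = x.shape
        q, k, v = self.qkv(x).split(C, dim=2)
        # Reshape to (B, n_head, T, head_dim) for per-head attention.
        q = q.view(B, T, self.n_head, C // self.n_head).transpose(1, 2)
        k = k.view(B, T, self.n_head, C // self.n_head).transpose(1, 2)
        v = v.view(B, T, self.n_head, C // self.n_head).transpose(1, 2)
        att = (q @ k.transpose(-2, -1)) / math.sqrt(k.size(-1))
        # Future positions get -inf so softmax assigns them zero weight.
        att = att.masked_fill(self.mask[:, :, :T, :T] == 0, float("-inf"))
        att = F.softmax(att, dim=-1)
        y = (att @ v).transpose(1, 2).contiguous().view(B, T, C)
        return self.proj(y)

# Quick check: output shape matches input for a random batch.
x = torch.randn(2, 16, 64)
print(MiniCausalSelfAttention()(x).shape)  # torch.Size([2, 16, 64])
```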
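The SwiGLU/RoPE time-series entry above also names Rotary Positional Embedding. Here is a minimal sketch of the rotation RoPE applies to query/key features, assuming the standard base-10000 formulation; apply_rope and its signature are illustrative, not that repository's API.

```python
# Minimal sketch of RoPE (Rotary Positional Embedding), assuming the
# standard base-10000 formulation; names are illustrative.
import torch

def apply_rope(x: torch.Tensor, base: float = 10000.0) -> torch.Tensor:
    """Rotate query/key features pairwise by position-dependent angles.

    x: (batch, seq_len, head_dim) with head_dim even.
    """
    B, T, D = x.shape
    assert D % 2 == 0
    # One frequency per feature pair: theta_i = base^(-2i/D).
    inv_freq = base ** (-torch.arange(0, D, 2, dtype=torch.float32) / D)
    angles = torch.arange(T, dtype=torch.float32)[:, None] * inv_freq[None, :]  # (T, D/2)
    cos, sin = angles.cos(), angles.sin()
    x1, x2 = x[..., 0::2], x[..., 1::2]  # split features into pairs
    # Apply a 2D rotation to each (x1, x2) pair.
    out = torch.empty_like(x)
    out[..., 0::2] = x1 * cos - x2 * sin
    out[..., 1::2] = x1 * sin + x2 * cos
    return out

q = torch.randn(2, 8, 16)
print(apply_rope(q).shape)  # torch.Size([2, 8, 16])
```

Because the rotation angle depends only on token position, the dot product between rotated queries and keys depends only on their relative offset, which is what makes RoPE attractive for both language modeling and time-series forecasting.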