197 results for “topic:pre-training”
The official GitHub page for the survey paper "A Survey of Large Language Models".
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
Papers about pretraining and self-supervised learning on Graph Neural Networks (GNN).
Awesome resources for in-context learning and prompt engineering: mastering LLMs such as ChatGPT, GPT-3, and FlanT5, with cutting-edge, regularly updated content.
Code for TKDE paper "Self-supervised learning on graphs: Contrastive, generative, or predictive"
Unified Training of Universal Time Series Forecasting Transformers
An Open-sourced Knowledgable Large Language Model Framework.
Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
A professional list on Large (Language) Models and Foundation Models (LLM, LM, FM) for Time Series, Spatiotemporal, and Event Data.
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo
Oscar and VinVL
Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
Code for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations".
Large Language Model-enhanced Recommender System Papers
Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
[NeurIPS 2020] "Graph Contrastive Learning with Augmentations" by Yuning You, Tianlong Chen, Yongduo Sui, Ting Chen, Zhangyang Wang, Yang Shen
The repository of ET-BERT, a network traffic classification model for encrypted traffic. The work was accepted at The Web Conference (WWW) 2022.
Multi-modality pre-training
Code for KDD'20 "Generative Pre-Training of Graph Neural Networks"
Code for our SIGKDD'22 paper Pre-training-Enhanced Spatial-Temporal Graph Neural Network For Multivariate Time Series Forecasting.
[NeurIPS D&B 2024] Generative AI for Math: MathPile
Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.
The official repo for [NeurIPS'23] "SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model"
One-click training of your own GPT. Training a GPT has never been easier for beginners. / One-click pre-training + SFT of your own LLM; training a GPT from scratch really can be this simple.
Paper List of Pre-trained Foundation Recommender Models
[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)
[CVPR 2024 Highlight] Visual Point Cloud Forecasting