"topic:video-representation-learning" — Search

24 results for “topic:video-representation-learning”

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Python · 1.7k · 163 · Updated 2 years ago

action-recognition, mae, masked-autoencoder, neurips-2022, pytorch, self-supervised-learning, transformer, video-analysis, video-representation-learning, video-transformer, video-understanding, vision-transformer

ttengwang/Awesome_Long_Form_Video_Understanding

Awesome papers & datasets specifically focused on long-term videos.

360 · 14 · Updated 5 months ago

audio-visual-event-localization, dense-video-captioning, long-term-video, temporal-action-detection, temporal-action-localization, temporal-sentence-grounding, video-dataset, video-grounding, video-large-language-models, video-llms, video-representation-learning

cvlab-columbia/hyperfuture

Code for the paper Learning the Predictability of the Future (CVPR 2021)

Python · 173 · 26 · Updated 2 years ago

future-predi, hyperbolic-embeddings, self-supervised-learning, uncertainty-modeling, video-representation-learning

xyzforever/BEVT

PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529

Python · 161 · 19 · Updated 3 years ago

action-recognition, bert, deep-learning, foundation-models, masked-autoencoder, pytorch, self-supervised-learning, video-representation-learning, video-understanding

ruiwang2021/mvd

[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)

Python · 135 · 12 · Updated 2 years ago

action-recognition, cvpr2023, masked-autoencoder, self-supervised-learning, video-representation-learning, video-understanding, vision-transformer

GV1028/videogan

Implementation of "Generating Videos with Scene Dynamics" in Tensorflow

Python · 77 · 20 · Updated 8 years ago

generative-adversarial-network, tensorflow, video, video-generation, video-representation-learning

lijun2005/ICCV25-HLFormer

[ICCV 2025] Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning.

Python · 53 · 2 · Updated 6 days ago

cross-modal-retrieval, hyperbolic-learning, iccv2025, lorentz-self-attention, partial-order-alignment, partially-relevant-video-retrieval, prvr, video-representation-learning, video-retrieval, video-text-retrieval

ihaeyong/PFNR

Winning SubNetwork (WSN), Fourier Subneural Operator (FSO), Video-Incremental Learning (VIL), Sequential Neural Implicit Representation (NIR)

Python · 49 · 5 · Updated 1 year ago

continual-learning, implicit-neural-representation, video-representation-learning

sunilhoho/EVEREST

Official PyTorch implementation of EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens [ICML 2024].

Python · 30 · 1 · Updated 1 year ago

masked-autoencoder, representation-learning, self-supervised-learning, video-learning, video-representation-learning

oooolga/JEDi

👆PyTorch Implementation of JEDi Metric described in "Beyond FVD: Enhanced Evaluation Metrics for Video Generation Quality"

Python · 30 · 2 · Updated 1 year ago

computer-vision, fvd, jedi, jepa, mmd, video-generation, video-metrics, video-quality-assessment, video-representation-learning

xiaojieli0903/MaskAgain

Official repository of the “Mask Again: Masked Knowledge Distillation for Masked Video Modeling” (ACM MM 2023)

Python · 27 · 0 · Updated 1 year ago

knowledge-distillation, masked-video-modeling, video-representation-learning

boschresearch/rince (Archived)

This is the code accompanying the AAAI 2022 paper "Ranking Info Noise Contrastive Estimation: Boosting Contrastive Learning via Ranked Positives" (https://arxiv.org/abs/2201.11736). The method lets you use additional ranking information for representation learning.

Python · 25 · 4 · Updated 3 years ago

classification, contrastive-learning, out-of-distribution-detection, paper-resource, representation-learning, self-supervised-learning, video-representation-learning

mondalanindya/MSQNet

Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]

Python · 24 · 1 · Updated 2 years ago

action-recognition, animal-behavior, charades, hmdb51, video-representation-learning, vision-and-language, vision-language, vision-transformer, zero-shot-classification

xiaojieli0903/FGKVMemPred_video

Official repository of the "Fine-grained Key-Value Memory Enhanced Predictor for Video Representation Learning" (ACM MM 2023)

Python · 23 · 0 · Updated 1 year ago

contrative-dictionary-learning, memory-networks, self-supervised-learning, video-representation-learning

gimpong/AAAI25-S5VH

The code for the paper "Efficient Self-Supervised Video Hashing with Selective State Spaces" (AAAI'25).

Python · 22 · 2 · Updated 7 months ago

aaai, aaai2025, contrastive-learning, hashing, mamba-image, mamba-model, self-supervised-learning, ssm, state-space-model, video-hashing, video-representation-learning, video-retrieval

lijun2005/Awesome-Partially-Relevant-Video-Retrieval

A paper list of partially relevant video retrieval.

21 · 2 · Updated 2 days ago

awesome-prvr, cross-modal-retrieval, partially-relevant-video-retrieval, prvr, video-representation-learning, video-text-retrieval

lijun2005/CVPR26-DreamPRVR

[CVPR 2026] Imagine Before Concentration: Diffusion-Guided Registers Enhance Partially Relevant Video Retrieval.

Python · 16 · 0 · Updated 6 days ago

cross-modal-retrieval, cvpr2026, diffusion, partially-relevant-video-retrieval, register, video-representation-learning

Video-MAC/VideoMAC

Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”

Python · 12 · 1 · Updated 2 years ago

convnets, mae, masked-autoencoder, self-supervised-learning, video-representation-learning, video-segmentation

furushchev/chainervr

Chainer implementation of Networks for Learning Video Representations

Python · 7 · 1 · Updated 7 years ago

chainer, deep-learning, video-representation, video-representation-learning

UARK-AICV/Video_Representation

[Asilomar 2022] Contextual Explainable Video Representation: Human Perception-based Understanding

4 · 1 · Updated 3 years ago

temporal-action-detection, temporal-action-localization, video-captioning, video-representation, video-representation-learning

XFeiF/ComputerVision_PaperNotes

📚 Paper Notes (Computer vision)

1 · 0 · Updated 5 years ago

action-recognition, computer-vision, cv, cvpr, eccv, iccv, notes, paper, representation-learning, self-supervised-learning, tpami, video-papernotes, video-representation, video-representation-learning, video-retrieval, video-understanding

Mallory24/cae_dataset

The official repository for creating the causal action–effect (CAE) dataset for the IJCNLP-AACL 2023 paper: Implicit Affordance Acquisition via Causal Action–Effect Modeling in the Video Domain

Python · 1 · 0 · Updated 1 week ago

commonsense-knowledge, language-and-vision, video-representation-learning

Mallory24/cae_modeling

The official repository for the IJCNLP-AACL 2023 paper: Implicit Affordance Acquisition via Causal Action–Effect Modeling in the Video Domain

Python · 1 · 0 · Updated 2 years ago

commonsense-knowledge, language-and-vision, video-representation-learning

mdnuruzzamanKALLOL/VideoMAE_Tensorflow

VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Python · 0 · 0 · Updated 1 year ago

action-recognition, mae, pytorch, self-supervised-learning, tensorflow, tensorflow2, transformer, video-analytics, video-representation-learning, video-transformer, vision-transformer