Jiasen Lu
jiasenlu
Research Scientist @apple
Languages
Repos
58
Stars
2.0k
Forks
494
Top Language
Python
Top Repositories
PyTorch code for our CVPR 2018 paper "Neural Baby Talk"
Implementation of "Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning"
visual dialog model in pytorch
LL3M: Large Language and Multi-Modal Model in Jax
Repositories
58
Adds SPICE metric to coco-caption evaluation server codes
LL3M: Large Language and Multi-Modal Model in Jax
PyTorch code for our CVPR 2018 paper "Neural Baby Talk"
visual dialog model in pytorch
No description provided.
Implementation of "Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning"
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
No description provided.
🧑🚀 Summary of the world's best LLM resources (video generation, agents, coding assistance, data processing, model training, model inference, o1 models, MCP, small language models, vision-language models).
Jax implementation of VIT-VQGAN
CDSSM implementation in torch
No description provided.
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
A deep learning library for video understanding research.
An Extensible Deep Learning Library
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
Machine Learning eXperiment Utilities
A faster pytorch implementation of faster r-cnn
An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.
Model parallel transformers in JAX and Haiku
Task-based datasets, preprocessing, and evaluation for sequence models.
Basic simulation code including AODV, LAR, GRID, and our approach, GAR
Tensors and Dynamic neural networks in Python with strong GPU acceleration
ALFRED - A Benchmark for Interpreting Grounded Instructions for Everyday Tasks
Object detection, 3D detection, and pose estimation using center point detection
Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning
No description provided.
Datasets, Transforms and Models specific to Computer Vision
Multi Task Vision and Language