Vishal

mvish7

Passionate about multi-modal embodied AI. Highly skilled practitioner of Computer Vision, Natural Language Processing and Sensor Fusion.

Germany

Languages

Python 68%, Jupyter Notebook 18%, C++ 9%, CMake 5%

Top Repositories

AlignVLM

This repository contains an implementation of the AlignVLM paper, which proposes a novel method for vision-language alignment.

14 · Python

dycoke_token_compression

This repo integrates DyCoke's token compression method with VLMs such as Gemma3 and InternVL3

5 · Python

Udacity_SensorFusion_Nanodegree

This repository contains all completed projects from Udacity's Sensor Fusion Nanodegree (SFND).

2 · C++

GPT_Playground

This project implements a toy example of GPT-2 with additional bells and whistles like Mixture-of-Experts and Mamba blocks.

1 · Python

Identity_protection_with_emoji

This repository contains code to hide faces in videos with emojis.

1 · Python

DFQ-VLA

Home of FastFlexQwen aka DFQ VLA.

0 · Python

Repositories

mvish7/DFQ-VLA

Home of FastFlexQwen aka DFQ VLA.

Python · 0 · 0 · Updated 8 hours ago

autonomous-driving, embodied-ai, vision-language-action-model

mvish7/VQ-VAE

Contains the implementation and artifacts of a VQ-VAE built upon NVIDIA's PhysicalAI AV dataset.

Python · 0 · 0 · Updated 2 weeks ago

autonomous-driving, embodied-ai, vae, vla

mvish7/AlignVLM

This repository contains an implementation of the AlignVLM paper, which proposes a novel method for vision-language alignment.

Python · 14 · 0 · Updated 9 months ago

huggingface-transformers, multimodality, smolvlm, vision-language-alignment, vision-language-model, vision-language-pretraining

mvish7/dycoke_token_compression

This repo integrates DyCoke's token compression method with VLMs such as Gemma3 and InternVL3

Python · 5 · 0 · Updated 4 months ago

inference-optimization, token-compression, video-large-language-models, vlms

mvish7/GPT_Playground

This project implements a toy example of GPT-2 with additional bells and whistles like Mixture-of-Experts and Mamba blocks.

Python · 1 · 0 · Updated 7 months ago

mvish7/ml-depth-pro-knowledge-distil

Training (nano) DepthPro with knowledge distillation

Python · 0 · 0 · Updated 9 months ago

mvish7/ml-hypersim (Fork)

Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding

0 · 0 · Updated 1 year ago

mvish7/stanford-cs224n-nlp-win23

This repo contains lecture notes and assignments from Stanford's CS224N NLP course.

Jupyter Notebook · 0 · 0 · Updated 1 year ago

cs224n, deep-learning, llms, nlp, pytorch, stanford, trasnformers

mvish7/Llama3_LoRA_from_scratch

This project implements LoRA/QLoRA from scratch to fine-tune Llama3.

Python · 0 · 0 · Updated 1 year ago

mvish7/tokenizers

A project aimed at understanding components of LLM tokenizers

Jupyter Notebook · 0 · 0 · Updated 1 year ago

mvish7/Udacity_SensorFusion_Nanodegree

This repository contains all completed projects from Udacity's Sensor Fusion Nanodegree (SFND).

C++ · 2 · 0 · Updated 5 years ago

camera, cmake, cpp11, kalman-filter, lidar, radar, sensor-fusion

mvish7/Identity_protection_with_emoji

This repository contains code to hide faces in videos with emojis.

Python · 1 · 0 · Updated 5 years ago

mvish7/deep_learning_handson

Repository containing work done while taking hands-on deep learning lessons.

Python · 0 · 0 · Updated 5 years ago

d2l-ai, deep-learning, pytorch

mvish7/Realization-of-Computer-Vision-Algorithms

This repository contains the implementation of CV algorithms in C++

C++ · 0 · 0 · Updated 5 years ago

mvish7/Raspberry-Pi-based-home-surveillance

Employed motion detection for pi-cam based surveillance

Python · 0 · 0 · Updated 5 years ago

mvish7/lane_line_detection

This repository contains the lane detection project from Udacity's Self-Driving Car Nanodegree.

Python · 0 · 0 · Updated 5 years ago

computer-vision, lane-lines-detection, self-driving-car

mvish7/Instance-Segmentation-with-Mask-RCNN

Employed Transfer Learning for instance segmentation using Mask RCNN

Python · 0 · 0 · Updated 5 years ago

instance-segmentation, mask-rcnn, transfer-learning

mvish7/YOLO-based-object-detection

Employed transfer learning for webcam-based object detection

Python · 0 · 0 · Updated 5 years ago

object-detection, opencv-python, transfer-learning, yolov3

mvish7/License-Plate-Recognition

Contains code for recognizing license plate characters from images

Python · 0 · 0 · Updated 5 years ago

classification, connected-components, ocr

mvish7/traffic_sign_classifier

Traffic sign classifier based on TensorFlow

Jupyter Notebook · 0 · 0 · Updated 5 years ago

classification, german-traffic-signs, lenet-5, tensorflow

mvish7/PyTorch_hands_on

PyTorch practice notebooks from various sources

Jupyter Notebook · 0 · 0 · Updated 6 years ago

pytorch-tutorial

mvish7/tensorflow-yolov3 (Fork)

Pure TensorFlow implementation of YOLOv3 with support for training on your own dataset

Python · 0 · 0 · Updated 6 years ago

mvish7/moveit_gazebo_integration

No description provided.

CMake · 0 · 2 · Updated 6 years ago
