Top Repositories
Official implementation of the paper "Diffusion-based Image Generation for In-distribution Data Augmentation in Surface Defect Detection" accepted @ VISAPP 2024.
Official implementation of the paper "Leveraging Latent Diffusion Models for Training-Free In-Distribution Data Augmentation for Surface Defect Detection" accepted @ CBMI 2024.
[ICCV 25] Official repository of "Collaborative Instance Object Navigation: Leveraging Uncertainty-Awareness to Minimize Human-Agent Dialogues"
[IROS 24] Official repository of "Mind the Error! Detection and Localization of Instruction Errors in Vision-and-Language Navigation". We present the first dataset - R2R-IE-CE - to benchmark instructions errors in VLN. We then propose a method, IEDL.
[ICCV 23] Official repository for Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural language
Official implementation of "StructXLIP: Enhancing Vision-language Models with Multimodal Structural Cues", CVPR 2026.
Repositories
24Official implementation of "StructXLIP: Enhancing Vision-language Models with Multimodal Structural Cues", CVPR 2026.
[ICCV 25] Official repository of "Collaborative Instance Object Navigation: Leveraging Uncertainty-Awareness to Minimize Human-Agent Dialogues"
HARPER is a HRI dataset for 3D Human Pose Estimation and Forecasting from the Robot’s Perspective.
Official implementation of the paper "Leveraging Latent Diffusion Models for Training-Free In-Distribution Data Augmentation for Surface Defect Detection" accepted @ CBMI 2024.
No description provided.
[TPAMI] Official repository for "Unsupervised Active Visual Search with Monte Carlo planning under Uncertain Detections"
Official implementation of the paper "KairosAD: A SAM-Based Model for Industrial Anomaly Detection on Embedded Devices" accepted @ ICIAP 2025.
Official implementation of the paper "Disentangled Latent Spaces Facilitate Data-Driven Auxiliary Learning" accepted @ ICIAP 2025.
Official implementation of the paper "Diffusion-based Image Generation for In-distribution Data Augmentation in Surface Defect Detection" accepted @ VISAPP 2024.
[IROS 24] Official repository of "Mind the Error! Detection and Localization of Instruction Errors in Vision-and-Language Navigation". We present the first dataset - R2R-IE-CE - to benchmark instructions errors in VLN. We then propose a method, IEDL.
L-VQAScore
[ICCV 23] Official repository for Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural language
Official implementation of the paper "SITUATE: Indoor Human Trajectory Prediction through Geometric Features and Self-Supervised Vision Representation" accepted @ ICPR 2024.
Official implementation of the paper "MTL-Split: Multi-Task Learning for Edge Devices using Split Computing" accepted @ DAC 2024.
Official implementation of the paper "Enhancing Split Computing and Early Exit Applications through Predefined Sparsity" accepted @ FDL 2024.
Official implementation of the paper "LO-SC: Local-only Split Computing for Accurate Deep Learning on Edge Devices" accepted @ VLSI Design 2025.
Official implementation of the OO-dMVMT paper
Official implementation of the paper "Upper-Body Pose-based Gaze Estimation for Privacy-Preserving 3D Gaze Target Detection" accepted at the T-CAP WS (ECCV 2024).
Official implementation of the 3D Pose Estimation baseline on the HARPER dataset, accepted @ IROS 2024..
Official implementation of the paper "MDiFF: Exploiting Multimodal Score-based Diffusion Models for New Fashion Product Performance Forecasting" accepted @ the FashionAI WS (ECCV 2024).
We present SCENE-pathy, a dataset and a set of baselines to study the visual selective attention (VSA) of people towards the 3D scene in which they are located
Some tutorials (and templates) from our research lab, like fixing nvidia-gpu problems, or having templates at hands
No description provided.
Official implementation of the paper "I-Split: Deep Network Interpretability for Split Computing" accepted @ ICPR 2022.