GitHunt

Adrián Bazaga

AdrianBZG

Senior Researcher @ Microsoft. Foundational LLM Development. PhD, Machine Learning @ University of Cambridge. Ex: Amazon AGI, Microsoft Research

@Microsoft, University of Cambridge
London, United Kingdom

Organizations

Languages

Python52%Jupyter Notebook12%Java8%JavaScript8%SCSS4%R4%C#4%HTML4%TypeScript4%

Repos

205

Stars

275

Forks

89

Top Language

Python

Loading contributions...

Top Repositories

Repositories

205
AD
AdrianBZG/llama-multimodal-vqa

Multimodal Instruction Tuning for Llama 3

Python5211Updated 1 year ago
chatbotchatgptgpt-4huggingfaceinstruction-tuninglanguage-modelsllamallama2llama3multimodalmultimodal-instruction-tuningvisual-language-learningvisual-question-answeringvqa
AD
AdrianBZG/TabMDA

[ICML 2024] TabMDA: Tabular Manifold Data Augmentation for Any Classifier using Transformers with In-context Subsetting

Python90Updated 1 year ago
data-augmentationdeep-learningin-context-learningmanifold-data-augmentationtabpfntabular-datatabular-transformers
AD
AdrianBZG/HyperBERT

[EMNLP 2024] HyperBERT: Mixing Hypergraph-Aware Layers with Language Models for Node Classification on Text-Attributed Hypergraphs

Python231Updated 11 months ago
deep-learninghypergraphslanguage-modelmultimodal-deep-learning
AD
AdrianBZG/Twitter-Follow-ExploitArchived

Automated Twitter mass account creation and follow using Selenium and Tor VPN

Java5816Updated 8 years ago
exploitmass-account-creationopen-sourcetwittertwitter-account-creationtwitter-automationtwitter-followers
AD
AdrianBZG/LLM-distributed-finetune

Tune efficiently any LLM model from HuggingFace using distributed training (multiple GPU) and DeepSpeed. Uses Ray AIR to orchestrate the training on multiple AWS GPU instances

Python606Updated 2 years ago
awsdeep-learningdistributed-trainingfalconfine-tuninghuggingfacelarge-language-modelsnatural-language-processingtransformers
AD
AdrianBZG/adrianbzg.github.io

Personal website

JavaScript00Updated 4 months ago
AD
AdrianBZG/sentence-transformersFork

Multilingual Sentence & Image Embeddings with BERT

00Updated 3 years ago
AD
AdrianBZG/FLUID-LLM

FLUID-LLM: Learning Computational Fluid Dynamics with Spatiotemporal-aware Large Language Models

Python32Updated 1 year ago
deep-learningfluid-dynamicslanguage-modelspatiotemporal
AD
AdrianBZG/Polyglotter

[Nature Scientific Reports] Translating synthetic natural language to database queries: a polyglot deep learning framework

Python268Updated 2 years ago
databasesdeep-learningnatural-language-processingtransformer
AD
AdrianBZG/TISERFork

[ACL 2025] Learning to Reason Over Time: Timeline Self-Reflection for Improved Temporal Reasoning in Language Models

10Updated 9 months ago
AD
AdrianBZG/SFAVEL

[ICLR 2024] Unsupervised Pretraining for Fact Verification by Language Model Distillation

Python50Updated 2 years ago
deep-learningknowledge-distillationknowledge-graphslanguage-modelmultimodal-deep-learningnatural-language-processingself-supervised-learning
AD
AdrianBZG/Muscular-Dystrophy-Diagnosis

[Applied Soft Computing] A Convolutional Neural Network for the automatic diagnosis of collagen VI-related muscular dystrophies

Python51Updated 6 years ago
biomedical-image-processingconvolutional-neural-networksdeep-learningimage-analysis
AD
AdrianBZG/CancerTargetPrediction

Genome-wide investigation of gene-cancer associations for the prediction of novel therapeutic targets in oncology

Python20Updated 2 years ago
AD
AdrianBZG/SQLformer

SQLformer: Deep Auto-Regressive Query Graph Generation for Text-to-SQL Translation

Python63Updated 2 years ago
AD
AdrianBZG/tinyroberta-distillation-qa-es

[ICLR 2024] Language Model Knowledge Distillation for Efficient Question Answering in Spanish

Python10Updated 2 years ago
deep-learningefficient-deep-learningknowledge-distillationlanguage-modelnatural-language-processingquestion-answeringspanish
AD
AdrianBZG/caraml-websiteFork

Content for the CaRAML website

SCSS00Updated 1 year ago
AD
AdrianBZG/fgseaFork

Fast Gene Set Enrichment Analysis

R00Updated 7 years ago
AD
AdrianBZG/TabPFNFork

Official implementation of the TabPFN paper (https://arxiv.org/abs/2207.01848) and the tabpfn package.

00Updated 2 years ago
AD
AdrianBZG/Medieval_Warfare_VR-Unity

Virtual Reality game for the Intelligent Interfaces subject, made with Unity Engine.

C#112Updated 9 years ago
AD
AdrianBZG/scancode-toolkitFork

:mag_right: ScanCode scans code and detects licenses, copyrights, package manifests & dependencies and more ... to discover and inventory open source and third-party packages used in your code.

HTML00Updated 7 years ago
AD
AdrianBZG/A-simple-baseline-algorithm-for-graph-classificationFork

No description provided.

Jupyter Notebook10Updated 7 years ago
AD
AdrianBZG/2018-MachineLearning-Lectures-ESAFork

Machine Learning Lectures at the European Space Agency (ESA) in 2018

Jupyter Notebook10Updated 7 years ago
AD
AdrianBZG/aima-pseudocodeFork

Pseudocode descriptions of the algorithms from Russell And Norvig's "Artificial Intelligence - A Modern Approach"

10Updated 9 years ago
AD
AdrianBZG/aima-javaFork

Java implementation of algorithms from Russell And Norvig's "Artificial Intelligence - A Modern Approach"

Java10Updated 8 years ago
AD
AdrianBZG/personal_websiteArchived

My personal website, deployed using Docker

TypeScript10Updated 2 years ago
dockernodejspersonal-websitereactjstemplate
AD
AdrianBZG/yale-lily.github.ioFork

No description provided.

00Updated 2 years ago
AD
AdrianBZG/nlp-tutorialFork

Natural Language Processing Tutorial for Deep Learning Researchers

Jupyter Notebook00Updated 6 years ago
AD
AdrianBZG/graph-representation-learningFork

Autoencoders for Link Prediction and Semi-Supervised Node Classification

Python10Updated 7 years ago
AD
AdrianBZG/InterMine-Data-Browser-Tool

InterMine Data Browser: a tool for exploring semi-homogeneous biological datasets

JavaScript438Updated 2 years ago
gsoc-2018gsoc-2019gsoc-2020intermine
AD
AdrianBZG/BIOLITMAPFork

Code for the paper "BIOLITMAP: a web-based geolocated, temporal and thematic visualization of the evolution of bioinformatics publications", Bazaga et al. (2018). Accepted in Oxford Bioinformatics. doi:10.1093/bioinformatics/bty967

Python31Updated 6 years ago
data-miningdata-sciencedata-visualizationmachine-learningmapsnatural-language-processingresearchresearch-paperscience

Gists

Recent Activity

Adrián Bazaga (AdrianBZG) | GitHunt