GitHunt

Paul Lerner

PaulLerner

Postdoc at Sorbonne Université, CNRS, ISIR

Sorbonne Université, CNRS, ISIR
France

Languages

Python 57%, Jupyter Notebook 30%, HTML 4%, JavaScript 4%, Rich Text Format 4%

Repos

52

Stars

52

Forks

37

Top Language

Python


Top Repositories

PaulLerner/wug

interface for annotating wug tests

HTML · 0 stars · 0 forks · Updated 1 day ago

PaulLerner/ppllm

🤔 A Python Library to Compute LLM's Perplexity and Surprisal

Python · 6 stars · 0 forks · Updated 1 month ago

PaulLerner/texutils

Python utils to process LaTeX

Python · 0 stars · 0 forks · Updated 6 months ago

PaulLerner/aivancity_nlp

Main repository for the 2024-2026 Natural Language Processing class at aivancity by Paul Lerner

Python · 0 stars · 0 forks · Updated 3 months ago

PaulLerner/PaulLerner.github.io

Postdoc at Sorbonne Université, CNRS, ISIR

JavaScript · 0 stars · 0 forks · Updated 3 months ago

PaulLerner/deep_parkinson_handwriting

No description provided.

Jupyter Notebook · 5 stars · 2 forks · Updated 6 years ago

PaulLerner/21-EuroParl

Dataset and code for the paper "Assessing the Political Fairness of Multilingual LLMs: A Case Study based on a 21-way Multiparallel EuroParl Dataset" (Lerner and Yvon, 2025)

Jupyter Notebook · 0 stars · 0 forks · Updated 4 months ago

PaulLerner/anr-dfg (Fork)

A LaTeX template for an ANR-DFG grant proposal.

Rich Text Format · 0 stars · 0 forks · Updated 4 months ago

PaulLerner/inclure

Automatic translation from Standard to Inclusive French, and vice-versa

Python · 2 stars · 0 forks · Updated 1 year ago

PaulLerner/lxmls-toolkit (Fork)

Machine Learning applied to Natural Language Processing Toolkit used in the Lisbon Machine Learning Summer School

Jupyter Notebook · 0 stars · 0 forks · Updated 7 months ago

PaulLerner/ViQuAE

Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retrieval (Lerner et al., ECIR'24)

Python · 38 stars · 2 forks · Updated 1 year ago

PaulLerner/lxmls-guide (Fork)

Lisbon Machine Learning Summer School Lab Guide

0 stars · 0 forks · Updated 9 months ago

PaulLerner/affluences

Scrape affluences.com

Jupyter Notebook · 0 stars · 0 forks · Updated 10 months ago

PaulLerner/ensae_dl_pw2

Repository for the second Practical Work of ENSAE's Deep Learning class 2024-2025.

Jupyter Notebook · 0 stars · 14 forks · Updated 10 months ago

PaulLerner/ensae_dl_pw1

Repository for the first Practical Work of ENSAE's Deep Learning class 2024-2025

Jupyter Notebook · 1 star · 14 forks · Updated 11 months ago

PaulLerner/bertalign (Fork)

Multilingual sentence alignment using sentence embeddings

Python · 0 stars · 1 fork · Updated 11 months ago

PaulLerner/europarl-uds (Fork)

Toolkit to compile a comparable/parallel corpus from European Parliament proceedings

Python · 0 stars · 0 forks · Updated 1 year ago

PaulLerner/neott

Source code and data for the papers by Lerner and Yvon: Towards the Machine Translation of Scientific Neologisms / Unlike “Likely”, “Unlike” is Unlikely: BPE-based Segmentation hurts Morphological Derivations in LLMs

Python · 0 stars · 0 forks · Updated 1 year ago

PaulLerner/neoseg

A tool for Lexematic Segmentation by Paul Lerner

Python · 0 stars · 0 forks · Updated 1 year ago

PaulLerner/arxiv-latex-cleaner (Fork)

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python · 0 stars · 0 forks · Updated 1 year ago

PaulLerner/unlikely

Data for the paper Unlike “Likely”, “Unlike” is Unlikely: BPE-based Segmentation hurts Morphological Derivations in LLMs (Lerner and Yvon, 2025)

0 stars · 0 forks · Updated 1 year ago

PaulLerner/tower-eval (Fork)

No description provided.

0 stars · 0 forks · Updated 1 year ago

PaulLerner/symptoms (Fork)

Symptoms subset of TERMIUM

0 stars · 0 forks · Updated 1 year ago

PaulLerner/france_terme (Fork)

No description provided.

0 stars · 0 forks · Updated 1 year ago

PaulLerner/termium (Fork)

No description provided.

0 stars · 0 forks · Updated 1 year ago

PaulLerner/nlp-lab-language-models (Fork)

Hit the fork button!

0 stars · 3 forks · Updated 3 years ago

PaulLerner/nlp-lab-text-embedding (Fork)

No description provided.

Python · 0 stars · 1 fork · Updated 2 years ago

PaulLerner/imdb2allocine

Map IMDb to Allociné, for the main purpose of collecting French press reviews/ratings.

Jupyter Notebook · 0 stars · 0 forks · Updated 2 years ago

PaulLerner/pyannote-db-plumcot-loader (Fork, Archived)

Data loader for pyannote.db.plumcot

Python · 0 stars · 0 forks · Updated 5 years ago

PaulLerner/lightning (Fork)

Build and train PyTorch models and connect them to the ML lifecycle using Lightning App templates, without handling DIY infrastructure, cost management, scaling, and other headaches.

Python · 0 stars · 0 forks · Updated 3 years ago
