Olivier Binette

OlivierBinette

Research Scientist @ Upstart // Duke Statistical Science PhD

Duke University

Durham, NC

https://olivierbinette.ca/

Organizations

Languages

Python30%HTML17%R17%Jupyter Notebook13%C++9%CSS4%TypeScript4%JavaScript4%

Top Repositories

Awesome-Entity-Resolution

List of entity resolution software and resources.

An End-to-End Evaluation Framework for Entity Resolution Systems

streamlit-survey

Survey components for Streamlit apps

Efficient String Comparison Functions and Fuzzy String Matching

Lightweight validation tool for checking function arguments and data analysis scripts.

Fingerprint matching tools based on NIST's mindtct and bozorth3 algorithms.

Repositories

120

OlivierBinette/fingermatchR

Fingerprint matching tools based on NIST's mindtct and bozorth3 algorithms.

C++100Updated 9 hours ago

OlivierBinette/GroupTreeShap

GroupSHAP variant of the TreeSHAP algorithm.

Jupyter Notebook00Updated 1 day ago

OlivierBinette/Awesome-Entity-Resolution

List of entity resolution software and resources.

11112Updated 3 days ago

awesomeawesome-listdeduplicationentity-resolutionidentity-resolutionrecord-linkage

OlivierBinette/olivierbinette.github.io

No description provided.

HTML10Updated 3 weeks ago

OlivierBinette/FeatureStore-lite

A lightweight feature store for Pandas, DuckDB, or your choice of backend.

Python10Updated 1 month ago

duckdbfeature-storefeaturestoremlopspandaspython

OlivierBinette/xgboostFork

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

C++00Updated 1 month ago

OlivierBinette/OlivierBinette

No description provided.

10Updated 1 month ago

OlivierBinette/er-evaluation

An End-to-End Evaluation Framework for Entity Resolution Systems

Python3611Updated 3 months ago

author-name-disambiguationdata-sciencededuplicationdisambiguationduplicate-detectionentity-resolutionevaluationfuzzy-matchinginventor-name-disambiguationmatchingml-evaluationml-testingrecord-linkagestatistics

OlivierBinette/StringCompare

Efficient String Comparison Functions and Fuzzy String Matching

Python202Updated 4 months ago

damerau-levenshteinedit-distancefuzzy-matchingjaro-winklerlevenshtein-distancepybind11pythonstring-matching

OlivierBinette/assert

Lightweight validation tool for checking function arguments and data analysis scripts.

R121Updated 4 months ago

argument-checksassertionserror-messagesrvalidation

OlivierBinette/cache

Easily cache and retrieve computation results in R

R71Updated 4 months ago

OlivierBinette/streamlit-survey

Survey components for Streamlit apps

Python2616Updated 4 months ago

feedbackstreamlitstreamlit-surveysurvey

OlivierBinette/welcome-to-the-moon-card-flipper

Card flipping app for "Welcome to the Moon"

CSS70Updated 5 months ago

board-gamejavascriptwelcome-to-the-moon

OlivierBinette/groupbyrule

Deduplicate data using fuzzy and deterministic matching rules.

Python80Updated 10 months ago

edit-distanceentity-resolutionigraphlevenshteinlevenshtein-distancepandasrecord-linkagestring-distancestring-matching

OlivierBinette/MSETools

Code and analyses for the paper titled “On the Reliability of Multiple Systems Estimation for the Quantification of Modern Slavery” (Binette and Steorts, 2021).

R30Updated 12 months ago

OlivierBinette/USPTO-Patents-XML-Resources

USPTO XML resources and data examples for patent text.

HTML10Updated 12 months ago

OlivierBinette/libpostalFork

A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.

00Updated 1 year ago

OlivierBinette/tina-cloud-starter

No description provided.

TypeScript00Updated 1 year ago

OlivierBinette/VisTreeArchived

No description provided.

Jupyter Notebook60Updated 1 year ago

OlivierBinette/PatentsView-Code-SnippetsFork

No description provided.

00Updated 1 year ago

OlivierBinette/earthquakes

3D data visualization with WebGL/three.js

JavaScript30Updated 1 year ago

OlivierBinette/assignee-search

No description provided.

Jupyter Notebook01Updated 1 year ago

OlivierBinette/JSM-2023

ER-Evaluation Demo for JSM 2023

HTML10Updated 1 year ago

OlivierBinette/simple-typo-tolerant-search

Efficient typo-tolerant search in 76 lines of code, with no dependencies.

Python30Updated 1 year ago

fuzzy-matchingfuzzy-searchlevenshtein-algorithmlevenshtein-automatonlevenshtein-distancesearchsearch-enginetrietrie-structuretypo-tolerant

OlivierBinette/imodelsFork

Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).

00Updated 1 year ago

OlivierBinette/Awesome-LLMs-Evaluation-PapersFork

The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.

00Updated 1 year ago

OlivierBinette/FoFoFork

No description provided.

Python00Updated 1 year ago

OlivierBinette/TruthfulQAFork

TruthfulQA: Measuring How Models Imitate Human Falsehoods

00Updated 1 year ago

OlivierBinette/ul-benchmark-datasets-for-entity-resolution-archive

Unofficial archive of https://dbs.uni-leipzig.de/research/projects/benchmark-datasets-for-entity-resolution

HTML00Updated 1 year ago

OlivierBinette/TessTools

Tools for the use of Tesseract OCR in R

R43Updated 1 year ago

digital-humanitieshistorical-newspapersocrrtesseracttesseract-ocr

Gists

Recent Activity

Olivier Binette (OlivierBinette) | GitHunt