GitHunt

Olivier Binette

OlivierBinette

Research Scientist @ Upstart // Duke Statistical Science PhD

Duke University
Durham, NC

Languages

Python 30%, HTML 17%, R 17%, Jupyter Notebook 13%, C++ 9%, CSS 4%, TypeScript 4%, JavaScript 4%

Top Repositories

Repositories

120

OlivierBinette/fingermatchR

Fingerprint matching tools based on NIST's mindtct and bozorth3 algorithms.

C++ · 10 · 0 · Updated 9 hours ago

OlivierBinette/GroupTreeShap

GroupSHAP variant of the TreeSHAP algorithm.

Jupyter Notebook · 0 · 0 · Updated 1 day ago

OlivierBinette/Awesome-Entity-Resolution

List of entity resolution software and resources.

11112 · Updated 3 days ago
awesome, awesome-list, deduplication, entity-resolution, identity-resolution, record-linkage

OlivierBinette/olivierbinette.github.io

No description provided.

HTML · 1 · 0 · Updated 3 weeks ago

OlivierBinette/FeatureStore-lite

A lightweight feature store for Pandas, DuckDB, or your choice of backend.

Python · 1 · 0 · Updated 1 month ago
duckdb, feature-store, featurestore, mlops, pandas, python

OlivierBinette/xgboost (Fork)

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

C++ · 0 · 0 · Updated 1 month ago

OlivierBinette/OlivierBinette

No description provided.

1 · 0 · Updated 1 month ago

OlivierBinette/er-evaluation

An End-to-End Evaluation Framework for Entity Resolution Systems

Python · 3611 · Updated 3 months ago
author-name-disambiguation, data-science, deduplication, disambiguation, duplicate-detection, entity-resolution, evaluation, fuzzy-matching, inventor-name-disambiguation, matching, ml-evaluation, ml-testing, record-linkage, statistics

OlivierBinette/StringCompare

Efficient String Comparison Functions and Fuzzy String Matching

Python · 20 · 2 · Updated 4 months ago
damerau-levenshtein, edit-distance, fuzzy-matching, jaro-winkler, levenshtein-distance, pybind11, python, string-matching

OlivierBinette/assert

Lightweight validation tool for checking function arguments and data analysis scripts.

R · 121 · Updated 4 months ago
argument-checks, assertions, error-messages, r, validation

OlivierBinette/cache

Easily cache and retrieve computation results in R

R · 7 · 1 · Updated 4 months ago

OlivierBinette/streamlit-survey

Survey components for Streamlit apps

Python · 2616 · Updated 4 months ago
feedback, streamlit, streamlit-survey, survey

OlivierBinette/welcome-to-the-moon-card-flipper

Card flipping app for "Welcome to the Moon"

CSS · 7 · 0 · Updated 5 months ago
board-game, javascript, welcome-to-the-moon

OlivierBinette/groupbyrule

Deduplicate data using fuzzy and deterministic matching rules.

Python · 8 · 0 · Updated 10 months ago
edit-distance, entity-resolution, igraph, levenshtein, levenshtein-distance, pandas, record-linkage, string-distance, string-matching

OlivierBinette/MSETools

Code and analyses for the paper titled “On the Reliability of Multiple Systems Estimation for the Quantification of Modern Slavery” (Binette and Steorts, 2021).

R · 3 · 0 · Updated 12 months ago

OlivierBinette/USPTO-Patents-XML-Resources

USPTO XML resources and data examples for patent text.

HTML · 1 · 0 · Updated 12 months ago

OlivierBinette/libpostal (Fork)

A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.

0 · 0 · Updated 1 year ago

OlivierBinette/tina-cloud-starter

No description provided.

TypeScript · 0 · 0 · Updated 1 year ago

OlivierBinette/VisTree (Archived)

No description provided.

Jupyter Notebook · 6 · 0 · Updated 1 year ago

OlivierBinette/PatentsView-Code-Snippets (Fork)

No description provided.

0 · 0 · Updated 1 year ago

OlivierBinette/earthquakes

3D data visualization with WebGL/three.js

JavaScript · 3 · 0 · Updated 1 year ago

OlivierBinette/assignee-search

No description provided.

Jupyter Notebook · 0 · 1 · Updated 1 year ago

OlivierBinette/JSM-2023

ER-Evaluation Demo for JSM 2023

HTML · 1 · 0 · Updated 1 year ago

OlivierBinette/simple-typo-tolerant-search

Efficient typo-tolerant search in 76 lines of code, with no dependencies.

Python · 3 · 0 · Updated 1 year ago
fuzzy-matching, fuzzy-search, levenshtein-algorithm, levenshtein-automaton, levenshtein-distance, search, search-engine, trie, trie-structure, typo-tolerant

OlivierBinette/imodels (Fork)

Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).

0 · 0 · Updated 1 year ago

OlivierBinette/Awesome-LLMs-Evaluation-Papers (Fork)

The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.

0 · 0 · Updated 1 year ago

OlivierBinette/FoFo (Fork)

No description provided.

Python · 0 · 0 · Updated 1 year ago

OlivierBinette/TruthfulQA (Fork)

TruthfulQA: Measuring How Models Imitate Human Falsehoods

0 · 0 · Updated 1 year ago

OlivierBinette/ul-benchmark-datasets-for-entity-resolution-archive

Unofficial archive of https://dbs.uni-leipzig.de/research/projects/benchmark-datasets-for-entity-resolution

HTML · 0 · 0 · Updated 1 year ago

OlivierBinette/TessTools

Tools for the use of Tesseract OCR in R

R · 4 · 3 · Updated 1 year ago
digital-humanities, historical-newspapers, ocr, r, tesseract, tesseract-ocr
