16 results for “topic:near-duplicate-detection”
Image similarity in Golang. Version 4 (LATEST)
ISCC: International Standard Content Code
A Simple Image Clustering Script using CLIP and Hierarchical Clustering
Fast image similarity search with hash tables (Golang). Version 2 (LATEST)
Simple library for finding duplicate and near-duplicate text documents in massive sets/libraries/databases
Python library for detecting near duplicate texts in a corpus at scale.
Multi-module project focused on near-duplicate search for images.
Fast image similarity search with hash tables (Golang). Version 1
Holds code for a near-duplicate image parser using optimized image classifiers.
Bachelor's Thesis on Near-Duplicate Image Detection. This repo contains all resources, code, and documentation developed during the process.
A CLI tool for near-duplicate detection in text files, written in Rust with no dependencies on runtime environments.
An application for comparing images using various image hashing algorithms.
Finds duplicated files, including name variants where underscores replace spaces. Allows a tolerance and uses a signature for audio files to ignore metadata variations.
Uses the PyTerrier library to build a search engine and solve near-duplicate detection tasks.
First homework for the Advanced Data Mining course
Language of Vectors (LangVec) is a simple Python library designed for transforming numerical vector data into a language-like structure using a predefined set of words (lexicon).