1,655 results for “topic:cosine-similarity”
Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-Winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
FAst Lookups of Cosine and Other Nearest Neighbors (based on fast locality-sensitive hashing)
TensorFlow Similarity is a Python package focused on making similarity learning quick and easy.
Embeddable vector database for Go with Chroma-like interface and zero third-party dependencies. In-memory with optional persistence.
Blazing fast framework for fine-tuning similarity learning models
Music recommender using deep learning with Keras and TensorFlow
A .NET port of java-string-similarity
Generating multiple choice questions from text using Machine Learning.
Python package to accelerate sparse matrix multiplication and top-n similarity selection
Score documents using embedding-vectors dot-product or cosine-similarity with ES Lucene engine
📈 This repo contains detailed notes and multiple Python projects related to AI and finance. Follow the blog here: https://purvasingh.medium.com
Production-ready K-Means clustering for Apache Spark with pluggable Bregman divergences (KL, Itakura-Saito, L1, etc). 6 algorithms, 740 tests, cross-version persistence. Drop-in replacement for MLlib with mathematically correct distance functions for probability distributions, spectral data, and count data.
A Python project for checking documents for plagiarism based on cosine similarity
Vector Storage is a vector database that enables semantic similarity searches on text documents in the browser's local storage. It uses OpenAI embeddings to convert documents into vectors and allows searching for similar documents based on cosine similarity.
A blazing-fast semantic search React component. Match by meaning, not just by letters. Search as you type without waiting (no debounce needed). Rank by cosine similarity.
Real-time face recognition using SCRFD, ArcFace, ByteTrack, and a similarity measure
Coltt is a vector database that supports Multi-Vector Search, high-performance HNSW, FLAT and quantization, and enables fast searches through sophisticated internal data shard design.
Chinese text similarity calculator
Fast similarity search using DuckDB
Machine learning based text classification in JavaScript using n-grams and cosine similarity
Information retrieval algorithms developed in Python. To follow the blog posts, click on the link:
Locality Sensitive Hashing in Rust with Python bindings
Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.
Web Application for checking the similarity between query and document using the concept of Cosine Similarity.
[NeurIPS 2021] Official implementation of the paper "One Loss for All: Deep Hashing with a Single Cosine Similarity based Learning Objective"
SAX-VSM public release; visit our website for details
Samples on how to use Azure SQL database with Azure OpenAI
Taking a pretrained GloVe model, and using it as a TensorFlow embedding weight layer **inside the GPU**. Therefore, you only need to send the index of the words through the GPU data transfer bus, reducing data transfer overhead.
Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K most similar items for a large number of items by chunking the item matrix representation (embeddings) and using Numba to accelerate the calculations.
A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more.
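
Many of the projects listed above rank items by cosine similarity and then keep the K best matches. As a rough illustration of that shared idea, here is a minimal pure-Python sketch. It is not taken from any of the listed repositories: the function names `cosine_similarity` and `top_k_similar`, the toy vectors, and the choice to return 0.0 for a zero vector are all assumptions made for this example.

```python
import heapq
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        # Cosine similarity is undefined for a zero vector;
        # returning 0.0 is a convention assumed for this sketch.
        return 0.0
    return dot / (norm_a * norm_b)


def top_k_similar(query, items, k=3):
    """Return the k (name, score) pairs most similar to query, best first."""
    scored = ((name, cosine_similarity(query, vec)) for name, vec in items.items())
    return heapq.nlargest(k, scored, key=lambda pair: pair[1])


# Hypothetical toy "embeddings", invented purely for illustration.
items = {
    "doc_a": [1.0, 0.0, 0.0],
    "doc_b": [0.9, 0.1, 0.0],
    "doc_c": [0.0, 1.0, 0.0],
}

if __name__ == "__main__":
    print(top_k_similar([1.0, 0.0, 0.0], items, k=2))
```

A brute-force scan like this is linear in the number of items. The libraries above that mention locality-sensitive hashing, HNSW, or chunked matrix multiplication exist largely to avoid that cost on large collections.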