11 results for “topic:homograph”
This is a python version of samesame repo to generate homograph strings
Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G2P tasks without additional training, featuring Sentence-Bench and Kaamel-Dict.
Given a TLD zone file, PhishCanary extracts International Domain Names (IDNs) that are homoglyphs of specified target domain names.
A Persian grapheme-to-phoneme (G2P) model designed for homograph disambiguation, fine-tuned using the HomoRich dataset to improve pronunciation accuracy.
Benchmarking notebooks for various Persian G2P models, comparing their performance on the SentenceBench dataset, including Homo-GE2PE and Homo-T5.
Map latin letters to similar unicodes
Get all possible homograph punycoded domains of a particular domain
HomoRich: The first large-scale Persian homograph dataset for G2P conversion, featuring 528K annotated sentences with balanced pronunciation variants and dual phoneme representations.
This is a simple JavaScript based project that checks, detects and validate given URL for possibilities of homograph, homoglyph, IDN and any suspicious format.
An project to protect you from homograph phising attacks like "góog1e.com" or "fącebook.com"
Solving various image processing, machine learning, and deep learning problems. Assignments for Computer Vision Course in UGR.