166 results for “topic:chinese-language”
A generative speech model for daily dialogue.
:orange_book: 中华新华字典数据库。包括歇后语,成语,词语,汉字。
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
Chinese safety prompts for evaluating and improving the safety of LLMs. 中文安全prompts,用于评估和提升大模型的安全性。
A linting tool for Chinese language.
Rime Cantonese input schema | 中州韻粵語拼音輸入方案
Learn, read, write and practice Mandarin by drawing strokes in Anki Desktop, AnkiDroid and AnkiMobile with audio of HSK 2.0 (HSK1-6) and HSK 3.0 (HSK 1-9) characters.
A framework for cleaning Chinese dialog data
中文词典 / 中文詞典。Chinese / Chinese-English dictionaries.
收集非普通話漢語和古漢語的中州韻輸入法拼音方案 Collection of phonetic spelling schemas for Sinitic languages and dialects
Discovering magic squares in Tang Dynasty poems
Python scraper for Language Pods such as Japanesepod101.com :japanese_ogre: :japan: :sushi: Compatible with Japanese, Chinese, French, German, Italian, Korean, Portuguese, Russian, Spanish and many more! ✨
solidity-by-example 教程中文翻译|@Web3-Club
Complete, HSK 2.0/3.0 (汉语水平考试) Vocabulary Lists in Json
CJK computer science terms comparison / 中日韓電腦科學術語對照 / 日中韓のコンピュータ科学の用語対照 / 한·중·일 전산학 용어 대조
A webapp to visualize relationships among Chinese characters and to see example sentences that illustrate their use. Also available for Japanese learners.
Python toolkit for Chinese Language Understanding(CLUE) Evaluation benchmark
Free Human Language Learning Resources
Từ điển tiếng Việt dành cho máy đọc sách Kindle, Kobo, Pocketbook v.v.
文本去重
简繁转换 簡繁轉換 Python implementation of StarCC, the next generation of Simplified-Traditional Chinese conversion framework
開放粵語字典 - 現代粵語字音數據庫
《精通以太坊》(中文开放版) 原书作者:Andreas M. Antonopoulos, Gavin Wood
This codebase is a solution for making Chinese study, through Anki, more enjoyable by making the flashcards beautiful.
寧波閒話吳語拼音輸入方案 · 寧波話吳語拼音輸入方案 · A Rime input schema for Ningbo Dialect
上海吳語拼音輸入方案 · 上海吴语拼音输入方案 · Rime input schemas for Shanghai Dialects
A tool to add Putonghua pronunciations in IPA form on Chinese texts
Cleaned up HSK 3.0 vocabulary list with pinyin, POS, traditional terms, variants etc.
A demo of fine tune Stable Diffusion on Pokemon-Blip-Captions in English, Japanese and Chinese Corpus
中文《诗歌总集》,距今为止最全面,最系统的中文诗词数据集,统一数据建模.