thu-coai

Conversational AI groups from Tsinghua University

Beijing, China

http://coai.cs.tsinghua.edu.cn/

Languages

Python 100%

Top Repositories

CDial-GPT

A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models

1.9k stars · Python

Safety-Prompts

Chinese safety prompts for evaluating and improving the safety of LLMs.

1.1k stars

CrossWOZ

A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset

713 stars · Python

Glyph

Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"

565 stars · Python

CharacterGLM-6B

[EMNLP'24] CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models

492 stars · Python

ConvLab-2

ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems

465 stars · Python

Repositories

109

thu-coai/Survive-at-All-Costs

No description provided.

Python · 2 stars · 0 forks · Updated just now

thu-coai/COLDataset

The official repository of the paper: COLD: A Benchmark for Chinese Offensive Language Detection

31629 · Updated 2 hours ago

thu-coai/ConvLab-2

ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems

Python · 465 stars · 137 forks · Updated 3 hours ago

dialogue · dialogue-systems · task-oriented-dialogue

thu-coai/Emotional-Support-Conversation

Data and code for the ACL 2021 paper: Towards Emotional Support Dialog Systems

Python · 30846 · Updated 9 hours ago

thu-coai/CharacterBench

[AAAI'25] CharacterBench: Benchmarking Character Customization of Large Language Models

Python · 211 · Updated 15 hours ago

thu-coai/Glyph

Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"

Python · 565 stars · 49 forks · Updated 1 day ago

thu-coai/AISafetyLab

AISafetyLab: A comprehensive framework covering safety attack, defense, evaluation and paper list.

Python · 23414 · Updated 1 day ago

thu-coai/CrossWOZ

A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset

Python · 713 stars · 117 forks · Updated 1 day ago

thu-coai/EmbodiedAct

No description provided.

0 stars · 0 forks · Updated 2 days ago

thu-coai/JPS

[MM'25] JPS: Jailbreak Multimodal Large Language Models with Collaborative Visual Perturbation and Textual Steering

Python · 165 · Updated 2 days ago

thu-coai/Safety-Prompts

Chinese safety prompts for evaluating and improving the safety of LLMs.

1.1k stars · 88 forks · Updated 3 days ago

attack-defense · chatgpt · chinese-language · instruction · llm · prompt · prompt-engineering · safety

thu-coai/SafetyBench

Official GitHub repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety. [ACL 2024]

Python · 27413 · Updated 3 days ago

thu-coai/Agent-SafetyBench

No description provided.

Python · 1106 · Updated 3 days ago

thu-coai/IF-RewardBench

IF-RewardBench: Benchmarking Judge Models for Instruction-Following Evaluation

Python · 4 stars · 0 forks · Updated 4 days ago

thu-coai/NLG_book

Introduction to the book 《现代自然语言生成》 (Modern Natural Language Generation)

22222 · Updated 4 days ago

thu-coai/PsyQA

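A Chinese question-answering dataset for mental health support, richly annotated with support strategies. It can be used to generate long counseling texts that employ those strategies.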

24620 · Updated 5 days ago

thu-coai/CDial-GPT

A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models

Python · 1.9k stars · 263 forks · Updated 6 days ago

dialogue · gpt · gpt-2 · lccc · pytorch · text-generation

thu-coai/MAPS

Official implementation of the ICLR 2025 paper "MAPS: Advancing Multi-modal Reasoning in Expert-level Physical Science"

Python · 9 stars · 2 forks · Updated 1 week ago

thu-coai/CharacterGLM-6B

[EMNLP'24] CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models

Python · 492 stars · 36 forks · Updated 1 week ago

thu-coai/Backdoor-Data-Extraction

No description provided.

Python · 30 stars · 6 forks · Updated 1 week ago

thu-coai/TransferAttack

[ACL 2025] Guiding not Forcing: Enhancing the Transferability of Jailbreaking Attacks on LLMs via Removing Superfluous Constraints

Python · 191 · Updated 1 week ago

thu-coai/OpenMEVA

Benchmark for evaluating open-ended generation

Python · 517 · Updated 1 week ago

benchmark · evaluation-metrics · language-generation

thu-coai/THUOOP

Course materials and Q&A for the Tsinghua University Object-Oriented Programming course

10510 · Updated 1 week ago

thu-coai/LRM-Safety-Study

No description provided.

Python · 6 stars · 0 forks · Updated 1 week ago

thu-coai/ShieldLM

ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors [EMNLP 2024 Findings]

Python · 22610 · Updated 1 week ago

thu-coai/LongSafety

[ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models

Python · 160 · Updated 2 weeks ago

thu-coai/CodePlan

No description provided.

172 · Updated 2 weeks ago

thu-coai/ccm

This project is a TensorFlow implementation of our work, CCM (Commonsense Conversational Model).

Python · 21966 · Updated 2 weeks ago

thu-coai/BARREL

[ICLR 2026] BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs

Python · 171 · Updated 2 weeks ago

thu-coai/ComplexBench

Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)

Python · 10212 · Updated 3 weeks ago
