GitHunt

thu-coai

Conversational AI groups from Tsinghua University

Languages

Python100%

Top Repositories

Repositories

109
TH
thu-coai/Survive-at-All-Costs

No description provided.

Python20Updated just now
TH
thu-coai/COLDataset

The official repository of the paper: COLD: A Benchmark for Chinese Offensive Language Detection

31629Updated 2 hours ago
TH
thu-coai/ConvLab-2

ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems

Python465137Updated 3 hours ago
dialoguedialogue-systemstask-oriented-dialogue
TH
thu-coai/Emotional-Support-Conversation

Data and codes for ACL 2021 paper: Towards Emotional Support Dialog Systems

Python30846Updated 9 hours ago
TH
thu-coai/CharacterBench

[AAAI'25] CharacterBench: Benchmarking Character Customization of Large Language Models

Python211Updated 15 hours ago
TH
thu-coai/Glyph

Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"

Python56549Updated 1 day ago
TH
thu-coai/AISafetyLab

AISafetyLab: A comprehensive framework covering safety attack, defense, evaluation and paper list.

Python23414Updated 1 day ago
TH
thu-coai/CrossWOZ

A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset

Python713117Updated 1 day ago
TH
thu-coai/EmbodiedAct

No description provided.

00Updated 2 days ago
TH
thu-coai/JPS

[MM'25] JPS: Jailbreak Multimodal Large Language Models with Collaborative Visual Perturbation and Textual Steering

Python165Updated 2 days ago
TH
thu-coai/Safety-Prompts

Chinese safety prompts for evaluating and improving the safety of LLMs. 中文安全prompts,用于评估和提升大模型的安全性。

1.1k88Updated 3 days ago
attack-defensechatgptchinese-languageinstructionllmpromptprompt-engineeringsafety
TH
thu-coai/SafetyBench

Official github repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety. [ACL 2024]

Python27413Updated 3 days ago
TH
thu-coai/Agent-SafetyBench

No description provided.

Python1106Updated 3 days ago
TH
thu-coai/IF-RewardBench

IF-RewardBench: Benchmarking Judge Models for Instruction-Following Evaluation

Python40Updated 4 days ago
TH
thu-coai/NLG_book

书籍《现代自然语言生成》介绍

22222Updated 4 days ago
TH
thu-coai/PsyQA

一个中文心理健康支持问答数据集,提供了丰富的援助策略标注。可用于生成富有援助策略的长咨询文本。

24620Updated 5 days ago
TH
thu-coai/CDial-GPT

A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models

Python1.9k263Updated 6 days ago
dialoguegptgpt-2lcccpytorchtext-generation
TH
thu-coai/MAPS

Official Implementation of ICLR25 paper "MAPS: Advancing Multi-modal Reasoning in Expert-level Physical Science"

Python92Updated 1 week ago
TH
thu-coai/CharacterGLM-6B

[EMNLP'24] CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models

Python49236Updated 1 week ago
TH
thu-coai/Backdoor-Data-Extraction

No description provided.

Python306Updated 1 week ago
TH
thu-coai/TransferAttack

[ACL 2025] Guiding not Forcing: Enhancing the Transferability of Jailbreaking Attacks on LLMs via Removing Superfluous Constraints

Python191Updated 1 week ago
TH
thu-coai/OpenMEVA

Benchmark for evaluating open-ended generation

Python517Updated 1 week ago
benchmarkevaluation-metricslanguage-generation
TH
thu-coai/THUOOP

清华大学面向对象程序设计课程 课程材料及答疑

10510Updated 1 week ago
TH
thu-coai/LRM-Safety-Study

No description provided.

Python60Updated 1 week ago
TH
thu-coai/ShieldLM

ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors [EMNLP 2024 Findings]

Python22610Updated 1 week ago
TH
thu-coai/LongSafety

[ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models

Python160Updated 2 weeks ago
TH
thu-coai/CodePlan

No description provided.

172Updated 2 weeks ago
TH
thu-coai/ccm

This project is a tensorflow implement of our work, CCM (Commonsense Conversational Model).

Python21966Updated 2 weeks ago
TH
thu-coai/BARREL

[ICLR 2026] BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs

Python171Updated 2 weeks ago
TH
thu-coai/ComplexBench

Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)

Python10212Updated 3 weeks ago

Gists

Recent Activity

thu-coai | GitHunt