Top Repositories
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models
Chinese safety prompts for evaluating and improving the safety of LLMs. 中文安全prompts,用于评估和提升大模型的安全性。
A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset
Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"
[EMNLP'24] CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models
ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems
Repositories
109No description provided.
The official repository of the paper: COLD: A Benchmark for Chinese Offensive Language Detection
ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems
Data and codes for ACL 2021 paper: Towards Emotional Support Dialog Systems
[AAAI'25] CharacterBench: Benchmarking Character Customization of Large Language Models
Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"
AISafetyLab: A comprehensive framework covering safety attack, defense, evaluation and paper list.
A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset
No description provided.
[MM'25] JPS: Jailbreak Multimodal Large Language Models with Collaborative Visual Perturbation and Textual Steering
Chinese safety prompts for evaluating and improving the safety of LLMs. 中文安全prompts,用于评估和提升大模型的安全性。
Official github repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety. [ACL 2024]
No description provided.
IF-RewardBench: Benchmarking Judge Models for Instruction-Following Evaluation
书籍《现代自然语言生成》介绍
一个中文心理健康支持问答数据集,提供了丰富的援助策略标注。可用于生成富有援助策略的长咨询文本。
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models
Official Implementation of ICLR25 paper "MAPS: Advancing Multi-modal Reasoning in Expert-level Physical Science"
[EMNLP'24] CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models
No description provided.
[ACL 2025] Guiding not Forcing: Enhancing the Transferability of Jailbreaking Attacks on LLMs via Removing Superfluous Constraints
Benchmark for evaluating open-ended generation
清华大学面向对象程序设计课程 课程材料及答疑
No description provided.
ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors [EMNLP 2024 Findings]
[ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models
No description provided.
This project is a tensorflow implement of our work, CCM (Commonsense Conversational Model).
[ICLR 2026] BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs
Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)