"topic:emnlp2025" — Search

17 results for “topic:emnlp2025”

Official Triton kernels for TopK and HierarchicalTopK Sparse Autoencoder decoders.

emnlpemnlp2025interpretabilityllmsaesparse-autoencodertriton

[EMNLP 2025]Repository for paper "DiMo-GUI: Advancing Test-time Scaling in GUI Grounding via Modality-Aware Visual Reasoning"

Python293Updated 8 months ago

emnlp2025guimodality

parameterlab/leaky_thoughts

Source code of "Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers" EMNLP 2025

Python171Updated 2 months ago

contextual-privacyemnlpemnlp2025llmllm-safetyprivacyreasoning-language-modelsresearch

shimo-lab/modelmap

Embedding language models in probability space via log-likelihood vectors

Jupyter Notebook161Updated 4 months ago

acl2025embeddingsemnlp2025information-geometrylanguage-modelllmtsnevisualization

HATS-ICT/PersonaEvolve

[EMNLP 2025 Main] Official Repo for Paper: "Implicit Behavioral Alignment of Language Agents in High-Stakes Crowd Simulations"

C#70Updated 3 months ago

active-shooter-incidentaicomputational-social-scienceemnlpemnlp2025generative-agentsllmllm-agentsnlpsocial-simulation

SecurityLab-UCD/FuzzAug

[EMNLP'25] FuzzAug: Data Augmentation by Coverage-guided Fuzzing for Neural Test Generation

Python61Updated 6 months ago

data-augmentationemnlpemnlp2025llmrusttest-generation

idramalab/quantify-llm-explanations

Evaluating Large Language Models for Detecting Antisemitism

Python41Updated 6 months ago

emnlp2025evaluation-metricsllmnlp

madhavkrishangarg/ReviewEval

ReviewEval: An Evaluation Framework for AI-Generated Reviews

Python31Updated 6 months ago

ai-agentsemnlpemnlp2025peer-review

ApplyU-ai/ResumeBench

Beyond Human Labels: A Multi-Linguistic Auto-Generated Benchmark for Evaluating Large Language Models on Resume Parsing [EMNLP 2025 Main Conference]

20Updated 4 months ago

benchmarkdatasetemnlp2025evaluationlarge-language-modelsllmresumeresume-parser

bhimanbaghel/ResolveUnderOverEdit

Official implementation of "Resolving UnderEdit & OverEdit with Iterative & Neighbor-Assisted Model Editing" (EMNLP 2025 Findings).

Python10Updated 4 months ago

emnlp2025knowledge-editinglarge-language-modelsllmsmachine-learningmodel-editingnatural-language-processingnlppytorchtransformers

Augustus2011/Beyond_One_World

Beyond One World — A benchmark for testing how well LLMs role-play version-specific characters (e.g., superheroes across universes). Covers 30 heroes and 90 canon variants through two tasks: Canon Events (factual recall) and Moral Dilemmas (ethical reasoning). Introduces the Think-Act Matching metrices.

Python10Updated 4 months ago

agentemnlpemnlp2025roleplaywordplay

aauss/temporal-answer-qa

Time to Revisit Exact Match (Findings of EMNLP 2025)

Python10Updated 6 months ago

emnlpemnlp2025evaluationlarge-language-modelsquestion-answeringtemporal-reasoning

cincynlp/FrameID

Code for the paper: "Do LLMs Encode Frame Semantics? Evidence from Frame Identification"

Python11Updated 5 months ago

emnlp2025

Rongite/Persuasion

Code & reproducibility for the EMNLP paper “Profiling LLMs’ Copyright Infringement Risks under Adversarial Persuasive Prompting”: prompts, seeds queries, and figure scripts.

Python00Updated 6 months ago

adversarial-attackscopyrightemnlp2025jailbreakllmnlppersuasionprompting

sinaabbasi1/NormXLogit

The official repo for the EMNLP 2025 paper "NormXLogit: The Head-on-Top Never Lies"

Jupyter Notebook00Updated 4 months ago

emnlp2025explainabilityfaithfulnessinterpretabilityllmnlpplausibilitytransformers

Huang-AI4Medicine-Lab/FigEx

official code repo of EMNLP 2025 paper FigEx: Aligned Extraction of Scientific Figures and Captions

Python00Updated 3 days ago

emnlp2025scientific-discoveryvision-language-model

Ebad-urRehman/AyahVerse-Mahed-SharedTask

This repository contains the code and detailed analysis regarding competition and system paper I will submit regarding MAHED 2025 subtask1(hate and hope speech classification) in Arabic NLP colocated with EMNLP.

Jupyter Notebook00Updated 3 months ago

acl2025arabic-nlpemnlp2025hate-speech-detectionmahed-2025