📖 tl;dr Responsible AI
Tag: LLM
4 items with this tag.
Walk the Talk? Measuring the Faithfulness of Large Language Model Explanations (Apr 13, 2025)
Tags: RAI, paper, LLM, explainability, faithfulness, evaluation

Building Safe GenAI Applications - An End-to-End Overview of Red Teaming for Large Language Models (Apr 13, 2025)
Tags: RAI, paper, red-teaming, LLM, safety, evaluation

Safety Alignment Should Be Made More Than Just a Few Tokens Deep (Apr 13, 2025)
Tags: RAI, paper, alignment, safety, LLM, jailbreak

Agent-SafetyBench - Evaluating the Safety of LLM Agents (Apr 13, 2025)
Tags: RAI, paper, agent-safety, benchmark, LLM