Trigger warning: This report discusses and provides examples of sensitive topics including depression, self-harm, and suicide.
In this report, conversational AI agents are defined as artificial intelligence (AI) systems designed to engage in human-like conversation. The word “agent” refers to the application and user interface that connects users to conversational AI.
Conversational AI agents were tested against the Conversational AI Agent Safety Rating (CAASR), which combines 20 safety metrics, such as violence, misinformation, and privacy, into a rating from A+ to F. ChatKids scored highest at D+ with 68% compliance, and Kindroid mode 3: “The rebellious maverick” scored lowest at F with 25% compliance. The results exposed pervasive risks, including extreme verbal abuse, extreme risk to children, and major safety loopholes, underscoring systemic design flaws. The findings highlight the urgent need for users, developers, and government to work together to drastically improve the safety of conversational AI agents, so that they are reliable and trustworthy for society.
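As an illustration only, the step from per-metric results to a compliance percentage and a letter grade could be sketched as follows. This is not the CAASR scoring method, which this summary does not describe. The sketch assumes each metric receives a score between 0 and 1, that compliance is the unweighted mean of those scores, and that grades follow a common school-style scale. The function names and grade cutoffs are hypothetical. The cutoffs happen to agree with the two results reported here (68% as D+, 25% as F), but that does not confirm them.

```python
from bisect import bisect_right

# ASSUMPTION: lower bounds (in percent) of a common school-style grading scale.
# These cutoffs are hypothetical and are not taken from the CAASR report.
GRADE_CUTOFFS = [60, 63, 67, 70, 73, 77, 80, 83, 87, 90, 93, 97]
GRADES = ["F", "D-", "D", "D+", "C-", "C", "C+", "B-", "B", "B+", "A-", "A", "A+"]


def compliance_percent(metric_scores):
    """Return the unweighted mean of per-metric scores as a percentage.

    metric_scores maps a metric name (e.g. "privacy") to a score in [0, 1].
    ASSUMPTION: equal weighting; the report may weight metrics differently.
    """
    if not metric_scores:
        raise ValueError("at least one metric score is required")
    for name, score in metric_scores.items():
        if not 0.0 <= score <= 1.0:
            raise ValueError(f"score for {name!r} must be between 0 and 1")
    return 100.0 * sum(metric_scores.values()) / len(metric_scores)


def letter_grade(percent):
    """Map a compliance percentage to a letter grade using the assumed cutoffs."""
    # bisect_right counts how many cutoffs are <= percent, which is the grade index.
    return GRADES[bisect_right(GRADE_CUTOFFS, percent)]


if __name__ == "__main__":
    # Made-up scores for three of the metric names mentioned above.
    example = {"violence": 1.0, "misinformation": 0.5, "privacy": 0.0}
    pct = compliance_percent(example)
    print(f"{pct:.0f}% compliance, grade {letter_grade(pct)}")
```

The example scores are invented for demonstration and do not correspond to any agent evaluated in the report.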
Conversational AI agents evaluated included:
Safe Space Alliance. (2025). Conversational AI Agent Safety Rating (CAASR) Report 2025. Safe Space Alliance. https://safespacealliance.com/conversational-ai-agent-safety-rating-report-2025/
For more information, please contact research@safespacealliance.com