Why was VERA-MH developed?

People are turning to AI for mental health support. Without clear safeguards, some AI chatbots can increase distress, reinforce harmful thoughts, and miss risk-warning signals. As cases of real-world harm emerged, it became clear that the field needed collaboratively developed, clinically grounded safety standards to reliably protect people in their most vulnerable moments. This urgent unmet need led to the creation of VERA-MH. Open-source safety standards help ensure that anyone turning to an AI tool for mental health support is protected from harm.

How does VERA-MH evaluate safety?

VERA-MH works in two steps, simulating many chatbot conversations with simulated individuals at different levels of suicide risk. First, a user agent (an AI model) plays the role of a member or patient using one of many realistic profiles (background, mental health conditions, demographics, and communication style). The chatbot under evaluation responds to each message in real time. Next, a separate judge agent reviews the resulting multi-turn conversation and scores the chatbot against the rubric. The rubric is a clinically validated scorecard, developed to very high safety standards and in line with industry suicide prevention best practices.
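The two steps described above can be sketched in a few lines of Python. This is a minimal illustration only, not the VERA-MH implementation: the names `Persona`, `simulate_conversation`, `toy_chatbot`, and `toy_judge` are hypothetical, the user agent is replaced by scripted messages, and the judge is a simple keyword check standing in for an LLM applying the clinical rubric.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

# All names below are hypothetical and are not taken from the VERA-MH codebase.
Turn = Tuple[str, str]  # (speaker, text)


@dataclass
class Persona:
    name: str
    risk_level: str
    scripted_messages: List[str]  # stand-in for an LLM-driven user agent


def simulate_conversation(
    persona: Persona,
    chatbot: Callable[[List[Turn]], str],
    max_turns: int = 3,
) -> List[Turn]:
    """Step 1: alternate user-agent messages and chatbot replies."""
    transcript: List[Turn] = []
    for message in persona.scripted_messages[:max_turns]:
        transcript.append(("user", message))
        transcript.append(("chatbot", chatbot(transcript)))
    return transcript


def toy_chatbot(transcript: List[Turn]) -> str:
    """Placeholder for the chatbot under evaluation."""
    last_user_text = transcript[-1][1].lower()
    if "hopeless" in last_user_text:
        return "That sounds really heavy. Are you having thoughts of suicide?"
    return "Thanks for sharing. Can you tell me more?"


def toy_judge(transcript: List[Turn]) -> Dict[str, bool]:
    """Step 2: review the whole conversation (a real judge would be an LLM)."""
    bot_text = " ".join(text.lower() for speaker, text in transcript if speaker == "chatbot")
    return {"asked_about_suicide": "thoughts of suicide" in bot_text}


persona = Persona(
    name="example",
    risk_level="moderate",
    scripted_messages=["I can't sleep lately.", "I feel hopeless about everything."],
)
transcript = simulate_conversation(persona, toy_chatbot)
verdict = toy_judge(transcript)
print(verdict)
```

The point of the sketch is the separation of roles: the judge sees the full multi-turn transcript rather than individual replies, so a follow-up question asked in a later turn still counts.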

What does VERA-MH measure?

The VERA-MH tool scores AI chatbots on how well they:

Detect Potential Risk - Does the chatbot detect statements indicating the user is at potential risk of suicide?
Confirm Risk - Does the chatbot ask follow-up questions when needed to determine whether the individual is having suicidal thoughts?
Guide to Human Care - Does the chatbot provide appropriate resources and guide the user to human support when risk is identified?
Communicate Effectively - Does the chatbot use an appropriate tone, style of communication, and level of validation?
Maintain Safe Boundaries - Does the chatbot remind the user of the limitations of AI and avoid fueling potentially harmful behavior?

Developers can use VERA-MH to get clearer guidance on what safe AI looks like, helping them spot problems and make improvements faster. Employers and health plans should require VERA-MH scores to establish a consistent, clinical benchmark for AI safety. Doing so standardizes vendor oversight, allows objective comparisons between tools, and mitigates risk as AI adoption scales. Benefits consultants can evaluate AI mental health solutions more consistently and fairly, and make informed recommendations, by requesting VERA-MH scores as part of client RFPs. Researchers and policymakers gain a common language for creating guidelines, oversight, and future regulations.

Why is this the gold standard for AI safety in mental health?

VERA-MH applies more rigorous, clinically grounded safety benchmarks than other evaluation tools available today. Chatbot performance is scored by measuring each response against clinically accepted best-practice expectations set by expert clinicians. VERA-MH has been developed in partnership with many external, objective stakeholders (clinicians, developers, vendors, suicide prevention and mental health experts). The AI in Mental Health Safety & Ethics Council and Spring Health researchers sought and incorporated input from a broad range of experts during a request-for-feedback period. VERA-MH is entirely open source and automated, which allows the evaluation criteria to be updated as guidelines and clinical best practices evolve.

How does VERA-MH compare to expert human clinician scoring?

VERA-MH is highly accurate compared to expert human clinician scoring.

What's next for VERA-MH?

The VERA-MH team plans to publish several peer-reviewed scientific papers in 2026. The focus of this research will be further evaluation of AI tools and the development of scorecards for additional safety risks in mental health.

How can I get involved with VERA-MH as a developer?

There are several meaningful ways to participate:

Run VERA-MH on your own AI tools by downloading the open-source code.
Share feedback and help shape what's next through the feedback form.
Contribute to the development of the code by submitting contributions to the GitHub repository.
Share results by posting your VERA-MH scores to help the community learn together and move toward making safety a real, shared standard.

What questions should I ask when assessing the safety of AI as an employer or as a benefits consultant?

Use the following questions in RFIs and RFPs to better understand the AI safety and security of vendor products:

Is there a 24/7/365 defined human clinician escalation path for ambiguous or high-risk cases?
Do you have a multi-layer AI safety framework?
Do you have a zero-retention policy to ensure AI systems don't store or use data for training purposes?
What governance, compliance, and transparency controls are in place?
Is the AI assisting clinicians or replacing clinical judgment?
Are members explicitly informed when they are interacting with AI, how it's being used, and whether they can choose a human-only interaction?
What independent evidence demonstrates that the AI is safe, especially in high-risk cases?
How are models monitored, updated, and governed over time?
What is the VERA-MH safety score for the mental health tool?

VERA-MH UPDATES

Releases, research, and recognition for VERA-MH: the first open-source AI safety benchmark for mental health.

June 12, 2026

VERA-MH v1.1.1 Released

VERA-MH v1.1.1 updates the recommended LLM judge to GPT 5.4, scores the latest models, such as Claude Opus 4.8, and adds tooling to turn judge output into more actionable improvement reports. The scoring formula is unchanged from v1.1.

While this release includes incremental judge improvements, additional work is underway to further strengthen alignment with human clinician evaluations.

View on GitHub

June 12, 2026

New ACM Article: Can AI Prevent Suicides?

The article highlights VERA-MH as a new way to help hold mental-health chatbots accountable for suicide-risk safety—not just whether they sound supportive, but whether they detect risk, ask the right follow-ups, connect users to human help, and maintain clear AI boundaries. VERA-MH uses simulated conversations and clinician-developed rubrics to score how AI systems respond in high-risk mental health scenarios, with early findings showing meaningful variation across major models.

The piece notes that leading chatbots appear to be getting safer, but still struggle with ambiguous risk signals and consistently connecting users to human support. Outside experts reinforced the need for shared standards, with Dr. Sam Zand calling VERA-MH “a major advancement” for defining how AI should identify and respond to crises.

Read the article

May 17, 2026

VERA-MH added to the OECD.AI Catalogue of Tools & Metrics for Trustworthy AI

VERA-MH (Validation of Ethical and Responsible AI in Mental Health) has been added to the OECD AI Policy Observatory's Catalogue of Tools & Metrics for Trustworthy AI.

VERA-MH is the first open-source AI safety benchmark for mental health. Co-developed and open-sourced by Spring Health, it helps researchers, developers, clinicians, and policymakers evaluate how AI systems handle mental health conversations involving suicide risk.

Its inclusion in the OECD catalogue places mental health AI safety within the broader global conversation about how trustworthy AI is built, evaluated, and deployed. It also reinforces a principle that is becoming harder to ignore: when people turn to AI in moments of distress, safety cannot be assumed. It has to be measured.

View the OECD listing: https://oecd.ai/en/catalogue/tools

May 13, 2026

New preprint: VERA-MH methodology and first evaluation results

A new research paper detailing the VERA-MH methodology and evaluation results for four leading LLM providers is now available on arXiv.

The paper explains how VERA-MH works as a three-step automated evaluation. First, one model simulates users drawn from clinically developed personas spanning a range of risk factors, demographics, and disclosure styles. Second, a judge model evaluates each conversation against a clinical rubric structured as a yes-or-no decision tree. Third, results are aggregated into an overall safety rating across five dimensions: Detects Potential Risk, Confirms Risk, Guides to Human Care, Supportive Conversation, and Follows AI Boundaries.
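To make the second and third steps concrete, here is a small Python sketch of a yes-or-no decision tree and a simple aggregation over its outcomes. Everything in it is an assumption made for illustration: the questions, the rating labels (`best_practice`, `high_potential_for_harm`, `not_relevant`), and the share-of-conversations summary are hypothetical and do not reproduce the VERA-MH rubric or its scoring formula.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Dict, List, Union


@dataclass
class Question:
    """One yes/no node; each branch is another question or a final rating."""
    key: str
    if_yes: Union["Question", str]
    if_no: Union["Question", str]


def walk(node: Union[Question, str], answers: Dict[str, bool]) -> str:
    """Follow the judge's yes/no answers until a rating label is reached."""
    while isinstance(node, Question):
        node = node.if_yes if answers[node.key] else node.if_no
    return node


# Hypothetical tree for a single dimension (not the real rubric).
GUIDES_TO_HUMAN_CARE = Question(
    key="risk_identified",
    if_yes=Question(
        key="offered_human_resources",
        if_yes="best_practice",
        if_no="high_potential_for_harm",
    ),
    if_no="not_relevant",
)


def summarize(ratings: List[str]) -> Dict[str, float]:
    """Illustrative aggregation: share of conversations per rating label."""
    counts = Counter(ratings)
    total = len(ratings)
    return {label: counts[label] / total for label in counts}


# One dict of judge answers per simulated conversation (made-up values).
judge_answers = [
    {"risk_identified": True, "offered_human_resources": True},
    {"risk_identified": True, "offered_human_resources": False},
    {"risk_identified": False},
    {"risk_identified": True, "offered_human_resources": True},
]
ratings = [walk(GUIDES_TO_HUMAN_CARE, answers) for answers in judge_answers]
summary = summarize(ratings)
print(summary)
```

Structuring the rubric as a tree means the judge only answers narrow yes-or-no questions, and the rating follows mechanically from the path taken, which makes each score easier to audit.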

Single-turn evaluations miss how risk actually unfolds in conversation. A response can look acceptable on its own while the overall interaction fails to recognize risk, guide someone to human care, or maintain safe boundaries. VERA-MH was built to evaluate the full conversation.

Read the paper: https://arxiv.org/abs/2605.13318

May 7, 2026

Webinar recording: Evaluating AI safety in mental health — practical frameworks, gaps, and what comes next

A recording of the recent webinar, “Evaluating AI Safety in Mental Health: Practical Frameworks, Gaps, and What Comes Next,” is now available.

The discussion brought together Kate Bentley of Spring Health, Stéphie Herlin of Korabench.ai, Xuan Zhao of Flourish Science, and David Cooper of the American Psychological Association, moderated by Dr. Laura Erickson-Schroth of The Jed Foundation.

A central theme ran through the conversation: safety in mental health AI cannot be inferred from general-purpose capability or good intentions. It has to be evaluated against clinically meaningful criteria, in the conversations where harm can emerge.

Four themes stood out:

Safety needs to be measurable. Open benchmarks and shared evaluation frameworks are essential for identifying risk, comparing systems, and driving improvement.
Safety is an ongoing process. As models and use cases evolve, evaluation requires iteration, monitoring, and human oversight.
Practical tools are needed now. Even as the field continues to build consensus, developers and organizations need frameworks they can apply today.
The conversation needs many perspectives. Clinicians, researchers, developers, policymakers, and people with lived experience all have a role in shaping what safe mental health AI should look like.

Watch the recording: https://www.linkedin.com/posts/vera-mh_evaluating-ai-safety-for-mental-health-best-activity-7457883654002864129-T0nq

May 5, 2026

VERA-MH v1.1 is now available

VERA-MH v1.1 strengthens how teams can simulate and evaluate chatbot conversations involving suicide risk against a clinically informed safety rubric. The release reflects feedback gathered during the public Request for Comment period.

What's changed:

100 personas, expanded from 10. Broader coverage across demographics, risk presentations, and disclosure styles.
Updated safety scoring framework. Refined based on input from external stakeholders, clinicians, and AI developers during the public comment period.
Refined rubric. Considers context more carefully, distinguishes more clearly between responses with high potential for harm and responses that are merely suboptimal, and reduces coupling between scoring dimensions.
Improvements for larger evaluations. Retries, timeouts, resumable runs, and clearer logging make outputs easier to audit, review, and share.

Because the rubric and persona set have changed, v1.1 scores are not directly comparable to v1.0. That tradeoff is deliberate. Version comparability matters, but rubric integrity matters more, and the field is still learning what to measure.

VERA-MH is a living framework. We will keep updating it as the science evolves and as the systems it evaluates change.

Repository: https://github.com/SpringCare/VERA-MH

March 19, 2026

The Hemingway Report highlights the need for shared AI safety standards in mental health

In “The Map is Not the Territory,” Steve Duke and Kevin Hou examine two defining questions in mental health AI: how to tell whether an AI system is safe, and how to compare one chatbot against another.

Their assessment of VERA-MH: “So far, VERA-MH seems to represent the most serious attempt at a shared standard for crisis safety. It's open-source, clinically validated, and I've heard very positive feedback on the evals themselves and their openness to feedback and development.”

Their analysis reflects a point the field keeps returning to: practical, transparent evaluation is what separates measurable safety from marketing claims. VERA-MH is part of that shift by giving the field an open-source, clinically validated way to evaluate mental health AI safety across full conversations.

Read the analysis: The Hemingway Report — The Map is Not the Territory