Posts

Showing posts with the label mental safety

OpenAI's New Under-18 Principles Enhance AI Ethics and Teen Safety in ChatGPT

On December 18, 2025, OpenAI updated its Model Spec, the written set of behavioral expectations that guides how ChatGPT should respond, by adding a new section: Under-18 (U18) Principles. The goal is straightforward: teens (ages 13–17) have different developmental needs than adults, and a “one-size-fits-all” safety posture can create gaps in higher-risk situations. At a high level, the update clarifies how existing safety rules apply in teen conversations and adds age-appropriate guidance where needed. The principles emphasize prevention, clearer boundaries, and stronger encouragement toward real-world support when risks show up. This article explains what the U18 Principles are, why they matter, and what “safe, age-appropriate behavior” looks like in practice, without turning teen safety into vague slogans. If you’re interested in related context on teen safety work, you may also want to read: OpenAI’s Teen Safety Blueprint. TL;DR What changed: OpenAI added ...

OpenAI Launches $2 Million Grant Program to Advance AI and Mental Health Research

Disclaimer: This article provides information for educational purposes only and is not professional advice. Details may change over time. Readers should make decisions based on their own context and consult relevant professionals as needed. OpenAI has introduced a grant program offering up to $2 million to foster research at the intersection of artificial intelligence (AI) and mental health. This initiative aims to explore both the potential benefits and risks of AI applications in mental health care, emphasizing ethical considerations and practical impacts. With submissions open until December 19, 2025, the program invites researchers to propose projects that examine AI's role in mental health diagnosis, treatment, and support. By funding these studies, OpenAI seeks to advance understanding and guide the responsible use of AI in sensitive areas. Overview of OpenAI's Grant Program The OpenAI grant program is designed to support research t...

Evaluating Safety Measures in Advanced AI: The Case of GPT-4o

Temporal & Scope Guidance: This analysis is grounded in the GPT-4o System Card and Preparedness Framework results published in early August 2024. Because GPT-4o is natively multimodal (integrating text, audio, and vision in a single neural network), safety assessments are dynamic. These findings represent the model's state at launch and do not account for emergent vulnerabilities discovered during wider public deployment or subsequent fine-tuning iterations. Use this information at your own discretion; we can’t accept liability for decisions made based on it. Artificial intelligence models like GPT-4o expand what “a single model” can do: not just text, but voice, images, and real-time interaction. That expansion also changes the threat surface. A safety evaluation for a multimodal system is not only about harmful text; it is also about how capabilities combine, how users react to more human-like interaction, and how small failures (like misidentifying a voice or drifting...

Assessing AI Risks: Hugging Face Joins French Data Protection Agency’s Enhanced Support Program

This analysis is based on the regulatory landscape of the European Union and the French CNIL's action plan as of May 2023. As AI governance frameworks are currently under intense negotiation within the European Parliament, the interpretations of data protection law regarding Large Language Models (LLMs) are subject to immediate and significant changes. This content does not constitute legal advice and may not reflect later domestic or international legislative updates. The rapid growth of artificial intelligence (AI) technologies raises urgent questions about knowledge reliability, privacy, and accountability. As foundation models and their “tool ecosystems” move into everyday products, data protection concerns increasingly sit alongside traditional safety concerns: how data is collected, how outputs are generated, and how individuals can exercise their rights when automated systems shape information and decisions. TL;DR Hugging Face has been selected ...