Behaviour Models in System Modelling

11h

Anthropic Releases New Open-Source Tool That Evaluates How AI Models Behave

Dubbed Bloom, the AI tool creates a series of scenarios to test an AI model for a particular behavioural trait.

Anthropic launches Bloom to help researchers understand how AI models behave in real situations

Anthropic has launched Bloom, a new open-source tool designed to help researchers understand how advanced AI models behave in real-world situations, making it easier to study alignment, safety, and ...

Tech Xplore on MSN

Can AI read humans' minds? A pedestrian behavior model is shockingly good at it

In a striking leap toward safer self-driving cars, researchers at Texas A&M University College of Engineering and the Korea ...

Why complex reasoning models could make misbehaving AI easier to catch

In a new paper from OpenAI, the company proposes a framework for analyzing AI systems' chain-of-thought reasoning to understand how, when, and why they misbehave.

Business Insider

Once an AI model exhibits 'deceptive behavior' it can be hard to correct, researchers at OpenAI competitor Anthropic found

You're currently following this author! Want to unfollow? Unsubscribe via the link in your email. Follow Lakshmi Varanasi Every time Lakshmi publishes a story, you’ll get an alert straight to your ...

ZDNet

Hide inaccessible results

Anthropic Releases New Open-Source Tool That Evaluates How AI Models Behave

Anthropic launches Bloom to help researchers understand how AI models behave in real situations

Can AI read humans' minds? A pedestrian behavior model is shockingly good at it

Why complex reasoning models could make misbehaving AI easier to catch

Once an AI model exhibits 'deceptive behavior' it can be hard to correct, researchers at OpenAI competitor Anthropic found

AI models know when they're being tested - and change their behavior, research shows

New AI model uses behavior data from Apple Watch for better health predictions

OpenAI unveils specs for desired AI model behavior

OpenAI found features in AI models that correspond to different ‘personas’