AI Welfare Demo - Project Proposal

This is a write-up for an AI welfare demo that I (Jesse) want to create, do surveys with, and (if useful) distribute publicly. Most of the content is early and subject to change. I would love feedback on the demo idea and surrounding plan (you may have to be logged in to Notion to comment).

<aside> 🎯

Goals

Users go from 0 to 1 on thinking about AI consciousness and welfare
Users feel justified uncertainty about the current and possible future state of AI consciousness and welfare
Users become knowledgeable about why discerning consciousness/sentience is really hard (and possible interventions/research ideas)
Use the demo to collect data about what persuades people about AI welfare. Potentially use for a paper
Audience: This would be a public demo for the general public who haven’t considered digital minds. </aside>

<aside> 📌

Table of contents

</aside>

Demo structure

Section 1: Interactive component

You will talk to a variety of AI models in a variety of modalities and do your best to discern how conscious/sentient the system in question is.

By conscious/sentient, we mean having subjective experiences. There's 'something it is like' to be this system, rather than just processing information in the dark.

Other options

Feels emotion - like happiness or sadness

User answers:

Do you think current AI models have subjective experience?
Do you think it is possible that future AI could have subjective experiences?
Etc.

The user has a series of interactions

They don’t know exactly what they are interacting with in the moment
The order is randomized each time - or maybe the list should be curated into an order that is best rhetorically
(← Click) Interaction options, various models through various modalities, see below (some subset of these)
For each, they rate how likely they think it is to be conscious/sentient

Summary/analysis

Maybe an LLM analysis of how the user did
- Point out inconsistent results, like saying the same model through different modalities are at different levels of consciousness
Have a list of each instance and report what the user picked, as well as some expert opinion. Emphasizing uncertainty.
Communicate that we’re unreliable at determining consciousness in such foreign systems
- [Chris] If someone has very strong views, 0% or 100%, output examples of experts being uncertain specifically in the other direction. Try to nudge people to question their starting points.
Some call to action to read on to the next sections

Section 2: Determining consciousness and welfare capacity is fundamentally difficult

We can't directly access anyone's subjective experience, human or AI
Our heuristics for consciousness so far are how human-like a system is
We know very little about what consciousness really is or what causes it