IUCN Demo: AI for the Red List Assessor Exam

This demo will mainly be to just give a quick overview of what I’ve built – a demo of how Agentic AI is now at the point where it can reliably pass the Red List Assessor Exam. But then it’ll be great to discuss what this means for the IUCN and the Red List bigger picture.

But before getting into that, I thought I’d briefly give a bit of context on how I started working on this.

I arrived here in Cambridge to start my PhD in Cambridge, eager to do impactful research in AI for conservation. But, leaning on my startup experience, I first wanted to look at what the actual big problems in conservation are, before just force-fitting AI in as a panacea solution. And so it felt like very serendipitous timing that the start of my PhD coincided with the IUCN’s World Conservation Congress, which only happens every four years! I thus registered and attended the conference remotely.

Attending the Opening Ceremony of the IUCN’s World Conservation Congress from a cafe at Jesus College Cambridge

Here, the Red List repeatedly came up as a crucial resource for conservation, but facing significant challenges with scaling assessments and re-assessments. This ChatGPT thread spells it out pretty well, explaining why the Red List is important, what it’s big challenges are, and how AI could help.

Why the Red List is important

What the Red List’s challenges are

How AI could help assessors

This is all good and well in theory, and I know the IUCN is starting to think about this too. Eg we see in the Red List’s 60 Years of Success document:

Screenshot 2025-12-10 at 1.51.04 pm.png

But next I thought okay at the very least, to ground myself more, I should learn more about the actual Red List workflow – see what it takes to become a certified Red List assessor and see from my own experience where AI could be helpful and where it wouldn’t.

So I found the online training course, and did the 12-hour course over the course of a couple weeks:

Screenshot 2025-12-10 at 1.47.35 pm.png

Screenshot 2025-12-10 at 1.49.38 pm.png

At the end I did the exam and got 80%.

Screenshot 2025-12-10 at 1.53.05 pm.png

But then I naturally wanted to see how well AI could do! Turns out pretty good!!

Screenshot 2025-12-10 at 5.46.38 pm.png

Here you can see it averages 86%, better than I can do. At the end I can demo it answering the questions in real-time if you want.

But stepping back, this is just a small example. In reality, the question is: can Agentic AI assist assessors? I see 3 ways: