It’s been a hectic 2022 so far, but August is looking a lot calmer; this is the first of hopefully a few blog posts this month catching up on various things. In this post I want to talk about the … Continue reading →
over a year ago

More from Xena

Think of a number: an update

A month or two ago I wrote this post which expressed my frustration with various issues around private datasets as a way of measuring the mathematical abilities of language models. More generally I was frustrated about the difficulty of being … Continue reading →

4 months ago 45 votes

What is a quotient?

Undergraduate mathematicians usually have a hard time defining functions from quotients in Lean, because they have been taught a specific model for quotients in their classes, which is not the model that Lean uses. This post is an attempt to … Continue reading →

5 months ago 55 votes

Think of a number.

My feed was recently clogged up with news articles reporting that Sam Altman thinks that AGI is here, or will be here next year, or whatever. I will refrain from giving even more air to this nonsense by linking to … Continue reading →

6 months ago 49 votes

Can AI do maths yet? Thoughts from a mathematician.

So the big news this week is that o3, OpenAI's new language model, got 25% on FrontierMath. Let's start by explaining what this means. Continue reading →

7 months ago 42 votes

Fermat’s Last Theorem: how it’s going

So I'm two months into trying to teach a proof of Fermat's Last Theorem to a computer. We already have one interesting story, which I felt was worth sharing. Continue reading →

7 months ago 42 votes
