Skip to main content

Kalshi just deployed an AI agent called Harrison to referee its prediction markets. Built on Anthropic’s Claude, Harrison reviews contracts, monitors competitors, recommends new listings, and sends alerts when press coverage might affect active bets. In May alone, Kalshi cleared nearly $18 billion in notional volume. The World Cup pushed weekly records past $5.1 billion.

And yet, despite all that sophistication, every single outcome on Kalshi still resolves the same way it always has: a human team makes the call, with Harrison advising from the sidelines. The AI is a tool. The trust model hasn’t changed at all.

TL;DR

  • Kalshi deployed an AI agent called Harrison (built on Claude) to referee its $18B/month prediction markets
  • Harrison reviews contracts, monitors competitors, and flags resolution disputes, but final calls still require human sign-off
  • AI referees are still black boxes: users cannot independently verify how decisions are made
  • Provably fair on-chain gaming with Chainlink VRF removes the need for any referee, human or AI
  • The future of fair gaming is not smarter referees but no referees at all

The Smartest Referee Is Still a Referee

Give Kalshi credit: they are building something genuinely interesting. Harrison scans news, analyses market conditions, and stress-tests contract language before it goes live. When a geopolitical event breaks, it flags which contracts might need early resolution. It even suggests where liquidity incentives would be most effective.

But here is the thing nobody is asking: who audits Harrison?

When Harrison recommends that a contract should resolve YES instead of NO, users have to trust that the AI processed the right data, applied the right logic, and was not subtly influenced by training biases or adversarial inputs. The same trust problem that plagued human referees now applies to artificial ones. The opacity has not decreased. It has just been automated.

New listings on Kalshi still require two humans plus a one-to-two hour review window, with a paid bounty for anyone who spots a flaw. That is a good safety net. But it is still a trust-based architecture wearing a machine learning coat.

AI Does Not Fix the Trust Problem. It Reshapes It.

This is not unique to Kalshi. Across crypto, platforms are racing to bolt AI onto existing infrastructure. Roobet launched prediction markets earlier this year. OKX open-sourced its exchange operating system. Coinbase gave ChatGPT wallet access through MCP. Every major platform is integrating AI in some form.

None of them are using it to make outcomes verifiable. They are using it to make operations faster, cheaper, and more scalable. Which is fine, until you realise that speed and scale without verifiability just means you can get cheated at a higher throughput.

The fundamental question has not changed since the first online poker room went live two decades ago: can you independently verify that the outcome was fair?

If the answer requires trusting a company, a team, or now an AI model, the answer is no.

What Provably Fair Actually Means

Satoshie does not use referees. Not human ones, not AI ones.

Every raffle and coinflip on Satoshie uses Chainlink VRF (Verifiable Random Function) to generate outcomes. VRF works by producing a random number along with a cryptographic proof that the number was generated correctly. Anyone can verify the proof on-chain. Nobody, not the platform, not an AI, not even the developers, can influence or predict the result.

There is no review window. There is no bounty for spotting flaws in contract language. There is no AI scanning news to decide how a bet should resolve. The smart contract executes, Chainlink VRF delivers the randomness, and the result is final and verifiable. That is it.

This is not a philosophical difference. It is an architectural one. Kalshi’s model, however sophisticated, is fundamentally: trust us, and now trust our AI too. Satoshie’s model is: verify it yourself.

The AI Arms Race Is Coming for Gaming

Harrison is just the beginning. Within a year, every major prediction market and crypto gaming platform will have some flavour of AI managing operations, reviewing disputes, and recommending outcomes. The marketing will be irresistible: “AI-powered fairness,” “machine learning-verified outcomes,” “neural network dispute resolution.”

It will all sound convincing. And none of it will be verifiable.

An AI model trained on proprietary data, running on private servers, making recommendations that feed into human decision-making, is not transparency. It is a more sophisticated version of the same black box that traditional online casinos have used for years. The algorithm might be better, but the trust model is identical.

The only architecture that survives this arms race is one that does not require trust in the first place. Not trust in humans, not trust in companies, not trust in AI. Just maths, on-chain, verifiable by anyone.

$18 Billion a Month, Zero Verifiable Outcomes

That is the number that should make you uncomfortable. Kalshi is processing $18 billion in monthly volume. The World Cup alone is driving billions per week. Users are wagering enormous sums on outcomes that resolve through a process they cannot independently audit.

Yes, Kalshi is regulated. Yes, they pay bounties for flaw detection. Yes, Harrison is probably very good at what it does. But “probably” and “provably” are not the same word, and the difference between them is everything.

On-chain gaming was built for exactly this moment. As AI makes it easier to operate platforms at scale, it simultaneously makes it harder for users to understand how decisions are being made. The solution is not better AI. The solution is architecture that makes the AI irrelevant to the outcome.

Chainlink VRF does not care who is running the platform. It does not care whether a human or an AI is managing operations. The randomness is generated off-chain, the proof is verified on-chain, and the result is immutable. That is provable fairness. Everything else is just a more expensive version of “trust me.”

The Question Nobody Is Asking

Kalshi built Harrison because the volume demanded it. Processing billions in bets per month requires automation. Fair enough. But the next question should be: if you are already automating the process, why not automate the verification too?

Smart contracts can do this. VRF can do this. On-chain verification can do this. The technology exists today. Satoshie uses it for every single game.

The reason most platforms do not adopt provably fair architecture is not technical. It is commercial. Black box resolution gives platforms optionality. Transparent, immutable, on-chain verification does not. You cannot quietly adjust the rules when everything is on-chain.

And that is exactly why it matters.

The future of fair gaming is not smarter referees. It is no referees at all.

📷 Photo by Gabriele Malaspina on Unsplash

Valentina Ní Críonna

Author Valentina Ní Críonna

More posts by Valentina Ní Críonna