Assessment Is the Conversation

By Andy Szybalski

Think about how a great tutor works.

They ask a question. The student answers. Based on that answer, the tutor adjusts — asks a follow-up, introduces a new concept, backs up and tries a different angle, or pushes deeper. There's no moment where the tutor says "OK, now stop learning — it's time for a test." The assessment IS the conversation. Every exchange is simultaneously teaching, evaluating, and guiding.

Nobody who has ever learned something 1:1 experienced a separation between learning and assessment. It would be bizarre. Imagine hiring a piano teacher who spent 45 minutes teaching, then handed your kid a written exam about piano. You'd fire that teacher.

Yet that's exactly how school works.

We teach for a few weeks, then stop everything for a test. The test produces a number. The number goes in a gradebook. Then we start teaching again.

This isn't how anyone actually learns. So why do we do it?

Because we had to.

When you have one teacher and 25 students, you can't have 25 simultaneous conversations. You can't assess each student in real-time while also teaching. The logistics don't work. So we invented a workaround: batch the teaching, batch the assessment, compare the results. Standardize the questions so grading is efficient. Separate the processes because one person physically cannot do both at once for that many kids.

The entire structure of "teach, then test" is an artifact of scale. It was never a pedagogical ideal. It was an engineering compromise.

And like most engineering compromises, we eventually forgot it was a compromise. We started treating it as the correct way to do things. Assessment became its own discipline, with its own theories and frameworks and billion-dollar testing industries — all built on the assumption that learning and evaluation are separate activities.

They're not. They never were.

What changes when you remove the constraint

At Tradewinds, our class sizes are small enough that teachers can actually have ongoing conversations with each student. But even in a class of 8, one teacher can't be inside every student's head simultaneously.

That's where AI changes the equation.

We're building an open source platform called Rabbithole where every student has a daily block working with an AI tutor. The AI is Socratic — it asks questions, pushes thinking deeper, connects ideas across domains. A student working on ecosystems might get asked: "You said animals depend on plants for food. But what do plants depend on? And what would happen if you removed one piece?" The AI isn't delivering a lesson. It's probing for understanding in real-time.

Running silently behind every one of these conversations is what we call the observer — a separate AI process that reads the transcript and writes down what it saw. Not grades. Not scores. Just: what concepts did this student engage with, how deeply did they understand them, and where should we go next?

The teacher's dashboard in Rabbithole becomes a living map of each student's thinking. Color-coded status orbs show who's engaged, who's in deep work, who might need help — all updating in real time, without a single test administered.

The information problem

Traditional tests are low-resolution. A student gets 78% on a math test. What does that tell you? They missed some questions. Which ones? Why? Did they not understand the concept, make a careless error, run out of time, or freeze up from anxiety? The number doesn't say.

A conversation tells you everything. When a student explains their reasoning to an AI tutor, the gaps become visible immediately. Better yet — misconceptions become visible. "Student believes heavier objects fall faster" tells a teacher exactly where to intervene and exactly what mental model needs revision. "Got 3 out of 5 wrong on the gravity section" tells them nothing.

The observer is specifically designed to treat misconceptions as gold. A well-articulated wrong answer is more informative than a correct one.

Teacher as conductor

This doesn't remove the teacher from the loop. It amplifies what the teacher can do.

The teacher designs the learning experience — choosing which AI persona the student works with, which thinking lens to apply, what topic to focus on. While students work, the teacher monitors every conversation from Rabbithole's teacher dashboard and can inject what we call "whispers" — private guidance to the AI that the student never sees. Something like "she's getting frustrated, try a different angle" or "he's ready for the challenge problem." The AI weaves that guidance into its next response naturally.

It's exactly what a lead teacher does when they lean over to a student teacher and quietly say "try asking her why." Except now one teacher can do it for every student simultaneously.

After each session, the observer suggests "seeds" — things this student should explore next, based on what actually happened in the conversation. A kid fascinated by garden math might get a seed for Fibonacci sequences in sunflower spirals. A kid analyzing game strategies might be ready for the prisoner's dilemma. The teacher reviews and approves these before they become active. The system observes and suggests; the humans decide where to go.

The standards question

One choice we made early: the observer records mastery by concept, not by standard. It writes "Oliver demonstrated Analyze-level understanding of Nash equilibria in a game theory context" — not "Oliver met standard 4.OA.3." Nash equilibria don't appear in any K-8 standard. They're exactly what Oliver was thinking about.

We can still generate a standards compliance view when we need one — for parent conferences, accreditation, grant applications. But standards are a secondary lens, not the organizing principle. Common Core is one small, politically-negotiated subgraph of human knowledge. The real question is: what does this child actually know and care about?

Why this matters for gifted kids especially

Gifted students are often the ones most poorly served by the test-and-move-on model. They ace the test, so the system assumes they're fine. But "getting the right answer" and "being intellectually challenged" are completely different things. A gifted student can score 100% on a fractions test while being bored out of their mind — and the test will never reveal that.

A conversation will. An AI and a teacher who are engaging with a student's actual thinking will discover what that student is ready for, what excites them, where they can go deeper. And they can act on it immediately.

There's a phenomenon in gifted development that's important here: a kid might be able to Create — invent their own notation, design an original solution — before they can reliably Remember standard terminology. They can Analyze a system they can't yet Apply procedurally. In traditional assessment, those gaps look like problems. In a continuous conversation, they look like exactly what they are: asynchronous development. Interesting data, not errors.

The point

Education should feel like one long conversation — a tutor who is constantly teaching, constantly assessing, and constantly nudging toward discovery. The fact that we separated these things was always a concession to logistics, not a reflection of how people actually learn.

Small class sizes and AI make it possible to put them back together. That's what we're building with Rabbithole — and it's open source.

Join Our Founding Class

Fall 2026 admissions are open. Spaces are limited—families are encouraged to apply early.

Apply for Fall 2026

Rolling admissions now open. Spaces are limited.

Start Application →

Schedule a Conversation

Talk with our team about whether Tradewinds is right for your family.

Book a Meeting →

Stay Connected

Get updates on admissions, events, and news from Tradewinds.

A
M
K
L
+296

300+ families on Oʻahu have registered for updates