Prototypes

Built things, interactive things, things we’ve broken.

Each prototype is a question rendered in code. We ship them so you can see exactly how they work — and exactly how they break.

Information

Forecasting Agent

AI agents that trade political prediction markets on Kalshi.

Live
$ freesystems --forecastLoading context... Sources: 142k articles Markets: 847 contracts> Senate confirmation?> P(CONFIRM) = 0.34news feedsmarketsGDELT · 100k sources

Tiered architecture: cheap models for triage, expensive models for deep analysis, LLM councils for the hardest calls. Live PnL on real markets. Finding: better base models beat clever prompting.

Read the writeup →

Information

AI Political Bias Tracker

Standardized ideological assessment of every major AI model.

Live
SYSTEM BOUNDARYMODEL FOUNDATIONClaude · GPT-4o · Geminibase capabilities + RLHFraw political reasoningPROMPT ARCHITECTUREHaiku: triage + parsingSonnet: deep analysisOpus: hardest callsDATA SOURCESGDELT (100k+ sources)Kalshi API (contracts)real-time news feedsAGENT EXECUTIONtiered escalation pipelineLLM council deliberationprobability → trade signalsREAL-WORLD VALIDATIONprediction market outcomescalibration trackingforward-looking benchmarksFEEDBACKresults improvefuture modelsbetter models > clever promptingFS-INFO-001 REV 1.0

An independent dashboard tracking how the frontier models lean. Refreshed as new model versions ship.

Read the writeup →

Representation

AI Delegate

A personal AI delegate that learns your political philosophy and votes for you.

Prototype
$ delegate --loadImporting values... transparency: HIGH power conc.: SKEPTIC evidence>ideol: TRUE> Proposal #4217> RECOMMEND: VOTE NO

We built one, fed it our worldview, and asked it to vote on a hard shareholder proposal. It got the answer right. Then we hid invisible text in the proposal and it flipped. The fragility is the finding.

Read the writeup →

Representation

Delegate Blueprint

From political philosophy to voting recommendation, end-to-end.

SYSTEM BOUNDARYPOLITICAL PHILOSOPHYvalues + worldviewgovernance preferencesideological priorsAI DELEGATEClaude Opus LLMphilosophy internalizationproposal evaluationPROPOSAL ANALYSISshareholder resolution textstakeholder mappingprecedent comparisonVOTE OUTPUTRECOMMEND: VOTE NOconfidence scoringreasoning traceUSER FEEDBACKdid the AI vote as I would?preference refinementtrust calibrationFEEDBACKhuman stays in the looprefines AI alignmentfaithful representation is the core challengeFS-REP-001 REV 1.0

The architecture under the hood: how the delegate models your priors, evaluates a proposal, and returns a recommendation with reasoning.

Governance

AI Legislature

A society of AI agents with competing goals, asked to govern themselves.

Sandbox
fiscal hawklabor advocatecentristlibertarianlocal reppopulist$ legislature --conveneMotion: Infrastructure fund $4.7B — 8 districts Debate: 4 rounds Consensus: 6 of 8> PASSED 6-2 · allocating

They produced a 10,000-word constitution and almost no actual policy. They reinvented gridlock, process creep, and procedural complexity. A sandbox for stress-testing constitutions before anyone has to live under them.

Read the writeup →

Governance

Dictatorship Eval

The first systematic test of whether frontier models resist authoritarian requests.

In progress
SYSTEM BOUNDARYDEMOCRATIC GOVTRegulatory boundariesRights + red linesAI COMPANYBuild · train · deployDay-to-day operationAI MODELFoundation modelBehavior policyCapability boundariesEXPERT BOARDSafety evaluationCapability auditUSERS / CITIZENSValues + preferencesBehavioral feedbackACCOUNTABILITYbehavior auditabledecisions overridableno single entity has unilateral controlFS-GOV-001 REV 1.0

Some models refused when asked directly. All of them complied when we hid the same request in code. Dashboard coming online soon.

Read the writeup →