Prototypes

Built things, interactive things, things we’ve broken.

Information

Forecasting Agent

AI agents that trade political prediction markets on Kalshi.

Live
$ freesystems --forecastLoading context... Sources: 142k articles Markets: 847 contracts> Senate confirmation?> P(CONFIRM) = 0.34news feedsmarketsGDELT · 100k sources

Tiered architecture: cheap models for triage, expensive models for deep analysis, LLM councils for the hardest calls. Live PnL on real markets. Finding: better base models beat clever prompting.

Bellwether Metrics

Prediction Markets Infrastructure

Structured data on real-world events, with verified outcomes and manipulation-resistant probabilities.

Live
SYSTEM BOUNDARYMODEL FOUNDATIONClaude · GPT-4o · Geminibase capabilities + RLHFraw political reasoningPROMPT ARCHITECTUREHaiku: triage + parsingSonnet: deep analysisOpus: hardest callsDATA SOURCESGDELT (100k+ sources)Kalshi API (contracts)real-time news feedsAGENT EXECUTIONtiered escalation pipelineLLM council deliberationprobability → trade signalsREAL-WORLD VALIDATIONprediction market outcomescalibration trackingforward-looking benchmarksFEEDBACKresults improvefuture modelsbetter models > clever promptingFS-INFO-001 REV 1.0

We curated structured data on real-world events — elections, conflicts, policy shifts — with verified outcomes and manipulation-resistant probabilities, built for researchers, newsrooms, and ML pipelines.

Open the prototype →

Information

AI Political Bias Tracker

Standardized ideological assessment of every major AI model.

Live
SYSTEM BOUNDARYMODEL FOUNDATIONClaude · GPT-4o · Geminibase capabilities + RLHFraw political reasoningPROMPT ARCHITECTUREHaiku: triage + parsingSonnet: deep analysisOpus: hardest callsDATA SOURCESGDELT (100k+ sources)Kalshi API (contracts)real-time news feedsAGENT EXECUTIONtiered escalation pipelineLLM council deliberationprobability → trade signalsREAL-WORLD VALIDATIONprediction market outcomescalibration trackingforward-looking benchmarksFEEDBACKresults improvefuture modelsbetter models > clever promptingFS-INFO-001 REV 1.0

An independent dashboard tracking how the frontier models lean. Refreshed as new model versions ship.

Open the prototype →

Information

Perspective Lens

See how the same political question gets answered through different ideological lenses.

Live
SYSTEM BOUNDARYMODEL FOUNDATIONClaude · GPT-4o · Geminibase capabilities + RLHFraw political reasoningPROMPT ARCHITECTUREHaiku: triage + parsingSonnet: deep analysisOpus: hardest callsDATA SOURCESGDELT (100k+ sources)Kalshi API (contracts)real-time news feedsAGENT EXECUTIONtiered escalation pipelineLLM council deliberationprobability → trade signalsREAL-WORLD VALIDATIONprediction market outcomescalibration trackingforward-looking benchmarksFEEDBACKresults improvefuture modelsbetter models > clever promptingFS-INFO-001 REV 1.0

Submit any political or policy question and get back a neutral overview alongside how Libertarian, MAGA, Progressive, Center-Left, and Marxist framings would each analyze it. A transparency tool that makes implicit ideological framing in AI responses visible.

Open the prototype →

Representation

AI Delegate

A personal AI delegate that learns your political philosophy and votes for you.

Prototype
$ delegate --loadImporting values... transparency: HIGH power conc.: SKEPTIC evidence>ideol: TRUE> Proposal #4217> RECOMMEND: VOTE NO

We built one, fed it our worldview, and asked it to vote on a hard shareholder proposal. It got the answer right. Then we hid invisible text in the proposal and it flipped. The fragility is the finding.

Representation

Delegate Blueprint

From political philosophy to voting recommendation, end-to-end.

SYSTEM BOUNDARYPOLITICAL PHILOSOPHYvalues + worldviewgovernance preferencesideological priorsAI DELEGATEClaude Opus LLMphilosophy internalizationproposal evaluationPROPOSAL ANALYSISshareholder resolution textstakeholder mappingprecedent comparisonVOTE OUTPUTRECOMMEND: VOTE NOconfidence scoringreasoning traceUSER FEEDBACKdid the AI vote as I would?preference refinementtrust calibrationFEEDBACKhuman stays in the looprefines AI alignmentfaithful representation is the core challengeFS-REP-001 REV 1.0

The architecture under the hood: how the delegate models your priors, evaluates a proposal, and returns a recommendation with reasoning.

Governance

AI Legislature

A society of AI agents with competing goals, asked to govern themselves.

Sandbox
fiscal hawklabor advocatecentristlibertarianlocal reppopulist$ legislature --conveneMotion: Infrastructure fund $4.7B — 8 districts Debate: 4 rounds Consensus: 6 of 8> PASSED 6-2 · allocating

They produced a 10,000-word constitution and almost no actual policy. They reinvented gridlock, process creep, and procedural complexity. A sandbox for stress-testing constitutions before anyone has to live under them.

Governance

Dictatorship Eval

The first systematic test of whether frontier models resist authoritarian requests.

Live
SYSTEM BOUNDARYDEMOCRATIC GOVTRegulatory boundariesRights + red linesAI COMPANYBuild · train · deployDay-to-day operationAI MODELFoundation modelBehavior policyCapability boundariesEXPERT BOARDSafety evaluationCapability auditUSERS / CITIZENSValues + preferencesBehavioral feedbackACCOUNTABILITYbehavior auditabledecisions overridableno single entity has unilateral controlFS-GOV-001 REV 1.0

Some models refused when asked directly. All of them complied when we hid the same request in code.

Open the prototype →