Primary behavioural data for frontier AI.

We don't scrape the internet or annotate output. We capture what humans do. Complex multi-modal interactions live, fully instrumented, and structured for frontier AI training.

Powered by Askable  —  a decade of audited, multi-modal human-behaviour research.
01 — PROVENANCE

The OG intelligence source – humans

We record every session live with a named, paid, consenting expert. We can tell you the role, the brief, and the date. No scraped corpora, no orphaned provenance.

02 — MULTIMODAL BY DEFAULT

Screen, voice, keystroke, gaze.

Most data on the open web is text exhaust, and real expert work isn't. We capture the full signal across cursor paths, hesitations, voiced reasoning, and screen state. That's where tacit knowledge lives.

03 — FRONTIER-LAB GRADE

Built for the labs that ship to billions.

Our schema and review pipelines come out of direct conversation with frontier model teams. Every artifact ships with granular consent, attribution chains, and audit trails. The output is ready for the workflows that train frontier models.

THE METHOD

Deep human experience, refined into post-training fuel.

The hard part of training a useful model isn't compute. It's the quality of the human signal underneath. Most teams reach for the same exhausted corpora, then layer annotators on top.

We work the other way. We start with the raw practice of real work, and refine it into structured samples that preserve reasoning, modality, and context.

Same shape as a production pipeline: capture, instrumentation, delivery.

Stage 01 — Production

Sessions, with experts, in their actual environment.

Practitioners doing real work, in the tools they already use. Software, peripherals, voice, screen, artefacts. No simulated tasks or synthetic prompts.

Stage 02 — Instrumentation

Every action timestamped, every modality aligned.

Structured signal from the moment a session starts. Millisecond timing, transcription, decision-point tagging, reasoning capture. Provenance and consent built in.

Stage 03 — Delivery

Schema-conformant, ingestion-ready.

We schema to the partner lab's pipeline, not to a generic format. Sessions arrive ready for direct ingestion — full provenance, expert attribution, trajectory data shaped the way post-training actually needs it. No reformatting. No second-pass cleanup.

SESSION ASK-EL-04 · 00:08:42 SCREEN VOICE KEYSTROKE EXPERT · SR. CARDIOLOGIST · LIVE
00:00 00:30 01:00 VIDEO AUDIO SCREEN REASONING DECISION · 01 DECISION · 02 DECISION · 03 4 LANES · 12 BLOCKS · 3 TAGS · MS-ALIGNED
SESSION.JSON · 12.4KB { "session_id": "ASK-EL-04", "capability": "expert-learner", "expert": { "role": "sr. cardiologist", "consent": "verified" }, "modalities": ["video", "audio", "screen"], "trajectory": { "steps": 128, "decisions": 3, "reasoning_tokens": 4218 }, "provenance": "signed" } SCHEMA · INGESTION-READY · SIGNED
THE LIBRARY

Sample data catalog we've produced

SECURITY POSTURE · 06

Operated by Askable. Audited to the standards your security review expects.

Askable Labs runs on the same audited production platform as Askable. Controls live in code, not in process. Recruitment, consent, capture, tagging, review, and delivery are system calls, not procedures — no spreadsheet, no shared drive, no manual chain of custody.

Askable has operated since 2017, runs an Integrated Management System (IMS), and holds eight independent certifications — ISO/IEC 27001, 27701, and 42001, SOC 2 Type II, GDPR, CCPA, UK Cyber Essentials, and Wiz Cloud Security Excellence.

A production platform, not a services team.

Askable has run since 2017 as a SaaS platform for user research, trusted by over 3,000 clients including teams in banking and health insurance. Every step of a session — recruiting a practitioner, capturing their consent, ingesting the session, tagging the fragments, reviewing the output, delivering the batch — is a system call against that audited platform.

In a services model, each of those steps is a person with a laptop. The system is whoever is most careful that day. In our model, the system is the system.

Same moat as our speed and our quality. Productisation is what makes each of these things hold up at scale, and what makes them hold up under audit.
Session lifecycle — platform-enforced live, audited
01
Recruit
identity-verified panel
02
Consent
brief-specific, versioned
03
Capture
encrypted, tenant-isolated
04
Review
role-scoped, logged
05
Deliver
partner-side audit trail
0  manual handoffs 0  Sheets / Drive in path 100%  controls in code
SECURITY DETAIL · certificate scopes, data lifecycle, subprocessors, contact Open the security page →
PARTNER WITH THE LAB

If you're training the next generation of models, train it with human jet fuel.

We work directly with a small number of frontier labs and applied teams. Bespoke capture briefs, schema co-design, exclusive batches.