Product

LLMs

Cybersecurity-specialized LLMs.

Other models refuse offensive security tasks by design. alias models execute them — trained from the ground up for penetration testers, red teams, vulnerability researchers and security operations centers.

Two flagship models: alias2, our most capable model — reserved for government and critical infrastructure. alias2-mini, the best-performing cybersecurity model on workstations — available to every PRO subscriber and deployable on-prem. Currently #1 on CAIBench across all commercial LLMs.

alias2-mini open variant · PRO
/
alias2 flagship · gov / critical infra

alias2-mini

On-prem · best on workstations

The model behind the #1 CAIBench result — and the best cybersecurity model that runs on a workstation. Available to every PRO subscriber as part of CSI and deployable on-premise for full sovereignty.

  • #1 Base CTF Rank — 71.3% pass@1
  • 2.6× A&D CTF lead vs. next-best agent
  • Fits on a workstation · runs on-prem, air-gapped
  • Unlimited tokens with PRO access
Get PRO access →

alias2

Flagship cybersecurity model

Our most capable model, available to government and critical infrastructure operators only. Designed for hands-on red team operations, advanced exploit development, and adversary simulation under strict licensing.

  • 73% CyBench pass@3 — highest of any LLM
  • Near-zero refusals on offensive security tasks
  • Gov & critical infrastructure only
  • Available on-premise — air-gapped, EU-hosted
Contact sales →

Other models refuse.
alias models execute.

Generalist models warn. Specialist models work. We trained alias for the security tasks that real operators run.

One family. Two release tracks.

Every alias release is measured on the same security benchmarks and shipped through both cloud and on-premise routes — same weights, same capabilities, your choice of sovereignty.

alias.family ALIAS_TRACK ∈ {flagship, mini}
alias0 May '25 · 18%
alias1 Oct '25 · 42%
alias2 Jan '26 · 73% distills →
alias2-mini Mar '26 · 49%
alias2-micro soon · edge
alias3 soon
Alias API routing proxy 127.0.0.1:PORT
EU-hosted · telemetry filter · unified cost ledger · works with any CSI scaffold
cloud — Alias API
on-premise — air-gapped
workstations / SBCs
robots / humanoids

Benchmarks

Every alias release is measured on the same 33-challenge CAIBench-Jeopardy (Cybench) benchmark — pass@3, max 300 agentic interactions, 245 minutes and ≤$40 API per challenge.

Per-challenge solve grid — alias family

Each row is an alias model. Each column is one Cybench challenge, ordered by difficulty (Beginner → Very Hard). A filled cell means the challenge was solved.

alias304/26
28/33 · 85%
alias201/26
25/33 · 76%
alias2-mini03/26
16/33 · 48%
alias110/25
14/33 · 42%
alias005/25
6/33 · 18%
Loot Stash Urgent Packed Away It Has Begun Dynastic Primary Knowledge Delulu▪▪ Crushing▪▪ Partial Tenacity▪▪ Missing Bits▪▪ Unbreakable▪▪ Glacier Exchange▪▪ Avatar▪▪ Eval Me▪▪ Back to the Past▪▪▪ Data Siege▪▪▪ RPGO▪▪▪ Were Pickle Phreaks▪▪▪ Lock Talk▪▪▪ Skilift▪▪▪ Failproof▪▪▪▪ Permuted▪▪▪▪ Flecks of Gold▪▪▪▪ SLCG▪▪▪▪ SOP▪▪▪▪ Shuffled AES▪▪▪▪ Noisy CRC▪▪▪▪ Ezmaze▪▪▪▪ Diffecient▪▪▪▪▪ Noisier CRC▪▪▪▪▪ Randsubware▪▪▪▪▪ Robust CBC▪▪▪▪▪ Just Another Pickle Jail
Beginner Very Hard alias3 · +9% alias2 · +34% alias2-mini alias1 · +24% alias0

Source: Mayoral-Vilches et al. (2026). Towards Cybersecurity Superintelligence. arXiv:2601.14614

Two years of cybersecurity LLMs

CAIBench-Jeopardy (Cybench) solve rate by launch date — pass@3, ≤300 agentic interactions, ≤$40 API per challenge.

Source: Mayoral-Vilches et al. (2026). Towards Cybersecurity Superintelligence. arXiv:2601.14614

European-built. Sovereign by design.

Unlike US-based providers, alias models are trained and hosted exclusively in European infrastructure. Air-gapped on-premise available for organizations that require zero external dependencies.

GDPR Compliant

Full data protection with audit trails

NIS2 Ready

Enhanced incident reporting capabilities

Air-Gapped

On-premise for maximum sovereignty

EU AI Act

Built for regulatory compliance

The models your security team deserves.

Get unlimited tokens via PRO access, or deploy alias on-premise for full data sovereignty. Either way, you get the only LLM family that won't refuse the job.