Back to directory
AI & ML · LLMs

SubquadraticLaunch orchestrated by The Launch Video Company (TLVC)

The first model built for long‑context tasks

Building better algorithms. Co-Founder at @subquadratic
25K followers
TLVC Rating

The launch absolutely crushed it, a great video, a compelling product, and a topic spicy enough to spark real debate. It took off on X almost instantly, hitting 5 million views in just a few hours.

Community Rating
No ratings yet
Your rating
Sign in to rate this launch.

About

Subquadratic is a frontier AI research company launching SubQ, a large language model designed for very long context workloads, alongside SubQ Code, a coding agent built on the same architecture. The product is aimed at developers and AI teams whose agents need to reason across entire codebases, large document corpora, or multi-week task histories without the chunking and retrieval scaffolding that current models require. Where the industry standard is 128,000 tokens for many AI models and up to 1 million tokens for frontier cloud models such as Claude Sonnet 4.7 and Gemini 3.1 Pro, SubQ can manage a context window of up to 12 million tokens, maintain accuracy, increase speed and reduce compute cost. The technical claim behind the launch is architectural rather than incremental. Co-founders Justin Dangel, who is chief executive, and Chief Technology Officer Alexander Whedon told SiliconANGLE the company settled on a proprietary transformer architecture that implements sparse attention, focused on transitioning from a dense, quadratic-scaling architecture to a sparse, linear one. Instead of comparing every token to every other token, SubQ identifies and computes only the relationships that matter, which the company says cuts attention compute by close to 1,000 times at the 12 million token mark and delivers roughly a 52x speedup over FlashAttention at 1 million tokens. On RULER at 128K, SubQ scores 97.1 against Opus 4.6's 94.8, with reported speedups of 7.2x at 128K and 52.2x at 1M in its benchmarks. The launch matters now because long-context performance has become the binding constraint for serious agent work, and most current solutions paper over it with RAG and orchestration layers that add latency and bias. The company was previously called Aldea and worked on speech models before pivoting , and has raised $29 million to date at a $500 million valuation from investors including former SoftBank Vision Fund partner Javier Villamizar and Tinder co-founder Justin Mat een. Early access to both SubQ and SubQ Code is open through subq.ai.
Tags
SeedProduct launchB2BGlobal5M+DemoUSFounder-led
Comments (14)
Sign in to join the discussion.
Ndidi Okafor28d ago

12M context is cute but I need to know if I can pipe my entire Confluence into it without legal having a stroke. Where do you sit on SOC2 and data residency in EU?

Tomás L.28d ago

52x faster than FlashAttention is the kind of number that either changes everything or quietly disappears from the README in six months. Rooting for the former.

Carlos Benavides28d ago

Reply guys are arguing about benchmarks while the real question is whether the founder is going to drop a technical blog post or make us reverse engineer it from a podcast.

Rajdeep Banerjee28d ago

The tweet buried the lede under three bullets and a dash. 'First frontier model with 12M context' should have been line one, not line three.

miracle28d ago

Every long-context demo I've ever seen finds the needle in the haystack and then hallucinates the barn. Show me recall on a 10M token contract review and I'll believe you.

Paolo Marchetti28d ago

What does the API surface look like? Streaming on 12M tokens, rate limits, and is there a self-hosted path or are we all sharing one very tired endpoint?

Yuki Tanabe28d ago

Tagline rewrite, on the house: 'Attention, but it scales.' You can Venmo me later.

Fatima Z.28d ago

The launch video pacing is genuinely good. Whoever cut the architecture diagram into the benchmark reveal earned their paycheck this quarter.

Sven Karlsson28d ago

Sub-quadratic sparse attention has been the academic unicorn for years. If this actually ships in production with real users, half of NeurIPS owes you a drink.

Kavya Iyer28d ago

First three seconds of the demo video are just a logo fade. You had a 52x number and you opened with vibes.

Marc de Vries28d ago

Retention question nobody asks at launch: how many of those 12M tokens does a user actually fill before churning back to a 200k context model that's good enough?

Obi Amadi28d ago

Counterpoint: most people asking for 12M context have a RAG problem they refuse to solve.

Lin Zhao28d ago

Waiting for the inevitable $SUBQ token gated inference tier. Don't do it. (Please don't do it.)

Hema Prasad28d ago

A long context window is a longer rope. Whether you climb or hang yourself with it depends on the eval suite.