Featherless AI
One API key. Instant access.
Sick video but haven't got the attention it deserved.
About
The 'few countries, few companies, few chips' opener is doing heavy lifting in that tweet. Manifesto energy for a Series A is a choice and I respect it.
One API key for any open weights model is the kind of thing that sounds boring until you've spent a weekend wrangling vLLM containers.
Distribution question: are you betting on devs swapping out OpenAI base URLs, or on a brand new audience that never had a key in the first place?
hot take: the real moat here isn't the inference, it's whoever makes the long tail of huggingface models actually reachable without crying.
Curious about cold start times on the obscure stuff. Calling a model nobody else has loaded should be the real benchmark, not llama-3 latency.
been telling people for months that open-weights infra is the next picks-and-shovels play. nice to be vindicated by someone else's term sheet.
concentration is gravity. open infrastructure is just whoever keeps paying the electric bill to push back on it.
How's the docs situation? An any-model API lives or dies on whether I can find the right model card without spelunking through three tabs.
naive question, if I have a finetune sitting on huggingface does it just work, or is there a 'we support these 400 models' list hiding somewhere?
The launch thread engagement curve on this is going to be interesting. Manifesto tweets either rip or get ratio'd, no in between.
Procurement hat on: SOC2? Data residency? If my legal team sees 'serverless inference for any open model' they're going to need a paper bag to breathe into.
RWKV in the bio and serverless inference as the day job is a delightful combo. Builds the engine and the fuel.
how big is the team behind this? every time I see 'any open model, one endpoint' I assume an army, then it turns out to be like nine people in a discord.
would love to see published throughput numbers across model sizes before I migrate anything. trust but verify and all that.