AI & LLM Tools
Curated directory of LLM providers, agent frameworks, and self-hostable AI tools — with links straight to the source.
79 curated links across 12 sections
Everything we reach for when building AI-powered features. If you're wiring up an LLM, running inference locally, or stitching agents together into workflows, start here. Each link goes straight to the provider — no affiliate tracking, no wrapper sites.
Frontier LLM providers
Hosted APIs with state-of-the-art models.
- OpenAIFeaturedPaid
GPT-5, GPT-4.1, o-series reasoning models. Responses API is the stateful default; chat/completions still works.
LLM providerAPI - AnthropicFeaturedPaid
Claude Sonnet, Opus, Haiku. Strong at long context, tool use, and following complex instructions.
LLM providerAPI - Google AI StudioFreemium
Gemini Pro / Flash / Nano with huge context windows and native multimodal input. Free tier is generous.
LLM providermultimodal - xAIPaid
Grok models via Twitter-adjacent API. Competitive reasoning and real-time web grounding.
LLM provider - MistralFreemium
Open-weight and hosted models out of Paris. Strong European option with EU data residency.
LLM provideropen-weight - CoherePaid
Command R / R+ models optimized for RAG and enterprise workloads. Strong rerank + embed endpoints.
LLM providerRAG
Fast inference & open models
Cheap, fast hosted inference for Llama, Mistral, DeepSeek, and friends.
- GroqFeaturedFreemium
LPU inference — hundreds of tokens/sec for Llama, Mixtral, Qwen. Ridiculous latency ceilings.
inferencelow-latency - Together AIPaid
Hosts 100+ open models with OpenAI-compatible APIs. Great for switching providers without rewriting code.
inferenceopen models - Fireworks AIPaid
Fast serverless inference for open-weight models with fine-tuning support.
inferencefine-tuning - ReplicatePaid
Run any open-source model behind an HTTP API. Great for vision / audio / niche models.
inferencemulti-modal - Hugging FaceFeaturedFreemium
The open-source model hub. Datasets, weights, Spaces, Inference Endpoints — the whole ecosystem.
model hubdatasets - DeepInfraPaid
Pay-per-token OpenAI-compatible API for open models. Aggressive pricing.
inference
Router & gateway
One API key to try many models, or smart routing across providers.
- OpenRouterFeaturedFreemium
Unified API for 300+ models across every major provider. Budget caps, fallback routing, BYO keys.
gatewaymulti-provider - PortkeyFreemium
Observability + guardrails + routing layer for production LLM apps. 50+ providers.
gatewayobservability - HeliconeFreemium
Open-source LLM observability. Proxies requests and captures logs, cost, latency.
observabilityopen-source
Run LLMs locally
Inference on your own laptop or server. No cloud, no data leaving the box.
- OllamaFeaturedOpen-source
One-command local LLM runtime. `ollama run llama3.2` and you're off. REST API baked in.
localCLI - LM StudioFree
Desktop app for chatting with local models. Great for non-technical users.
localdesktop - llama.cppOpen-source
The CPU/GPU inference engine that started the local-LLM revolution. Powers most of the stack.
localinference engine - vLLMOpen-source
High-throughput batched inference for serving open-weight models in production.
inference engineserving
Agent & orchestration frameworks
Libraries for building multi-step AI applications — chains, tools, memory, routers.
- LangChainFeaturedOpen-source
The most widely-used framework for chaining LLM calls, tools, and retrievers. TS + Python.
frameworkagents - LangGraphOpen-source
State-machine-style agent framework from the LangChain team. Durable workflows + checkpointing.
agentsworkflow - LlamaIndexOpen-source
Data framework optimized for RAG — indexing, retrieval, query engines. TS + Python.
RAGframework - Vercel AI SDKFeaturedOpen-source
TypeScript SDK for streaming chat UIs on Next.js / React / Svelte. Provider-agnostic.
TypeScriptUI - MastraOpen-source
TypeScript-first agent framework with memory, tools, workflows, and eval built in.
TypeScriptagents - CrewAIFreemium
Multi-agent framework organized around role-based crews. Popular in the no-code adjacent space.
agentsmulti-agent
Workflow & automation builders
Visual canvases for stitching AI steps, APIs, and triggers into workflows.
- n8nFeaturedFreemium
Self-hostable workflow automation with 400+ integrations and native LLM nodes. The go-to Zapier alternative.
workflowself-hosted - FlowiseOpen-source
Drag-and-drop UI for LangChain flows. Self-hostable, open-source, great for prototyping agents.
visualself-hosted - LangflowOpen-source
Visual authoring for LangChain pipelines. Big community, plays well with LangGraph.
visualopen-source - ZapierFreemium
The canonical no-code automation platform. AI actions across 7000+ apps.
workflowno-code
Self-hostable chat UIs
Run your own ChatGPT. Point at any OpenAI-compatible endpoint and you're ready.
- LibreChatFeaturedOpen-source
Open-source chat UI supporting OpenAI, Anthropic, Gemini, Bedrock, and local providers. Active community.
chat UIself-hosted - OpenWebUIOpen-source
Polished WebUI built for Ollama + any OpenAI-compatible API. One of the cleanest self-hosted options.
chat UIself-hosted - AnythingLLMOpen-source
Desktop + Docker app that turns documents into a private RAG-powered workspace.
RAGdesktop - TypingmindPaid
Polished chat UI that lives in your browser, keys stay local. Supports every major provider.
chat UIBYO key
Video AI & generation
Text-to-video, image-to-video, and AI avatar tools. The space is moving fast — these are the platforms shipping models people actually use in production.
- OpenAI SoraFeaturedPaid
OpenAI's flagship text-to-video model. Long-clip generation with strong physical consistency. Bundled with ChatGPT Plus / Pro tiers.
text-to-videoOpenAI - Google VeoFeaturedPaid
Google DeepMind's text-to-video. Veo 3 ships native audio + lip sync. Available via Gemini, Vertex AI, and the Flow filmmaking app.
text-to-videoGoogle - RunwayFeaturedFreemium
Gen-3 / Gen-4 video models. The platform of choice for filmmakers — camera controls, motion brush, lip sync, and full editing suite.
text-to-videofilmmaking - PikaFreemium
Text-to-video with PikaSwaps, PikaScenes, and lip-sync. Strong on stylized, social-friendly clips.
text-to-video - Luma Dream MachineFreemium
Ray2 model with fast generation, image-to-video, and keyframe control. API available for builders.
text-to-videoAPI - Kling AIFreemium
Kuaishou's video model — strong physics, long durations, and competitive prompt adherence. Popular for cinematic shots.
text-to-video - MiniMax HailuoFreemium
MiniMax's video gen. Standout for realistic motion and human subjects. API via the MiniMax platform.
text-to-videoAPI - HeyGenFreemium
AI avatars for talking-head videos — realistic lip sync from a script in any language. Used heavily for product demos and marketing.
AI avatarslip sync - SynthesiaPaid
Enterprise-grade AI video presenters. 230+ avatars, 140+ languages, custom-avatar pipeline. Locked-down compliance story.
AI avatarsenterprise - D-IDFreemium
Turn a single photo into a talking avatar. Real-time conversational agents and historical-figure recreations.
AI avatars - HunyuanVideoFeaturedOpen-source
Tencent's open-weight 13B video model — open-source, runnable locally with enough GPU. The current OSS quality leader.
open-weightself-hostable - Mochi 1Open-source
Genmo's open-source 10B video diffusion model under Apache 2.0. Solid baseline for self-hosted research and fine-tuning.
open-weightresearch
Audio AI: speech, voice, music
Text-to-speech, speech-to-text, voice cloning, and music generation. Audio finally feels solved — these are the providers worth paying for.
- ElevenLabsFeaturedFreemium
Best-in-class TTS, voice cloning, dubbing, and conversational voice agents. The default for shipping production audio.
TTSvoice cloning - OpenAI WhisperFeaturedOpen-source
Open-source multilingual speech-to-text from OpenAI. Runs anywhere — laptop, server, edge — and powers half the audio stack.
STTopen-source - AssemblyAIFreemium
Speech-to-text plus audio intelligence — diarization, sentiment, summarization, PII redaction. Great DX.
STTaudio intelligence - DeepgramFeaturedFreemium
Real-time and batch STT with industry-leading latency. Nova-3 model, voice agent stack, and streaming WebSocket APIs.
STTreal-time - CartesiaFreemium
Sonic TTS at sub-100ms latency. The pick for real-time voice agents that need to feel alive, not robotic.
TTSreal-time - Hume AIFreemium
EVI — empathic voice interface that reads tone and adapts. Useful where pitch and pacing matter as much as words.
TTSemotion - PlayHTFreemium
Ultra-low-latency TTS optimized for voice agents. 600+ voices, 100+ languages, instant voice cloning.
TTSvoice cloning - Resemble AIPaid
Voice cloning, real-time dubbing, and an audio-watermark / deepfake-detection toolkit. Enterprise focus.
voice cloningdeepfake detection - SpeechmaticsPaid
STT with broad accent and dialect coverage. Strong on hard audio (meetings, accents, noisy mics).
STTenterprise - SunoFeaturedFreemium
Text-to-music with vocals — full songs in seconds. The most popular consumer-grade music generator on the market.
music genvocals - UdioFreemium
Text-to-music in the same league as Suno, with stem separation and longer compositions.
music gen - Stable AudioFreemium
Stability AI's text-to-audio model — instrumentals, sound effects, and stems. Open-weights checkpoints available.
music genopen-weight
Evals, observability, guardrails
Measure, monitor, and control your LLM apps in production.
- LangfuseFeaturedFreemium
Open-source LLM observability + evals + prompt management. Self-hostable and cloud.
observabilityopen-source - LangSmithFreemium
LangChain's commercial observability + eval platform. Tight LangChain / LangGraph integration.
observabilityevals - BraintrustPaid
Production-grade eval platform for shipping LLM features safely. Dataset + experiment tracking.
evals - PromptfooOpen-source
CLI + YAML for running prompt / red-team tests. Great in CI.
evalsCLI
LLM evaluation frameworks & leaderboards
Eval libraries you wire into CI, and the public leaderboards you check before picking a model. Don't ship an LLM feature without one of each.
- DeepEvalFeaturedOpen-source
Open-source pytest-style framework for LLM unit tests — hallucination, toxicity, bias, RAG faithfulness. From Confident AI.
frameworkPython - RagasFeaturedOpen-source
Reference-free metrics built specifically for RAG pipelines — context precision, faithfulness, answer relevancy.
RAGframework - TruLensOpen-source
Open-source library for tracing + scoring LLM apps. RAG triad metrics and feedback functions out of the box.
frameworkPython - OpenAI EvalsOpen-source
OpenAI's official eval framework + registry of public benchmarks. Reference implementation for model-graded evals.
frameworkbenchmarks - Inspect AIOpen-source
UK AI Safety Institute's eval framework — capability, agent, and safety evals with sandboxed tool use.
frameworksafety - Arize PhoenixOpen-source
Open-source observability + eval for LLMs and agents. Traces, datasets, and prompt experiments in one tool.
frameworkobservability - lm-evaluation-harnessOpen-source
EleutherAI's harness — the de facto standard for academic benchmarks (MMLU, GSM8K, HellaSwag, ARC, etc.).
benchmarksacademic - LMSYS Chatbot ArenaFeaturedFree
Crowd-sourced ranking via blind A/B votes from real users. The most-cited public leaderboard in the field.
leaderboardhuman eval Hugging Face's standardized benchmark leaderboard for open-weight models. Run on Eleuther's harness.
leaderboardopen-weight- LiveBenchFree
Contamination-resistant benchmark with monthly-refreshed questions. Maintained by Abacus AI + NYU + Meta.
leaderboardbenchmark - HELMFree
Stanford CRFM's Holistic Evaluation of Language Models — multi-metric scenarios across many models.
leaderboardacademic Coding-focused leaderboards: polyglot benchmark across six languages, plus refactor + edit-format scoring.
leaderboardcodingSide-by-side comparison of frontier models on quality, latency, cost, and context window. Updated frequently.
leaderboardcomparison
Vector databases & RAG stores
Where your embeddings live.
- PineconeFreemium
Managed vector DB with the most mature enterprise feature set. Serverless tier is cheap.
vector DB - QdrantOpen-source
Open-source Rust vector DB. Self-hostable or managed cloud. Great filtering story.
vector DBopen-source - WeaviateOpen-source
Open-source vector DB with hybrid search and module-based embedders. Good GraphQL API.
vector DBopen-source - pgvectorFeaturedOpen-source
Vector similarity search for Postgres. If you already have Postgres, start here.
Postgresextension - TurbopufferPaid
Serverless vector + full-text search built on object storage. Cheap at scale.
vector DBserverless
Other resource directories
Links on this page point to third-party sites. We pick them on merit — no affiliate tracking, no paid placements. Spot something outdated or missing? Open an issue.