dev101.io

AI & LLM Tools

Curated directory of LLM providers, agent frameworks, and self-hostable AI tools — with links straight to the source.

79 curated links across 12 sections

Everything we reach for when building AI-powered features. If you're wiring up an LLM, running inference locally, or stitching agents together into workflows, start here. Each link goes straight to the provider — no affiliate tracking, no wrapper sites.

Frontier LLM providers

Hosted APIs with state-of-the-art models.

  • OpenAI
    FeaturedPaid

    GPT-5, GPT-4.1, o-series reasoning models. Responses API is the stateful default; chat/completions still works.

    LLM providerAPI
  • Anthropic
    FeaturedPaid

    Claude Sonnet, Opus, Haiku. Strong at long context, tool use, and following complex instructions.

    LLM providerAPI
  • Gemini Pro / Flash / Nano with huge context windows and native multimodal input. Free tier is generous.

    LLM providermultimodal
  • xAI
    Paid

    Grok models via Twitter-adjacent API. Competitive reasoning and real-time web grounding.

    LLM provider
  • Mistral
    Freemium

    Open-weight and hosted models out of Paris. Strong European option with EU data residency.

    LLM provideropen-weight
  • Command R / R+ models optimized for RAG and enterprise workloads. Strong rerank + embed endpoints.

    LLM providerRAG

Fast inference & open models

Cheap, fast hosted inference for Llama, Mistral, DeepSeek, and friends.

  • Groq
    FeaturedFreemium

    LPU inference — hundreds of tokens/sec for Llama, Mixtral, Qwen. Ridiculous latency ceilings.

    inferencelow-latency
  • Hosts 100+ open models with OpenAI-compatible APIs. Great for switching providers without rewriting code.

    inferenceopen models
  • Fast serverless inference for open-weight models with fine-tuning support.

    inferencefine-tuning
  • Run any open-source model behind an HTTP API. Great for vision / audio / niche models.

    inferencemulti-modal
  • Hugging Face
    FeaturedFreemium

    The open-source model hub. Datasets, weights, Spaces, Inference Endpoints — the whole ecosystem.

    model hubdatasets
  • Pay-per-token OpenAI-compatible API for open models. Aggressive pricing.

    inference

Router & gateway

One API key to try many models, or smart routing across providers.

  • OpenRouter
    FeaturedFreemium

    Unified API for 300+ models across every major provider. Budget caps, fallback routing, BYO keys.

    gatewaymulti-provider
  • Portkey
    Freemium

    Observability + guardrails + routing layer for production LLM apps. 50+ providers.

    gatewayobservability
  • Helicone
    Freemium

    Open-source LLM observability. Proxies requests and captures logs, cost, latency.

    observabilityopen-source

Run LLMs locally

Inference on your own laptop or server. No cloud, no data leaving the box.

  • Ollama
    FeaturedOpen-source

    One-command local LLM runtime. `ollama run llama3.2` and you're off. REST API baked in.

    localCLI
  • Desktop app for chatting with local models. Great for non-technical users.

    localdesktop
  • llama.cpp
    Open-source

    The CPU/GPU inference engine that started the local-LLM revolution. Powers most of the stack.

    localinference engine
  • vLLM
    Open-source

    High-throughput batched inference for serving open-weight models in production.

    inference engineserving

Agent & orchestration frameworks

Libraries for building multi-step AI applications — chains, tools, memory, routers.

  • LangChain
    FeaturedOpen-source

    The most widely-used framework for chaining LLM calls, tools, and retrievers. TS + Python.

    frameworkagents
  • LangGraph
    Open-source

    State-machine-style agent framework from the LangChain team. Durable workflows + checkpointing.

    agentsworkflow
  • LlamaIndex
    Open-source

    Data framework optimized for RAG — indexing, retrieval, query engines. TS + Python.

    RAGframework
  • Vercel AI SDK
    FeaturedOpen-source

    TypeScript SDK for streaming chat UIs on Next.js / React / Svelte. Provider-agnostic.

    TypeScriptUI
  • Mastra
    Open-source

    TypeScript-first agent framework with memory, tools, workflows, and eval built in.

    TypeScriptagents
  • CrewAI
    Freemium

    Multi-agent framework organized around role-based crews. Popular in the no-code adjacent space.

    agentsmulti-agent

Workflow & automation builders

Visual canvases for stitching AI steps, APIs, and triggers into workflows.

  • n8n
    FeaturedFreemium

    Self-hostable workflow automation with 400+ integrations and native LLM nodes. The go-to Zapier alternative.

    workflowself-hosted
  • Flowise
    Open-source

    Drag-and-drop UI for LangChain flows. Self-hostable, open-source, great for prototyping agents.

    visualself-hosted
  • Langflow
    Open-source

    Visual authoring for LangChain pipelines. Big community, plays well with LangGraph.

    visualopen-source
  • Zapier
    Freemium

    The canonical no-code automation platform. AI actions across 7000+ apps.

    workflowno-code

Self-hostable chat UIs

Run your own ChatGPT. Point at any OpenAI-compatible endpoint and you're ready.

  • LibreChat
    FeaturedOpen-source

    Open-source chat UI supporting OpenAI, Anthropic, Gemini, Bedrock, and local providers. Active community.

    chat UIself-hosted
  • OpenWebUI
    Open-source

    Polished WebUI built for Ollama + any OpenAI-compatible API. One of the cleanest self-hosted options.

    chat UIself-hosted
  • AnythingLLM
    Open-source

    Desktop + Docker app that turns documents into a private RAG-powered workspace.

    RAGdesktop
  • Polished chat UI that lives in your browser, keys stay local. Supports every major provider.

    chat UIBYO key

Video AI & generation

Text-to-video, image-to-video, and AI avatar tools. The space is moving fast — these are the platforms shipping models people actually use in production.

  • OpenAI Sora
    FeaturedPaid

    OpenAI's flagship text-to-video model. Long-clip generation with strong physical consistency. Bundled with ChatGPT Plus / Pro tiers.

    text-to-videoOpenAI
  • Google Veo
    FeaturedPaid

    Google DeepMind's text-to-video. Veo 3 ships native audio + lip sync. Available via Gemini, Vertex AI, and the Flow filmmaking app.

    text-to-videoGoogle
  • Runway
    FeaturedFreemium

    Gen-3 / Gen-4 video models. The platform of choice for filmmakers — camera controls, motion brush, lip sync, and full editing suite.

    text-to-videofilmmaking
  • Pika
    Freemium

    Text-to-video with PikaSwaps, PikaScenes, and lip-sync. Strong on stylized, social-friendly clips.

    text-to-video
  • Ray2 model with fast generation, image-to-video, and keyframe control. API available for builders.

    text-to-videoAPI
  • Kling AI
    Freemium

    Kuaishou's video model — strong physics, long durations, and competitive prompt adherence. Popular for cinematic shots.

    text-to-video
  • MiniMax's video gen. Standout for realistic motion and human subjects. API via the MiniMax platform.

    text-to-videoAPI
  • HeyGen
    Freemium

    AI avatars for talking-head videos — realistic lip sync from a script in any language. Used heavily for product demos and marketing.

    AI avatarslip sync
  • Enterprise-grade AI video presenters. 230+ avatars, 140+ languages, custom-avatar pipeline. Locked-down compliance story.

    AI avatarsenterprise
  • D-ID
    Freemium

    Turn a single photo into a talking avatar. Real-time conversational agents and historical-figure recreations.

    AI avatars
  • HunyuanVideo
    FeaturedOpen-source

    Tencent's open-weight 13B video model — open-source, runnable locally with enough GPU. The current OSS quality leader.

    open-weightself-hostable
  • Mochi 1
    Open-source

    Genmo's open-source 10B video diffusion model under Apache 2.0. Solid baseline for self-hosted research and fine-tuning.

    open-weightresearch

Audio AI: speech, voice, music

Text-to-speech, speech-to-text, voice cloning, and music generation. Audio finally feels solved — these are the providers worth paying for.

  • ElevenLabs
    FeaturedFreemium

    Best-in-class TTS, voice cloning, dubbing, and conversational voice agents. The default for shipping production audio.

    TTSvoice cloning
  • OpenAI Whisper
    FeaturedOpen-source

    Open-source multilingual speech-to-text from OpenAI. Runs anywhere — laptop, server, edge — and powers half the audio stack.

    STTopen-source
  • Speech-to-text plus audio intelligence — diarization, sentiment, summarization, PII redaction. Great DX.

    STTaudio intelligence
  • Deepgram
    FeaturedFreemium

    Real-time and batch STT with industry-leading latency. Nova-3 model, voice agent stack, and streaming WebSocket APIs.

    STTreal-time
  • Cartesia
    Freemium

    Sonic TTS at sub-100ms latency. The pick for real-time voice agents that need to feel alive, not robotic.

    TTSreal-time
  • Hume AI
    Freemium

    EVI — empathic voice interface that reads tone and adapts. Useful where pitch and pacing matter as much as words.

    TTSemotion
  • PlayHT
    Freemium

    Ultra-low-latency TTS optimized for voice agents. 600+ voices, 100+ languages, instant voice cloning.

    TTSvoice cloning
  • Voice cloning, real-time dubbing, and an audio-watermark / deepfake-detection toolkit. Enterprise focus.

    voice cloningdeepfake detection
  • STT with broad accent and dialect coverage. Strong on hard audio (meetings, accents, noisy mics).

    STTenterprise
  • Suno
    FeaturedFreemium

    Text-to-music with vocals — full songs in seconds. The most popular consumer-grade music generator on the market.

    music genvocals
  • Udio
    Freemium

    Text-to-music in the same league as Suno, with stem separation and longer compositions.

    music gen
  • Stability AI's text-to-audio model — instrumentals, sound effects, and stems. Open-weights checkpoints available.

    music genopen-weight

Evals, observability, guardrails

Measure, monitor, and control your LLM apps in production.

  • Langfuse
    FeaturedFreemium

    Open-source LLM observability + evals + prompt management. Self-hostable and cloud.

    observabilityopen-source
  • LangSmith
    Freemium

    LangChain's commercial observability + eval platform. Tight LangChain / LangGraph integration.

    observabilityevals
  • Production-grade eval platform for shipping LLM features safely. Dataset + experiment tracking.

    evals
  • Promptfoo
    Open-source

    CLI + YAML for running prompt / red-team tests. Great in CI.

    evalsCLI

LLM evaluation frameworks & leaderboards

Eval libraries you wire into CI, and the public leaderboards you check before picking a model. Don't ship an LLM feature without one of each.

  • DeepEval
    FeaturedOpen-source

    Open-source pytest-style framework for LLM unit tests — hallucination, toxicity, bias, RAG faithfulness. From Confident AI.

    frameworkPython
  • Ragas
    FeaturedOpen-source

    Reference-free metrics built specifically for RAG pipelines — context precision, faithfulness, answer relevancy.

    RAGframework
  • TruLens
    Open-source

    Open-source library for tracing + scoring LLM apps. RAG triad metrics and feedback functions out of the box.

    frameworkPython
  • OpenAI Evals
    Open-source

    OpenAI's official eval framework + registry of public benchmarks. Reference implementation for model-graded evals.

    frameworkbenchmarks
  • Inspect AI
    Open-source

    UK AI Safety Institute's eval framework — capability, agent, and safety evals with sandboxed tool use.

    frameworksafety
  • Arize Phoenix
    Open-source

    Open-source observability + eval for LLMs and agents. Traces, datasets, and prompt experiments in one tool.

    frameworkobservability
  • EleutherAI's harness — the de facto standard for academic benchmarks (MMLU, GSM8K, HellaSwag, ARC, etc.).

    benchmarksacademic
  • Crowd-sourced ranking via blind A/B votes from real users. The most-cited public leaderboard in the field.

    leaderboardhuman eval
  • Hugging Face's standardized benchmark leaderboard for open-weight models. Run on Eleuther's harness.

    leaderboardopen-weight
  • Contamination-resistant benchmark with monthly-refreshed questions. Maintained by Abacus AI + NYU + Meta.

    leaderboardbenchmark
  • Stanford CRFM's Holistic Evaluation of Language Models — multi-metric scenarios across many models.

    leaderboardacademic
  • Coding-focused leaderboards: polyglot benchmark across six languages, plus refactor + edit-format scoring.

    leaderboardcoding
  • Side-by-side comparison of frontier models on quality, latency, cost, and context window. Updated frequently.

    leaderboardcomparison

Vector databases & RAG stores

Where your embeddings live.

  • Pinecone
    Freemium

    Managed vector DB with the most mature enterprise feature set. Serverless tier is cheap.

    vector DB
  • Qdrant
    Open-source

    Open-source Rust vector DB. Self-hostable or managed cloud. Great filtering story.

    vector DBopen-source
  • Weaviate
    Open-source

    Open-source vector DB with hybrid search and module-based embedders. Good GraphQL API.

    vector DBopen-source
  • pgvector
    FeaturedOpen-source

    Vector similarity search for Postgres. If you already have Postgres, start here.

    Postgresextension
  • Serverless vector + full-text search built on object storage. Cheap at scale.

    vector DBserverless

Other resource directories

Links on this page point to third-party sites. We pick them on merit — no affiliate tracking, no paid placements. Spot something outdated or missing? Open an issue.