Open-Source LLMs (Llama, Mistral, Qwen)
Freely available language models for self-hosting. Llama 4, Mistral Large 2 and Qwen 3 lead the field.
Category: Models & PlatformsSMB Relevance: 3/5Cost: HighTech. Barrier: high
Open-source models have reached proprietary model levels in 2026 [1]. Llama 4 offers 10M token context, Mistral excels for European GDPR scenarios [2]. However: self-hosting requires GPU servers (from ~EUR 5,000 for smaller models, EUR 50,000+ for the large ones; or EUR 500-5,000/month cloud GPU rental). For most SMBs, API access to these same models via providers like Together AI or Mistral is far more pragmatic and cheaper than running own infrastructure.
KI-Videoproduktion
AI-generated videos for marketing, training, and e-commerce. Avatar tools and generative video models.
Category: Models & PlatformsSMB Relevance: 4/5Cost: MediumTech. Barrier: low
The AI video market grows at 34% CAGR [1]. Avatar tools (HeyGen, Synthesia) reduce production from 30-60 to under 5 minutes [2]. Generative models (Runway, Veo) approach broadcast quality after Sora shut down due to costs [1].
KI-Workflow-Automatisierung
No-code/low-code platforms with native AI integration. n8n, Make, and Zapier as leading tools.
Category: Tools & AgentsSMB Relevance: 5/5Cost: FreeTech. Barrier: medium
n8n has established itself with native LangChain integration and self-hosting as a GDPR-compliant alternative to Zapier and Make [1][2]. Typical SMB use cases: lead qualification, invoice processing, RAG agents.
Agentic Coding IDEs
AI-powered development environments with autonomous agents. Cursor, Windsurf, Copilot, and Claude Code.
Category: Tools & AgentsSMB Relevance: 3/5Cost: Low (<€50/month)Tech. Barrier: high
2026 marks the agentic IDE era: Cursor (Cloud Agents, from $20/mo), Copilot (Coding Agent for GitHub Issues, from $10/mo), Windsurf (Cascade, from $15/mo), and Claude Code (terminal agent with 1M context) [1][2].
KI-gestützte Buchhaltung
Cloud accounting with AI receipt recognition and automatic booking. sevdesk, Lexware Office, BuchhaltungsButler.
Category: Tools & AgentsSMB Relevance: 5/5Cost: Low (<€50/month)Tech. Barrier: low
German cloud accounting tools increasingly use AI in 2026: automatic receipt recognition, booking suggestions, DATEV export. sevdesk is the test winner in a 12-tool comparison [1]. Lexware Office from EUR 7.90/month [1].
KI-Agenten
Autonomous AI systems that independently plan and execute multi-step tasks.
Category: Tools & AgentsSMB Relevance: 4/5Cost: MediumTech. Barrier: high
LLMs have essentially become agent operating systems in 2026 [1]. They work reliably for well-defined tasks. Framework choice depends on the use case [2]. Most pragmatic for SMBs via n8n or similar no-code tools.
Lokale Inferenz-Server
LLM inference on own hardware without cloud. Ollama, LM Studio, and vLLM as leading tools.
Category: Data & InfrastructureSMB Relevance: 4/5Cost: FreeTech. Barrier: medium
Ollama (250k+ GitHub stars) is the de facto standard for local LLM inference [1]. The MLX backend speeds up Apple Silicon by up to 93% [2]. LM Studio adds a GUI for less technical users [1].
Vektorsuche in PostgreSQL
RAG directly in PostgreSQL via pgvector. No separate vector database needed for most SMB cases.
Category: Data & InfrastructureSMB Relevance: 4/5Cost: FreeTech. Barrier: medium
pgvector + pgvectorscale achieves 471 QPS at 99% recall on 50M vectors [1]. If you use PostgreSQL (Supabase, Neon), no separate vector DB needed. Supabase offers Row Level Security for RAG documents [2].
Context Engineering
The evolution of prompt engineering: designing the entire context window as information architecture.
Category: Methods & PracticeSMB Relevance: 4/5Cost: FreeTech. Barrier: medium
Popularized by Anthropic: designing the entire context window as information architecture, not just individual prompts [1]. System instructions, tool schemas, documents, examples as an interface contract. All major providers converge on this methodology [2].
RAG vs. Fine-Tuning (Architekturentscheidung)
When RAG, when fine-tuning, when both? The most important architecture decision for AI projects.
Category: Methods & PracticeSMB Relevance: 3/5Cost: FreeTech. Barrier: high
RAG for dynamic knowledge (4-8 weeks setup), fine-tuning for consistent style and domain expertise (8-16 weeks) [1]. Best systems combine both [2]. For SMBs, RAG is almost always the right starting point.