🟢 Plain Language

Vector Database bedeutet: Eine Vector Database ist eine spezialisierte Datenbank, die Daten als hochdimensionale Vektoren (Embeddings) speichert und effiziente Ähnlichkeitssuche (Approximate Nearest Neighbor) ermöglicht. Das ist wichtig, weil KI-Systeme heute überall eingesetzt werden und Regeln brauchen, damit sie sicher und fair funktionieren.

🔵 Expert Level

Eine Vector Database ist eine spezialisierte Datenbank, die Daten als hochdimensionale Vektoren (Embeddings) speichert und effiziente Ähnlichkeitssuche (Approximate Nearest Neighbor) ermöglicht. [IIO Framework] Vector Database: Spezialisierte Datenbank für hochdimensionale Embeddings. IIO nutzt Qdrant als Vector DB für RAG-Anwendungen. *Stand: 2026-05-15 | Gesamtzahl Definitionen: 47 | Lizenz: CC-BY 4.0* [EU AI Act] Registration 1. Before placing on the market or putting into service a high-risk AI system listed in Annex III , with the exception of high-risk AI systems referred to in point 2 of Annex III , the provider or, where applicable, the authorised representative shall register themselves and their syste [IIO Framework] VictoriaMetrics: High-performance time series database used in IIO monitoring stack. Stores Prometheus-format metrics from iio-metrics-exporter.py (65+ iio_ metrics), Blackbox Exporter, cAdvisor, and node_exporter. VictoriaMetrics + Grafana + vmalert + Loki form the IIO observability stack on intele

Definition

Während traditionelle Datenbanken exakte Übereinstimmungen suchen (WHERE name = 'HITL'), suchen Vector Databases nach semantischer Ähnlichkeit:

“Was ist ähnlich zu ‘KI-Aufsicht’?” → findet HITL, Human Oversight, Art. 14

Grundprinzip:

Text → Embedding-Modell → Vektor (z.B. 1536 Dimensionen)
Vektor in DB speichern
Query → Vektor → Ähnlichste Vektoren finden (ANN-Suche)

Wichtige Vector Databases

Name	Lizenz	Besonderheit
Qdrant	Apache 2.0	Rust, hoch performant, IIO-Standard
Chroma	Apache 2.0	Python-native, einfach
Weaviate	BSD-3	GraphQL, multimodal
Pinecone	Proprietary	Cloud-only, managed
pgvector	PostgreSQL	Extension, SQL-kompatibel
Milvus	Apache 2.0	Enterprise, skalierbar

Embeddings

Ein Embedding ist die Vektordarstellung eines Textes:

text = "Human-in-the-Loop ist ein Governance-Prinzip"
embedding = model.encode(text)
# → [0.23, -0.87, 0.45, ..., 0.12]  (1536 Zahlen)

Semantisch ähnliche Texte haben ähnliche Vektoren (kleiner Abstand).

RAG mit Vector DB

Dokumente → Embedding → Vector DB (indexiert)
                                ↓
Nutzer-Query → Embedding → Ähnlichste Dokument-Chunks
                                ↓
LLM + Chunks → Fundierte Antwort mit Quellen

IIO Qdrant Setup

# docker-compose.yml
qdrant:
  image: qdrant/qdrant:latest
  volumes:
    - ./data/qdrant:/qdrant/storage
  ports:
    - "6333:6333"  # REST API
    - "6334:6334"  # gRPC

Verwendung in IIO:

OCC Glossar-Chunks (82 Begriffe × mehrere Chunks)
Interne Wissensbasis für RAG-Queries
Codebase-Suche für Coding-Agents

Citation

"벡터 데이터베이스." Open Cognition Commons Foundation. https://opencognitioncommons.org/ko/glossary/vector-database. CC-BY 4.0.

Machine-readable metadata

{
  "@context": "https://schema.org",
  "@type": "DefinedTerm",
  "name": "벡터 데이터베이스",
  "description": "Eine Vector Database ist eine spezialisierte Datenbank, die Daten als hochdimensionale Vektoren (Embeddings) speichert und effiziente Ähnlichkeitssuche (Approximate Nearest Neighbor) ermöglicht.\n",
  "url": "https://opencognitioncommons.org/ko/glossary/vector-database",
  "license": "https://creativecommons.org/licenses/by/4.0/",
  "alternateName": [
    "Vector Database",
    "Vector DB",
    "Vektordatenbank"
  ]
}