Glossary

| Term | Definition |
| --- | --- |
| OKF (Open Knowledge Format) | An open specification for storing knowledge as a directory of Markdown files + YAML frontmatter (v0.1, Google Cloud, 2026) |
| Bundle | A directory containing all knowledge files; the unit of distribution (in this project: `wiki/`) |
| Concept | One unit of knowledge = one `.md` file |
| Concept ID | The file's path within the bundle with `.md` stripped, e.g. `tables/orders.md` → `tables/orders` |
| Frontmatter | A YAML block at the top of a file, delimited by `---`, storing metadata |
| Body | The Markdown content below the frontmatter |
| Link | A Markdown link between concepts = a relationship (untyped) |
| Citation | A link from a concept to an external source that supports a claim in the content |
| Reserved file | `index.md` (table of contents) and `log.md` (change log) |
| Progressive disclosure | Showing the table of contents before opening the actual file, which reduces context window overflow |
| Conformance | A bundle following the v0.1 rules (parseable frontmatter + `type` present + reserved files correctly structured) |
| `type` | The only required frontmatter field; specifies the kind of concept |
| Reference | A `type` used for synthesized knowledge (joins, metric definitions), typically under `references/` |
| RAG | Retrieval-Augmented Generation: fetching raw document chunks at query time and stuffing them into context |
| LLM-wiki pattern | A concept (Karpathy) where AI synthesizes knowledge into a continuously maintained Markdown wiki, instead of re-fetching raw sources every time |
| Ingest | The process of taking raw sources and synthesizing them into wiki concepts (should be human-supervised) |
| Contradiction flag | A `> **CONTRADICTION FLAG**: ...` marker added when new information conflicts with existing content |
| BM25 | A keyword-based ranking algorithm that scores document relevance |
| Embedding | A vector representing the semantic meaning of text (generated by a model, e.g. via Ollama) |
| Semantic search | Search by semantic similarity (cosine distance between embeddings) |
| Hybrid search | Combining BM25 + semantic search |
| RRF (Reciprocal Rank Fusion) | A method for merging ranked results from multiple signals: Σ 1/(k + rank) |
| MCP (Model Context Protocol) | An open standard for AI agents to connect to external tools and data |
| MCP server | A service that exposes tools (search/get/propose) wrapping a bundle for agent connections |
| PR-gated | A write model using branch + Pull Request + CI + review |
| Lease/lock | A write model that reserves exclusive rights to a concept with a TTL, preventing write conflicts |
| Lease | A TTL-based reservation that expires automatically, verified by a token |
| Curator | (Model 3) A single agent that aggregates proposals and merges them into the wiki |
| CODEOWNERS | A file defining owners per subtree (used to split responsibility across teams in a monorepo) |
| Federated bundles | Multiple repos/bundles per domain; MCP servers mount multiple bundles, namespaced by bundle name |
| air-gap | An environment with no internet connection (closed network) |
| Gitea / GitLab CE | Self-hostable git servers used as the source of truth |
| `AGENTS.md` | A schema file that tells agents about the structure/rules/workflow (may be named `CLAUDE.md`/`GEMINI.md`) |
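
The Concept ID, Frontmatter, and `type` entries can be illustrated with a small sketch. The function names (`concept_id`, `split_frontmatter`, `has_required_type`) are hypothetical and not part of OKF. The frontmatter reader only handles flat `key: value` lines, so it is a stand-in for a real YAML parser rather than a conformance checker.

```python
from pathlib import PurePosixPath


def concept_id(path: str) -> str:
    """Return the Concept ID: the bundle-relative path with .md stripped."""
    p = PurePosixPath(path)
    if p.suffix != ".md":
        raise ValueError(f"not a concept file: {path}")
    return str(p.with_suffix(""))


def split_frontmatter(text: str) -> tuple[dict[str, str], str]:
    """Split a file into (frontmatter, body).

    Assumption: frontmatter is a block of flat `key: value` lines between
    two `---` lines at the top of the file. Nested YAML is not handled.
    """
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}, text
    for i in range(1, len(lines)):
        if lines[i].strip() == "---":
            meta: dict[str, str] = {}
            for raw in lines[1:i]:
                # Skip comments and indented (nested) lines.
                if ":" in raw and not raw.startswith((" ", "#")):
                    key, _, value = raw.partition(":")
                    meta[key.strip()] = value.strip()
            return meta, "\n".join(lines[i + 1:])
    return {}, text  # no closing delimiter: treat the whole file as body


def has_required_type(text: str) -> bool:
    """Check only the one required field named in the glossary: `type`."""
    meta, _ = split_frontmatter(text)
    return bool(meta.get("type"))
```

For example, `concept_id("tables/orders.md")` returns `"tables/orders"`, matching the example in the Concept ID entry.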
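
The Semantic search entry mentions cosine distance between embeddings. A minimal sketch of that measure on plain Python lists follows; the function names are illustrative, and in practice the vectors would come from an embedding model rather than being typed by hand.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors of equal length."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def cosine_distance(a: list[float], b: list[float]) -> float:
    """Distance form: 0 for identical directions, 1 for orthogonal ones."""
    return 1.0 - cosine_similarity(a, b)
```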
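
The RRF formula, Σ 1/(k + rank), can be sketched as below. Ranks are taken to be 1-based, and `k = 60` is the value commonly used in the RRF literature; whether this project uses that constant is an assumption. Ties are broken by document ID so the output is deterministic.

```python
from collections import defaultdict


def rrf(rankings: list[list[str]], k: int = 60) -> list[str]:
    """Merge several ranked lists of document IDs with Reciprocal Rank Fusion.

    Each list contributes 1 / (k + rank) to a document's score, where rank
    is the document's 1-based position in that list. Documents missing from
    a list simply receive no contribution from it.
    """
    scores: dict[str, float] = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken alphabetically by ID.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical results from the two signals used in hybrid search.
bm25_hits = ["a", "b", "c"]
semantic_hits = ["b", "c", "a"]
fused = rrf([bm25_hits, semantic_hits])
```

Here `b` ranks first because it sits near the top of both lists, which is the behavior hybrid search relies on: agreement between signals counts for more than a single high rank.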
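
The Lease entry (a TTL-based reservation that expires automatically and is verified by a token) can be sketched as an in-memory table. The class and method names are hypothetical, and the sketch assumes a single process; a real deployment would need shared storage and atomic updates.

```python
import secrets
import time
from typing import Callable, Optional


class LeaseTable:
    """In-memory leases keyed by Concept ID (illustrative only)."""

    def __init__(self, clock: Callable[[], float] = time.monotonic) -> None:
        self._clock = clock
        self._leases: dict[str, tuple[str, float]] = {}  # id -> (token, expires_at)

    def acquire(self, concept_id: str, ttl_seconds: float) -> Optional[str]:
        """Return a token if the concept is free or its lease expired, else None."""
        now = self._clock()
        current = self._leases.get(concept_id)
        if current is not None and current[1] > now:
            return None  # someone else holds an unexpired lease
        token = secrets.token_hex(16)
        self._leases[concept_id] = (token, now + ttl_seconds)
        return token

    def is_valid(self, concept_id: str, token: str) -> bool:
        """A write is allowed only with the matching, unexpired token."""
        current = self._leases.get(concept_id)
        if current is None or current[1] <= self._clock():
            return False
        return secrets.compare_digest(current[0], token)

    def release(self, concept_id: str, token: str) -> bool:
        """Give the lease back early; returns False if the token is stale."""
        if not self.is_valid(concept_id, token):
            return False
        del self._leases[concept_id]
        return True
```

Expiry is checked lazily on each call rather than by a background timer, so an abandoned lease frees itself the next time anyone asks for the concept.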