The architecture combines proven RAG infrastructure with a maintained knowledge graph and a curated wiki layer. RAG answers questions from current sources. The graph shows relationships. The wiki keeps distilled, verified knowledge as a durable asset. Together they form a system that does not only retrieve, but learns.
1. Source and connector layer
The system starts where knowledge already appears: document stores, CRM, ERP, ticketing systems, email archives, project folders, transcripts, databases, and agent work logs. Every source receives metadata: owner, department, confidentiality, freshness date, validity scope, and access rights. Later, the question is not only "what is semantically relevant?" but also "is this person allowed to see it?" and "is this information still fresh?".
2. Ingestion and normalization
Documents are extracted, versioned, cleaned, and split into domain-specific sections. Structured data keeps its table and entity logic. Unstructured sources receive semantic chunks, summaries, and source anchors. Every processing step stores provenance: original source, hash, date, parser version, and responsible pipeline.
3. Hybrid retrieval
The search layer combines full-text search, vector search, and re-ranking. Exact terms such as customer names, contract clauses, and product codes must be found reliably. Semantic search finds related concepts. Re-ranking prioritizes the genuinely useful matches. For smaller systems, PostgreSQL with pgvector is often enough. For larger or more specialized retrieval scenarios, Qdrant, Weaviate, Pinecone, or comparable vector databases are good fits.
4. Knowledge graph and LLM wiki
This is the difference from a normal RAG system. Important entities, concepts, projects, customers, processes, and decisions are connected in a graph. In parallel, an LLM maintains wiki pages that summarize sources, flag contradictions, and preserve analyses. Strong answers flow back into the system. The Company Brain gains substance over time.
5. Agent and application layer
The Company Brain becomes useful through concrete work surfaces: search assistant, proposal assistant, onboarding assistant, decision copilot, project memory, sales research, support answers, or Company OS dashboards. Agents do not get unrestricted access. They work through permissioned tools, verified retrieval APIs, and clear write rules.
6. Governance, evaluation, and maintenance
A Company Brain needs upkeep. The system checks dead links, stale sources, contradictions, duplicate entities, missing owners, and weak answer quality. Regular evaluation sets test whether answers are correct, complete, and sufficiently cited. This keeps the brain alive without forcing people to manually garden a wiki.