occ / forge
OCC Forge
Forge is the tool used to turn trusted sources into structured knowledge packs. It ingests documents and URLs, extracts concepts, builds LLM Wiki pages, and produces a signed manifest ready for review.
llm wiki pattern
Classic RAG
- Raw sources → chunked at query time
- No knowledge accumulation between queries
- LLM re-discovers context from scratch each time
LLM Wiki (OCC)
- Sources → LLM builds structured wiki → queries hit the wiki
- Knowledge accumulates and cross-references
- Conflicts flagged at ingest time, not at query time
Pattern inspired by Andrej Karpathy's LLM Wiki proposal (April 2026).
pipeline
Sources
URLs or local files — documentation, papers, specifications
Concept extraction
GPT-5 reads each source and extracts a list of concepts with slug, title, and summary
LLM Wiki pages
For each concept, Forge writes or updates a structured markdown wiki page
index.md / log.md
The index is regenerated on every run; the log is append-only with timestamps and source hashes
manifest.yaml
Hub-ready manifest with name, version, domains, source list with hashes, and signature field
Review candidate
The pack enters the OCC review queue for community approval before registry inclusion
core operations
LLM reads a new document, extracts concepts, writes or updates wiki pages, and appends to the log. Existing pages are enriched without losing content. Conflicts are flagged inline.
LLM reads index.md to identify relevant pages, retrieves them, and synthesizes an answer with citations. Retrieval is keyword-based with stop-word filtering.
Health check: detects contradictions, orphaned pages, outdated claims, and missing cross-references. Returns a structured report.