[NLPCC 2022] Kformer: Knowledge Injection in Transformer Feed-Forward Layers
Updated Oct 20, 2022 - Python
A bilingual awesome list for VLM/MLLM knowledge injection research: benchmarks, papers, tools, resources, and ecosystem updates.
📚 Code for my master's thesis "Investigating Knowledge Injection Approaches for Research Field Classification of Scholarly Articles".
MCP server that injects structured professional domain expertise into AI agents: accounting, legal, medical, Islamic finance, real estate, and more.
Command-line interface for interacting with a modular RAG system: query, ingest, and manage knowledge pipelines.
We optimize a compact latent state (with frozen weights) to force failed multi-hop chains to output the missing answer D. Five pre-registered controls show that the optimized state simply injects D: it carries D without the code-fact, leaves intermediates invisible, is inert to hop corruption, and does not transfer. We find no latent composition at 3B (Llama-3.2-3B, Qwen2.5-3B).