Full self-improving long-horizon LLM agent with strategy memory, failure analysis, Grok teacher labels, and QLoRA student distillation.
llama agents knowledge-distillation llm chromadb qlora terminal-bench long-horizon-tasks self-improving-agents agentbench
-
Updated
May 26, 2026 - Python