AI Memory
Reference · 4 min

References

The full bibliography behind this guide: the papers, open-source projects, and articles that ground every number and design claim on these pages.

Every benchmark number, threshold, and design decision in this guide traces back to one of the sources below. The papers establish the ideas, the open-source projects show how those ideas survive contact with production, and the articles and docs fill the gaps that academic papers leave out (cost, latency, failure modes). Each entry notes what it established and links to the wiki page where it does the most work, so you can read forward from a citation or backward from a claim.

When a figure on this wiki comes from one production system rather than a paper, it is framed as such in context (for example, an embedding dimension or a model-routing choice). Community and forum numbers are reported as claims, not measured facts. The entries here are the primary, verifiable sources.

#Papers

#Foundations: retrieval and context

  • Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela. "Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks." NeurIPS 2020. The origin of parametric vs non-parametric memory and the embed, store, retrieve, generate loop; grounds RAG. https://arxiv.org/abs/2005.11401
  • Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, Hannaneh Hajishirzi. "Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection." ICLR 2024. Retrieve conditionally, then verify grounding; grounds RAG. https://arxiv.org/abs/2310.11511
  • Nelson F. Liu, Kevin Lin, John Hewitt, Ashwin Paranjape, Michele Bevilacqua, Fabio Petroni, Percy Liang. "Lost in the Middle: How Language Models Use Long Contexts." TACL 2024. The U-shaped curve showing that inclusion is not usage, so the best context belongs at the edges of the prompt; grounds context vs memory and hybrid retrieval. https://arxiv.org/abs/2307.03172 (code: https://github.com/nelson-liu/lost-in-the-middle)

#Agent memory architectures

#Graph and hierarchical retrieval

  • Darren Edge, Ha Trinh, Newman Cheng, Joshua Bradley, Alex Chao, Apurva Mody, Steven Truitt, Jonathan Larson. "From Local to Global: A Graph RAG Approach to Query-Focused Summarization." 2024. Entity graph plus Leiden community summaries for global sensemaking, at a heavy indexing cost; grounds knowledge graphs. https://arxiv.org/abs/2404.16130
  • Bernal Jiménez Gutiérrez, Yiheng Shu, Yu Gu, Michihiro Yasunaga, Yu Su. "HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models." NeurIPS 2024. Personalized PageRank over a triple graph does multi-hop in one step, 10 to 30 times cheaper than iterative retrieval; grounds knowledge graphs. https://arxiv.org/abs/2405.14831
  • Bernal Jiménez Gutiérrez, et al. "From RAG to Memory: Non-Parametric Continual Learning for LLMs" (HippoRAG 2). 2025. Adds passage nodes and a recognition-memory filter, and supplies the measured cost and quality comparison across GraphRAG, RAPTOR, and LightRAG. https://arxiv.org/abs/2502.14802
  • Parth Sarthi, et al. "RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval." ICLR 2024. Recursive embed, cluster, and summarise into a tree you can query at any level; grounds hierarchical memory. https://arxiv.org/abs/2401.18059
  • LightRAG. 2024. Dual-level keyword plus graph retrieval with incremental updates, the answer to "must re-index the world"; grounds landscape. https://arxiv.org/abs/2410.05779
  • Zep. "A Temporal Knowledge Graph Architecture for Agent Memory." 2025. The bi-temporal model (event time and system time) where contradictions invalidate rather than delete; grounds temporal memory. https://arxiv.org/abs/2501.13956

#Surveys and benchmarks

#Open-source projects

#Articles, blogs, and documentation

Cite this page

MemoryPlugin. “References.” The Field Guide to AI Memory. https://www.memoryplugin.com/wiki/references.html (2026).