Author Image

Hi, I am Hongjin

Hongjin Qian

Researcher @ BAAI

My research sits at the intersection of information retrieval and large language models — specifically, how AI systems can actively search, reason over, and synthesize knowledge to answer complex, real-world questions.

Recent work focuses on LLM-based deep research agents that autonomously decompose multi-step tasks, retrieve heterogeneous evidence, and produce grounded answers — including reward-driven search (InfoFlow), deep research for reasoning models (WebThinker), and agentic memory (MemoBrain). This builds on a foundation in retrieval-augmented generation: memory-augmented architectures (MemoRAG) and information-foraging-guided reasoning (Scent of Knowledge).

Research Interests
Information Retrieval Retrieval-Augmented Generation LLM Agents Deep Research Search & Reasoning Knowledge-Intensive NLP
Background

News

Research

Agent
2025-Present 13
AI Agents represent the next evolution of LLMs, moving from passive conversation to active task execution.
Publications
Retrieval-Augmented Generation (RAG) is a method that first retrieves relevant information from an external knowledge source and then combines it with the model’s input to generate more accurate and informative responses.
Publications
Conversational search is an interactive search paradigm where users and systems engage in a dialogue, allowing queries, clarifications, and refinements across multiple turns to iteratively reach more accurate and context-aware results.
Publications
Dialogue System, QA System, Ranking, Retrieval, Theory, etc.
Publications

Experiences

1
Assistant Research Fellow
Peking University

Aug 2025 - Present, Beijing, China

Responsibilities:
  • Memory-Enhanced Agent
  • Agentic Search

Postdoctoral Researcher
Peking University

Oct 2024 - Present, Beijing, China

Responsibilities:
  • Memory-Enhanced LLMs
  • Efficiency KV cache techniques
2

3
Research Intern
Wechat Group, Tencent.

Jun 2023 - Oct 2023, Beijing, China

Responsibilities:
  • LLM for IR
  • LLM for QA

PhD. Researcher
Renmin University of China.

Sept 2020 - Jun 2024, Beijing, China

Responsibilities:
  • Personalized Intelligence
  • Conversational Intelligence
  • Information Retrieval
4

5
NLP Engineer
Elensdata.

Jan 2019 - Sept 2020, Beijing, China

Elensdata is a start-up company which offers high-calibre data science/AI solutions that help real businesses, in media, finance, etc.

Responsibilities:
  • Core NLP Toolkit for Chinese, English and other languages
  • NLP Applications in Multiple Domains (Financial, Security, Media etc.)
  • Large-Scale Pretrained Language Model and Text Generation
Patents Statistics
20 Patents in Total
18 Granted Patents
6 First-Inventor Patents
Academic Service
Reviewer / PC Member:
Neurips, ICLR, ICML, ACL, EMNLP, MM
EACL, ACL ARR, SIGKDD, theWebConf, TOIS
Projects & Grants
Hierarchical Memory-Enhanced Knowledge Reasoning for Large Language Models Jan 2026 - Dec 2028

This project focuses on exploring techniques to expand the knowledge scale and memory scale at the input stage. The goal is to overcome the limitations of current LLMs in complex knowledge reasoning, knowledge memorization, and global knowledge understanding. This will be achieved by constructing a hierarchical memory mechanism that enables the scaling, memorization, and dynamic, coordinated retrieval of multi-source, heterogeneous knowledge.

MemoRAG Aug 2024 - Present

MemoRAG is a next-generation retrieval-augmented generation system with long-term memory, enabling superior context-aware information retrieval and enhanced performance on complex tasks where traditional RAG systems struggle.

Infomatica Aug 2025 - Present

Informatica is a comprehensive collection of systematic research projects focused on deep research systems. Our mission is to provide open-source, scalable frameworks, datasets, data synthesis methods, models, and demonstrations.

Patents 8