Nan Zhang Profile
Nan Zhang

@NanZhangNLP

Followers
122
Following
285
Media
12
Statuses
99

PhD Student @ISTatPENNSTATE, NLP #NLProc, ML, AI. Ex-intern @SFResearch, @NECLabsAmerica

State College, PA
Joined December 2015
Don't wanna be here? Send us removal request.
@NanZhangNLP
Nan Zhang
4 months
📢 Happy to introduce SiReRAG: our #ICLR2025 paper on RAG indexing!. Facilitating comprehensive knowledge synthesis on multihop reasoning, SiReRAG models both similarity and relatedness signals of a corpus. Code: Paper: (1/N)🧵.
Tweet card summary image
arxiv.org
Indexing is an important step towards strong performance in retrieval-augmented generation (RAG) systems. However, existing methods organize data based on either semantic similarity (similarity)...
1
6
12
@NanZhangNLP
Nan Zhang
2 months
RT @omarsar0: Enhancing RAG with Application-Aware Reasoning. Neat trick to improve RAG systems: give it the relevant knowledge and show it….
0
121
0
@NanZhangNLP
Nan Zhang
2 months
RT @rohanpaul_ai: Brilliant Paper. We need to evaluate reasoning steps separately for knowledge correctness and reasoning quality. LLMs….
0
76
0
@NanZhangNLP
Nan Zhang
3 months
RT @RyoKamoi: 📢 New paper!.FoVer enhances PRMs for step-level verification of LLM reasoning w/o human annotation 🚀.We synthesize training d….
0
26
0
@NanZhangNLP
Nan Zhang
3 months
RT @_philschmid: 100 Days After DeepSeek-R1. What have we learned? Where did we see success? What was most challenging? A Survey on Replica….
0
46
0
@NanZhangNLP
Nan Zhang
3 months
RT @ruizhang_nlp: As Vision Language Models treat images as tokens, high-resolution images create long sequences, similar to long-context c….
0
5
0
@NanZhangNLP
Nan Zhang
3 months
RT @ruizhang_nlp: This work is led by my first PhD student Yusen @YusenZhangNLP, who is graduating soon and actively seeking a postdoc posi….
0
5
0
@NanZhangNLP
Nan Zhang
3 months
RT @YusenZhangNLP: 🚀 How Far Are VLMs from Effective High-Resolution Image Understanding?.👉 We found: Still far. 🆕 Introducing HRScene Ben….
0
5
0
@NanZhangNLP
Nan Zhang
3 months
RT @Alibaba_Qwen: Introducing Qwen3! . We release and open-weight Qwen3, our latest large language models, including 2 MoE models and 6 den….
0
2K
0
@NanZhangNLP
Nan Zhang
3 months
RT @Shiwei_Liu66: Our ICLR 2025 Workshop on Sparsity in LLMs (@sparseLLMs) kicks off with a talk by @DAlistarh on lossless (~1% perf drop)….
0
15
0
@NanZhangNLP
Nan Zhang
4 months
RT @SFResearch: 🔥 Phenomenal Day 1 of poster sessions at #ICLR25! ✨ . Attending tomorrow? Visit us at Booth #G03 to dive into how our groun….
0
3
0
@NanZhangNLP
Nan Zhang
4 months
RT @jasonwu0731: Heading to Singapore for #ICLR25 to present three papers: ReGenesis, BingoGuard, SiReRAG. We are investing heavily on Ag….
0
3
0
@NanZhangNLP
Nan Zhang
4 months
I will join #ICLR2025 virtually and my collaborators are presenting SiReRAG in person during Poster Session 1. We will be #61 at Hall 3 + Hall 2B. I am happy to discuss research on RAG, LLMs compression, and large reasoning models.🤝.
@NanZhangNLP
Nan Zhang
4 months
📢 Happy to introduce SiReRAG: our #ICLR2025 paper on RAG indexing!. Facilitating comprehensive knowledge synthesis on multihop reasoning, SiReRAG models both similarity and relatedness signals of a corpus. Code: Paper: (1/N)🧵.
0
1
10
@NanZhangNLP
Nan Zhang
4 months
RT @sarkarssdas: Try GReaTerPrompt today to supercharge your prompts — whether you're using open-source models or API-based models! 👇.
0
3
0
@NanZhangNLP
Nan Zhang
4 months
RT @ruizhang_nlp: 📢 GreaterPrompt is Now Live!. We're excited to introduce GreaterPrompt, a unified, customizable, and high-performance ope….
0
6
0
@NanZhangNLP
Nan Zhang
4 months
RT @SFResearch: SiReRAG helps identify core bottlenecks in current #AISystems and develop elegant solutions that address them. By integrati….
0
3
0
@NanZhangNLP
Nan Zhang
4 months
RT @ruizhang_nlp: Excited to share SiReRAG! Our #ICLR2025 paper on improving RAG indexing for multihop reasoning. 🔍 SiReRAG combines simil….
0
6
0
@NanZhangNLP
Nan Zhang
4 months
This paper marks a wonderful internship experience at @SFResearch ! A huge shoutout to my amazing collaborators, Prafulla Kumar Choubey, @alexfabbri4 , Gabriel Bernadett-Shapiro, @ruizhang_nlp , Prasenjit Mitra, @CaimingXiong , and @jasonwu0731 !. (N/N).
0
0
2
@NanZhangNLP
Nan Zhang
4 months
Finally, SiReRAG showcases wide applicability on multihop QA across various retrieval methods. We show that SiReRAG significantly improves other non-indexing methods (e.g., reranking and iterative retrieval). (6/N) đź§µ
Tweet media one
1
0
0
@NanZhangNLP
Nan Zhang
4 months
SiReRAG delivers consistent improvement over state-of-the-art baselines on multihop datasets (an average 1.9% F1 gain), which shows the advantage of extensive knowledge synthesis. Baselines focus on either similarity (RAPTOR) or relatedness (HippoRAG and GraphRAG). (5/N) đź§µ
Tweet media one
1
0
0
@NanZhangNLP
Nan Zhang
4 months
SiReRAG stands for RAG indexing of similarity and relatedness: similarity via a recursive chunk-based tree, and relatedness through entity-based synthesis. Both fine-tuned LLMs and GPT-4o effectively extract entities and propositions. (4/N) đź§µ
Tweet media one
1
0
0