Tirthankar Ghosal
@TirthankarSlg
Followers: 575
Following: 5K
Media: 22
Statuses: 4K
Scientist @ORNL #NLProc #LLMs #peerreview #SDProc Editor @SIGIRForum Org. #AutoMin2023 @SDProc @wiesp_nlp AC @IJCAIconf @emnlpmeeting Prevly @ufal_cuni @IITPAT
Knoxville, TN
Joined January 2017
@WiNLPWorkshop is partnering with the @aaclmeeting D&I Committee to launch a mentorship program for students, early-career researchers, and first-time attendees! https://t.co/u2pBN5wqjl No one should navigate NLP alone. Let's build a community where everyone belongs.
0
2
1
Hugging Face just dropped a 200-page playbook on training LLMs. It covers everything from pre-training to post-training and infrastructure, with real examples of what worked and what didn't. 100% free and open source.
10
28
168
Seriously... why did no one tell me about this?! BrowserOS is a 100% open-source agentic browser alternative to ChatGPT Atlas and Perplexity Comet 🤯 Gave it a spin, it's surprisingly smooth and stable. Repo in 🧵
13
15
61
Agentic RAG and what you need to know about it as an AI Engineer? Simple naive RAG systems are rarely used in real world applications. We are usually adding some agency to the RAG system - ideally a minimal amount. There is no single blueprint on how
11
108
589
I finally understand the fundamentals of building real AI agents. This new paper "Fundamentals of Building Autonomous LLM Agents" breaks it down so clearly it feels like a blueprint for digital minds. Turns out, true autonomy isn't about bigger models. It's about giving an LLM
487
369
2K
What is RAG? What is Agentic RAG? RAG (Retrieval-Augmented Generation) RAG connects a generation model to external knowledge through retrieval. Here's how it works - 1./ A user submits a query. 2./ The system searches a pre-indexed set of
57
240
1K
Combining the benefits of RL and SFT with on-policy distillation, a promising approach for training small models for domain performance and continual learning.
Our latest post explores on-policy distillation, a training approach that unites the error-correcting relevance of RL with the reward density of SFT. When training it for math reasoning and as an internal chat assistant, we find that on-policy distillation can outperform other
100
227
3K
Writing a lit review is easy if you have a strategy. Modern AI tools can also make it fast and efficient. Here is how:
1
7
43
Build MCP AI Agents with reasoning, system prompts, and tool orchestration. Nanobot wraps existing MCP servers into intelligent agents and renders React components directly in chat via MCP-UI. 100% open-source.
10
79
443
Google open-sourced LangExtract Python library! It uses LLMs to extract entities, attributes, and relations with exact source grounding from unstructured documents. Flexible LLM support (Gemini, OpenAI, Ollama) 100% open-source.
17
193
1K
Toolkit for fine-tuning and training large language and vision models
3
43
236
Stanford CS229: Building Large Language Models. This brilliant 1.5h lecture unpacks how ChatGPT-like models are built, from tokenization & scaling laws to training hurdles, benchmarks, SFT/RLHF, and efficiency. Lecture link in 🧵
13
66
412
This 200-Page LLM Paper Is a Goldmine, and it'll save you months. Prompting, training, alignment: finally crystal clear. If you don't have time to read all 200+ pages, here are the most valuable takeaways
14
286
1K
Open-ended reasoning is one of the hardest problems in reasoning LLMs rn. So in this paper, they aim to solve this by reverse-engineering plausible thought chains from good answers via a gradient-free search With DeepWriter-8B trained on this data outperforming top OS models!
7
67
356
If you want to learn Deep Learning from the ground up to advanced techniques, this open resource is a gem. Full notebook suite -> Link in comments
17
312
2K
Still one of the best roadmaps and resource dumps of AI Engineering in 2025. 50 papers, models, blogs across 10 fields in AI Eng: LLMs, Benchmarks, Prompting, RAG, Agents, Vision, Diffusion, Finetuning.
12
158
1K
Finally, an open-source, enterprise-grade RAG solution! If you're building an enterprise-grade RAG system, you'll run into 2 big challenges: - Data scattered across 100s of sources - Need for real-time sync Knowledge bases by MindsDB is an open-source solution that tackles
18
130
742
Our team's AI-Researcher has been accepted by NeurIPS 2025 and selected as a Spotlight! The project has also garnered 2.4K stars on GitHub and made it to the GitHub Trending list. Congratulations to our core team members: Jiabin, Lianghao, and Zhonghang! Over the past six
13
41
323