Rajiv Shah
@rajistics
Followers
2K
Following
4K
Media
437
Statuses
2K
Chief Evangelist @ContextualAI - slightly funny videos along with practical AI posts, was @huggingface @datarobot @snorkelai
Illinois, USA
Joined September 2014
Heading to the AI Summit next week? Catch my talk on engineering the context layer to build better AI Solutions. I have 4 big lessons for everyone building. Catch the session on the Finance Stage: The Context Layer: Your Shortcut to AI-Driven Alpha Dec 10, 2:25 PM - 2:50 PM
0
0
0
@burkov Groundbreaking, yes. But let’s strip away the marketing gloss. This paper is indeed a technical marvel, but characterizing the dominance of tree-based methods as an "embarrassment" misses the point of why trees ruled in the first place. Here is a more nuanced take on where
4
6
45
Excited that my team’s new doc is out! It’s a pitch for a more pragmatic direction for interpretability. The reasoning: recent work (science of misalignment, eval awareness, lie deception, trawling) has been most successful when it’s focused on problems, not new methods.
The GDM mechanistic interpretability team has pivoted to a new approach: pragmatic interpretability Our post details how we now do research, why now is the time to pivot, why we expect this way to have more impact and why we think other interp researchers should follow suit
0
4
33
An agentic alternative to GraphRAG. We built a Metadata Search Tool to solve reference traversal without the rigid complexity of static graphs. The result? Agents resolve complex queries in fewer steps with higher accuracy. 🧵 1/4
1
5
10
Did a quick post on GSMA Open-Telco LLM Benchmarks and the Telelogs dataset. I compared the AT&T and Huawei approaches for Root Cause Analysis: https://t.co/kxg7DDdn1V
reddit.com
Explore this post and more from the rajistics community
0
0
0
Have some fun, check out the @kaggle Santa Tree Packing Challenge
It’s that time of year again! 🎄 The Santa 2025 - Christmas Tree Packing Challenge is now live! Santa’s got a packing problem - his Christmas trees won’t fit in the boxes! Help him find the smallest square box to fit 1 - 200 trees. 🧑🎄 More info 👇 https://t.co/VzSH5uKC8P
0
0
0
Introducing "The Eiffel Tower Llama"!🗼 Remember Golden Gate Claude? Unfortunately Anthropic's viral demo was shut down after 24h, and key technical details remained hidden. So we recreated it, uncovering key insights on steering LLMs using SAEs⚒️ Full blog post + live demo 👇
9
42
173
LLM-as-a-Judge doesn't scale. After 100+ test cases, you're stuck with: → Binary scoring that treats 95% correct the same as 5%. → Judge prompts that drift every update. → Zero insight into WHY something failed. So we built something better. 🧵
1
5
12
Check out the blost post to learn more: https://t.co/GTysk1Mv5r Great work from: @RulinShao @AkariAsai @shannonzshen @hamishivi @varsha_kishore_ @JingmingZhuo @xinranz3 @IAmSamFin @david_sontag @CoachMurray47 @sewon__min @pdasigi @soldni @faeze_brh @scottyih @tongshuangwu
allenai.org
We introduce Deep Research Tulu (DR Tulu), an open post-training recipe and framework for long-form deep research agents.
0
0
2
@ankrgyl If you do this it’s likely because customers are asking for these generic metrics so it’s hard to resist You could go against the grain and say we help you discover your applications failures, not show you generic metrics And show specific errors in an ad. Maybe this helps
1
2
18
Nano Banana Pro is Amazing This infographic is crazy good and took less than a minute to create. (I have been going through the DR Tulu paper - video soon) Prepare to be overloaded with "useful" infographics (Like all Gen AI, it can hallucinate, so check over everything
0
1
8
Ever use AI to extract structured data from a 400-page financial document, only to spend more time verifying the output than you saved? Template-based tools break on complex docs. LLMs hallucinate. You're stuck manually checking everything anyway. We built something better. 🧵
1
6
11
Stop model shopping. Start learning from users. Feedback & Annotation now in @ContextualAI • Real-time capture • Intelligent categorization • Dashboards for faster fixes Blog post for details: https://t.co/lHueBcRZo2 Try it live: https://t.co/Dc73itsVuo
0
1
4
How to transform UMAP from a black box into a glass box: by using a special type of deep network, we can now compute exact linear equivalents that reveal which features drive each point's position in the embedding. @ArcadiaScience [1/8]
2
40
220
What happens when you build AI for one of the most demanding users in enterprise software? Claimwise is a legal tech company that serves patent attorneys—professionals with advanced degrees in science & law—who need to verify every single AI-generated result before using it in
1
8
16
Ever wondered how massive AI models like GPT-4 and Mixtral work under the hood? In this video, we build a "Mixture of Experts" (MoE) model completely from scratch using PyTorch. This step-by-step tutorial is perfect for anyone in the AI field looking to gain a deep, intuitive
0
1
4
Side effect of blocking Chinese firms from buying the best NVIDIA cards: top models are now explicitly being trained to work well on older/cheaper GPUs. The new SoTA model from @Kimi_Moonshot uses plain old BF16 ops (after dequant from INT4); no need for expensive FP4 support.
🚀 "Quantization is not a compromise — it's the next paradigm." After K2-Thinking's release, many developers have been curious about its native INT4 quantization format. 刘少伟, infra engineer at @Kimi_Moonshot and Zhihu contributor, shares an insider's view on why this choice
61
307
3K
Had a great crowd for talking through the evolution from static RAG to multi-agentic retrieval. Talked about BM25, Big Vector, and Multi-Agents. Well run conference with great attendees, Thanks @wandb for #FullyConnected2025 in London.
1
1
8