 
            
              RDB
            
            @Rajath_DB
Followers
                940
              Following
                6K
              Media
                90
              Statuses
                794
              It's not what it looks like / AI Engineer prev Rootly / building @kwalityai Will post about things I like and some technical findings that seem interesting
              
              Toronto, Ontario
            
            
              
              Joined September 2015
            
            
           Do anybody still download @WordPress themes on @ThemeForest and host it? Those were the days 
          
                
                1
              
              
                
                0
              
              
                
                1
              
             Here’s also the numbers from Jeff on “Numbers Every Engineer Should Know” in case you need it 
          
                
                0
              
              
                
                0
              
              
                
                0
              
             I stumbled up on a GitHub repo while reading some docs on optimizing LLM inference which talks about “Numbers Every LLM Developer” should know inspired by @JeffDean — here’s the repo should be helpful if you’re trying to serve your own model  https://t.co/gWIL0ZlRc6  It’s a bit 
          
            
            github.com
              Numbers every LLM developer should know. Contribute to ray-project/llm-numbers development by creating an account on GitHub.
            
                
                1
              
              
                
                0
              
              
                
                0
              
             𝕏 algo seems really on point. Gives me exactly what I need when I need. Almost like some kinda telepathy 
          
                
                0
              
              
                
                0
              
              
                
                0
              
             Yesterday we shared our @Alibaba_Qwen's 3Guard benchmark results. Today, here’s the NotebookLM video that walks through what those numbers actually mean how context length impacts LLM safety, and why 7k-token chunks hit the sweet spot. 
          
                
                0
              
              
                
                2
              
              
                
                1
              
             I like the @LangChainAI docs so much better now. @hwchase17 is there a way to remove inbuilt tools without getting too internal in the deep research agent like read and write etc. I’m using that for a use case where that’s not needed. It sometimes makes those tool calls when not 
          
                
                2
              
              
                
                0
              
              
                
                7
              
             we’re not “halfway to AGI” - we’ve built something completely alien. superhuman at some tasks, worse than a kid at others. the shape is totally jagged. Also they’ve defined all the terms everyone keeps mixing up - “recursive AI” (AI building better AI) vs “superintelligence” vs 
          
                
                1
              
              
                
                0
              
              
                
                0
              
             GPT-5 scores 58% overall - sounds decent but look at that chart - it’s absolutely cracked at math and knowledge but literally can’t do spatial reasoning or basic planning. GPT-4 scored 0% on reasoning. 
          
                
                1
              
              
                
                0
              
              
                
                0
              
             they broke intelligence into 10 pieces based on real psychometric science (the same stuff psychologists use to test humans). Math, reasoning, memory, visual processing, all of it. 
          
                
                1
              
              
                
                0
              
              
                
                0
              
             okay so i’ve been going through the paper “A Definition of AGI” by Hendrycks et al. and it’s genuinely the first paper that actually DEFINES what AGI means. like we’ve all been arguing “when AGI” for years but nobody agrees on what it even IS. This paper defines: AGI = 
          
                
                1
              
              
                
                0
              
              
                
                0
              
             It’s surprising to consider that in the future, many autonomous agents might rely on the cloud, and if AWS experiences downtime, they would also cease to function, resembling a dystopian technology glitch. #AGI is just AWS backend. 
          
                
                1
              
              
                
                0
              
              
                
                1
              
             Indian devs out on diwali and @awscloud is down. Surely one has nothing to do with the other. Right? Right? 
          
                
                0
              
              
                
                0
              
              
                
                0
              
             Take a loot at this thread! If you’re building on LLMs (most of us are 😌) get your LLMs protected from various threats. 
           🔬 We benchmarked Qwen3Guard's prompt injection detection at scale. The results? Context window size DRAMATICALLY affects detection rates—even at the same malicious percentage. Here's what we found testing 1% malicious content across 1k-12k token chunks: 🧵 
          
                
                2
              
              
                
                0
              
              
                
                6
              
             Only me or @AnthropicAI ERROR agent - Anthropic API call failed: Error code: 529 - {'type': 'error', 'error': {'type': 'overloaded_error', 'message': 'Overloaded'}} 
          
                
                4
              
              
                
                0
              
              
                
                2
              
             A Friend: What do you do as an AI Engineer? Me: I spend most of my time reading @LangChainAI docs and slide into @AnthropicAI Claude's DM. 😌 
          
                
                0
              
              
                
                0
              
              
                
                3
              
            