 
            
              Scaled Cognition
            
            @ScaledCognition
Followers
                483
              Following
                36
              Media
                4
              Statuses
                22
              The first AI system designed and trained for agentic applications. Register for early access: https://t.co/26WXpyCnde
              
              SF, NYC, Boston
            
            
              
              Joined January 2023
            
            
          
            @ScaledCognition + @Genesys = a new era of action-driven AI for CX. Together, we’re helping enterprises deploy deterministic systems that deliver reliable, policy-aligned outcomes — built for action, not just words. Learn more →
          
          
            
            genesys.com
              Partnership includes Genesys investment in Scaled Cognition to support large action model innovation for CX workflows that enable a new level of reliabi...
            
                
                0
              
              
                
                9
              
              
                
                13
              
             We’re excited to share that our co-founder and CTO, Dan Klein, will be speaking at this year's @FinRegLab AI Symposium in Washington, D.C. The event brings together leaders shaping the future of AI and financial services — from building trustworthy systems to understanding how 
          
                
                0
              
              
                
                0
              
              
                
                6
              
             Evolution when you need safety. Revolution when you’re ready. Build AI you can trust, then scale without limits. 
           Every company we talk to has the same challenge: they have existing systems that work, mostly. Dialog trees that handle millions of interactions. Business logic encoded in flow charts. They want AI but they can't afford to rip everything out and start over. It's too risky and 
          
                
                0
              
              
                
                0
              
              
                
                1
              
             We’re growing fast at @ScaledCognition — and we’re hiring across a range of roles! If you’re excited about building specialized agentic LLMs — models that are reliable and ready for real-world use — we’d love to hear from you. Our work spans from low-level modeling advances to 
          
            
            scaledcognition.applytojob.com
              Explore open job opportunities at Scaled Cognition.
            
                
                0
              
              
                
                1
              
              
                
                3
              
             Khosla backed model lab doing cutting edge work on specialized agentic LLMs 
           We’re actively hiring researchers! If you’re interested in building highly reliable specialized models for agentic use cases, come join us @ScaledCognition! Our work ranges from low-level modeling advances to synthetic data generation and evaluation, and is directly impacting 
          
                
                0
              
              
                
                1
              
              
                
                4
              
             We’re actively hiring researchers! If you’re interested in building highly reliable specialized models for agentic use cases, come join us @ScaledCognition! Our work ranges from low-level modeling advances to synthetic data generation and evaluation, and is directly impacting 
          
                
                2
              
              
                
                6
              
              
                
                10
              
             We read this paper in our reading group today. It's a cool paper; it spurred some interesting discussion. Our high-level reaction: it sure looks like the results can be explained by sampling k tokens at a time, taking the most likely sequence, then continuing. 
           We found a new way to get language models to reason. 🤯 No RL, no training, no verifiers, no prompting. ❌ With better sampling, base models can achieve single-shot reasoning on par with (or better than!) GRPO while avoiding its characteristic loss in generation diversity. 
            
                
                4
              
              
                
                8
              
              
                
                120
              
             Most people don’t yet realize that systems based on general purpose LLMs are like building on jello. Models trained from the tangled mess of internet data and RL optimized for plausible sounding output are not well suited for workflow automation where precision and actual 
          
                
                0
              
              
                
                7
              
              
                
                13
              
             The customer service world has been stuck between two extremes. On one side: rigid dialog trees. Every interaction follows predefined paths. Want to search for restaurants while booking a hotel? Sorry, that's not in the script. These systems are predictable but inflexible. On 
          
                
                0
              
              
                
                6
              
              
                
                15
              
             🤝 @ScaledCognition is proud to partner with @Genesys to bring responsible agentic AI to the next level of customer experience orchestration. By integrating Scaled Cognition’s specialized agentic models with #GenesysCloud™, we’re helping organizations deploy AI built for 
          
            
            scaledcognition.com
              Today, we’re excited to announce a new partnership with Genesys®, a global cloud leader in AI-Powered Experience Orchestration.
            
                
                0
              
              
                
                0
              
              
                
                3
              
             Very proud of the team at @ScaledCognition and excited for this partnership with @Genesys! It's great to see our technology making an impact after months of work ranging from fundamental research to core engineering, figuring out how to bring robust agentic AI to CX. 
          
              @ScaledCognition + @Genesys = a new era of action-driven AI for CX. Together, we’re helping enterprises deploy deterministic systems that deliver reliable, policy-aligned outcomes — built for action, not just words. Learn more →
            
          
                
                0
              
              
                
                1
              
              
                
                4
              
             Some pretty interesting research has gone into training the models behind this. It's been fun to work on this the last ~2.5 years, and I'm excited for what's coming. Also, we're hiring ;). 
          
              @ScaledCognition + @Genesys = a new era of action-driven AI for CX. Together, we’re helping enterprises deploy deterministic systems that deliver reliable, policy-aligned outcomes — built for action, not just words. Learn more →
            
          
                
                1
              
              
                
                2
              
              
                
                4
              
             Genesys + Scaled Cognition = a new chapter in agentic AI for CX. Together, we will be helping organizations deploy autonomous agents that can act with accuracy, control, and confidence. Learn more →  https://t.co/zjmEXIkYEQ 
          
          
                
                3
              
              
                
                3
              
              
                
                10
              
             This @FortuneMagazine article outlines the risk of using general purpose LLMs for CX. Leading AI startup @cursor_ai CX agent hallucinated a key policy causing confusion amongst its users. General purpose LLMs lack determinism, and are prone to consequential errors. AI agents, 
           NEW: AI startup Anysphere has been riding high over the past two months, thanks to the skyrocketing popularity of its AI-powered software coding assistant, Cursor. But this week, Cursor's customer support AI went rogue, triggering a wave of cancellations.  https://t.co/KKF4S4b0o0 
            
          
                
                0
              
              
                
                2
              
              
                
                7
              
             Our CTO Dan Klein was featured in @nytimes, sharing insights on building AI agents that can reason through tasks. Accelerate your company’s development of AI agents with APT-1 with its built-in reasoning, action-taking, and decision making capabilities. 
          
                
                2
              
              
                
                2
              
              
                
                10
              
             We’ve accomplished these gains through (1) optimizing the model for actions rather than tokens, (2) a new kind of synthetic agentic training data, and (3) a novel RL approach using agent-to-agent self play. (1) Standard models are focused on token sequences, but business logic 
          
                
                0
              
              
                
                1
              
              
                
                23
              
             APT-1 currently outperforms all other models on the Tau-Bench and ComplexFuncBench agentic leaderboards, which test the ability to invoke sequences of complex APIs and comply with business policies. 
          
                
                1
              
              
                
                3
              
              
                
                25
              
             We’re also announcing our Agent Builder platform, which allows you to create, test, and deploy an enterprise-grade AI agent using APT-1 in under an hour. Our GenAPI technology lets you test agent behaviors without needing to integrate with real APIs during development. Learn 
          
                
                2
              
              
                
                0
              
              
                
                25
              
             We’re Scaled Cognition, developing the first ever models trained specifically for agentic applications: 1. Our first system, APT-1, is now #1 on agentic benchmarks. 2. It was developed by a US team for a total cost of less than $11M. 3. Khosla Ventures led our seed round ($21M 
          
                
                9
              
              
                
                43
              
              
                
                238
              
             
               
             
             
               
            