HB-Eval System
@hbEvalSystem
Followers
8
Following
5
Media
8
Statuses
21
AI researcher and developer, and the founder of HB-Eval System—the first diagnostic framework designed to evaluate the internal reasoning quality
Egypt
Joined November 2025
I’m excited to join the technology and AI community and share my research journey, where I’m working on building a new generation of explainable agents with measurable cognitive performance. Abuelgasim Adam, Founder of HB-Eval System
Seeking an arXiv endorsement to publish my new research paper on Agentic AI and Hierarchical Evaluation of AI Agents. If you’re an active arXiv author in https://t.co/bQEBZ6Lfi1 or know someone who can help, your support would be greatly appreciated.
Most AI evaluations miss recovery behavior. HB-Eval System measures FRR to track how well an agent bounces back after failure.
HB-Eval System closes ALL major gaps in current Agentic AI:
• Evaluation → PEI / FRR / TI
• Adaptation → Adapt-Plan + MetaController
• Memory → Eval-Driven Memory (EDM)
• Trust → HCI-EDM with interpretable reasoning
https://t.co/kSSUI7pUYc
#Research #OpenSource
github.com
HB-Eval System™ – The Leading Behavioral Evaluation & Trustworthy Agentic AI System PEI = 0.92 · FRR = 92% · Human Trust Score = 4.62/5.0 500-Task Longitudinal Benchmark · 4-Paper Series...
HB-Eval System™ just dropped — the first complete framework that *actually solves* the 4 biggest failures in Agentic AI. PEI = 0.92 | FRR = 92% | Human Trust = 4.62/5.0 500-task longitudinal benchmark | 4-paper series (Nov 2025) Open-Core (Apache 2.0) — runs in 30 se
Novel cognitive architecture for Performance-Driven Agents. Integrates HB-Eval (Performance metrics), EDM (Selective Memory), and Adapt-Plan (Strategy). Open-Core Edition. https://t.co/kSSUI7qsNK
Open-Core (Apache 2.0) – fully runnable in 30 seconds
Enterprise (real-time, on-prem, SLA) → licensing@hb-eval.ai
Code + Docker + Papers ↓ https://t.co/kSSUI7pUYc
#AgenticAI #TrustworthyAI #XAI #OpenSource #LLM #AI
Closes ALL gaps:
• Evaluation → PEI/FRR/TI
• Adaptation → Adapt-Plan + MetaController
• Memory → Eval-Driven Memory (EDM)
• Trust → HCI-EDM with full explanations
https://t.co/kSSUI7qsNK
HB-Eval System™ just dropped – the first complete framework that actually SOLVES the 4 biggest problems in Agentic AI. PEI = 0.92 | FRR = 92% | Human Trust = 4.62/5.0 500-task benchmark | 4-paper series (Nov 2025) https://t.co/kSSUI7qsNK