Pinjia He @PinjiaHE X Profile

Pinjia He

@PinjiaHE

Followers

1K

Following

811

Media

12

Statuses

246

Assistant Professor at The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen) @cuhksz.

https://t.co/ldVKT6dSHh

Shenzhen, China

Joined March 2015

Pinjia He

@PinjiaHE

8 months

📢 Can LLMs locate software service failures? 🤔 My student @SiyuexiH's #ICLR2025 paper introduces OpenRCA, the first benchmark dataset for evaluating LLMs' root cause analysis capabilities in software systems. LLMs/Agents need to analyze system telemetry data to infer results

0

6

12

Daniel Kang

@ddkang

4 months

SWE-bench Verified is the gold standard for evaluating coding agents: 500 real-world issues + tests by OpenAI. Sounds bullet-proof? Not quite. We show passing its unit tests != matching ground truth. In our ACL paper, we fixed buggy evals: 24% of agents moved up or down the

11

36

199

Cindy Rubio González

@cindy_rubio

6 months

Are you interested in serving on the Program Committee for @issta_conf 2026? Please let us know by filling out this form:

docs.google.com

Until 30th June 2025, please indicate your interest to serve on the ISSTA 2026 program committee through filling out this form. Please include as much information as possible. After submitting the...

1

6

10

Chengyu Zhang

@chengyuzh

5 months

I'm looking for PhD students starting Fall 2026! If you're interested in automated testing and trustworthy program verification, feel free to reach out via email or come chat with me at ISSTA/FSE next week!

Chengyu Zhang

@chengyuzh

5 months

Excited to share that two of our papers will be presented next week: one at SIGMOD (Tuesday), and another at the FUZZING Workshop @ ISSTA (Saturday)! The student collaborators from @ECNUER will present the papers. I’ll be at ISSTA/FSE next week—come say hi! Looking forward to

3

11

42

Pinjia He

@PinjiaHE

6 months

My student Xiaoyuan Liu's @xyliu_cs collaboration work with Tencent. #ACL2025NLP

Zhaopeng Tu

@tuzhaopeng

6 months

When eyes and memory clash, who wins? 👁️🧠 Introducing a comprehensive study on vision-knowledge conflicts in MLLMs, where visual input contradicts the model's internal commonsense knowledge—and the results might surprise you. #ACL2025NLP 📈 We developed an automated framework

0

4

Dominik Winterer

@DominikWinterer

6 months

🚀 I'll be launching the Formal Methods Engineering Lab ( https://t.co/9pjKYVa89h) – and I am hiring! If you’re interested in working with me, feel free to reach out.

Dominik Winterer

@DominikWinterer

6 months

Super excited to share that I will be joining The University of Manchester (@OfficialUoM) as a Lecturer (Assistant Professor) in Cyber Security! The Systems and Software Security group at Manchester is already incredibly impressive, and I’m honored to help further strengthen it.

1

11

29

Pinjia He

@PinjiaHE

6 months

Check out my student Xiaoyuan Liu's @xyliu_cs collaboration work with Tencent: RISE (Reinforcing Reasoning with Self-Verification), enabling LLMs to simultaneously level-up BOTH their problem-solving AND self-checking skills.

Zhaopeng Tu

@tuzhaopeng

6 months

Trust your AI, but can it trust itself? 🤔 Introducing an online reinforcement learning framework, RISE (Reinforcing Reasoning with Self-Verification), enabling LLMs to simultaneously level-up BOTH their problem-solving AND self-checking skills! 🧐 Problems tackled: ✅

0

1

15

Chao Peng

@chao_peng_

7 months

We’re proud to bring @Trae_ai to @ICSEconf. Our booth, product showcase, banquet, and workshops were a great success. Huge thanks to everyone who joined our events. Looking forward to deeper collaboration in AI4SE research. See you again at @FSEconf !

2

24

Pinjia He

@PinjiaHE

9 months

Truly humbled and honored to receive the IEEE CS TCSE Rising Star Award. Thanks a lot for the help along the way from my supervisors, referees, students, and co-authors. Will continue to focus on impactful projects about AI4SE and SE4AI. 🎯 https://t.co/cUX83bzcoQ

10

7

58

Shin Hwei Tan

@tan_hwei

9 months

We invite you to nominate yourself to serve on the Program Committee for FSE'26. Please use the following link to access the nomination form: https://t.co/wCGjjsdCVv

docs.google.com

Please use this form to nominate yourself for the program committee of the ACM International Conference on the Foundations of Software Engineering (FSE 2026) by March 14, 2025. While we cannot select...

1

12

32

Ion Stoica

@istoica05

10 months

Agree DeepSeek is not as good as o1-pro and o3, but I think we need to look at the trends. This is what happened during the last nine months. What will happen in the next nine months if we do not change anything in the structure of the AI ecosystem in the US?

Nicolas

@NicolasSerna314

10 months

@istoica05 We are definitely not doing a good job in the USA given our resources. However, I am not entirely sure about the last claim. DeepSeek might be as good as o1 if not better, but I don't think it is as good as o1 pro or o3 (based on deep research). Additionally, one little detail

9

29

170

Pinjia He

@PinjiaHE

10 months

🚀Can LLMs repair programs without known buggy hunks?🤔 💡 My student @SiyuexiH's #ICSE2025 paper reveals that current infilling approaches constrain LLMs' repair potential. By simply aligning the task objective from fixing buggy hunks to rewriting the entire program with tests,

2

9

30

Yoshua Bengio

@Yoshua_Bengio

10 months

Today, we are publishing the first-ever International AI Safety Report, backed by 30 countries and the OECD, UN, and EU. It summarises the state of the science on AI capabilities and risks, and how to mitigate those risks. 🧵 Link to full Report: https://t.co/k9ggxL7i66 1/16

49

523

1K

Marcel Böhme👨‍🔬

@mboehme_

11 months

Nominate yourself for the ASE'25 PC! The 40th IEEE/ACM International Conference on Automated Software Engineering (@ASE_conf) is looking for PC nominations to maximize diversity of perspectives. 🖊️ https://t.co/Th6wpRfuX7 🧑‍💻 https://t.co/M0qPBkfwqo w/ @LingmingZhang

docs.google.com

Please indicate your interest to serve on the ASE 2025 program committee through filling out this form. Please include as much information as possible. After submitting the form you will receive a...

0

10

28

Pinjia He

@PinjiaHE

1 year

Dominik's research is solid and highly impactful! He is also very easy to get along with😃

Dominik Winterer

@DominikWinterer

1 year

🚀🔍🧑‍🏫 I am on the academic job market! My research focuses on advancing Formal Methods, Programming Languages, and Software Engineering. Website: https://t.co/ypqj71vafu Research Statement:

0

1

Dominik Winterer

@DominikWinterer

1 year

🚀🔍🧑‍🏫 I am on the academic job market! My research focuses on advancing Formal Methods, Programming Languages, and Software Engineering. Website: https://t.co/ypqj71vafu Research Statement:

3

23

65

Lilian Weng

@lilianweng

1 year

🦃 At the end of Thanksgiving holidays, I finally finished the piece on reward hacking. Not an easy one to write, phew. Reward hacking occurs when an RL agent exploits flaws in the reward function or env to maximize rewards without learning the intended behavior. This is imo a

lilianweng.github.io

Reward hacking occurs when a reinforcement learning (RL) agent exploits flaws or ambiguities in the reward function to achieve high rewards, without genuinely learning or completing the intended...

68

227

2K

FSE 2025

@FSEconf

1 year

FSE'25 will be buzzing with 14 co-located workshops. Congratulations to the organizers for their hard work! More details will be posted in the next few days. #FSE25 #Workshops

0

6

20