Pao Siangliulue
@Siangliulue
Followers
336
Following
704
Media
37
Statuses
289
🎒 {Creativity, AI, People} | HCI researcher & software eng | @allen_ai | previously @B12, @Harvard, @Stanford | Also on Bluesky 🦋
Seattle, WA
Joined May 2009
Introducing Asta DataVoyager—our new AI capability in Asta that turns structured data into transparent, reproducible insights. Built for scientists, grounded in open, inspectable workflows. 🧵
5
27
114
Introducing Asta—our bold initiative to accelerate science with trustworthy, capable agents, benchmarks, & developer resources that bring clarity to the landscape of scientific AI + agents. 🧵
10
50
221
Best paper award at @sdpworkshop @aclmeeting : Our paper on evaluating the novelty of LLM-generated scientific ideas won the best paper award at the SDP workshop. Congrats Simra @its_sshahid Marissa @marissa_rad and team!
1
5
32
Are you a researcher in CS or a CS-adjacent field who could use help in refining your research ideas? Want to try our new AI-powered tool that helps with just that in a paid user study? Details and sign up here!
docs.google.com
Hi! 👋 We are researchers at the Allen Institute for Artificial Intelligence (Ai2) exploring AI-powered tools to support researchers in project ideation. We are conducting a study to learn more about...
2
7
20
We’ve upgraded ScholarQA, our agent that helps researchers conduct literature reviews efficiently by providing detailed answers. Now, when ScholarQA cites a source, it won’t just tell you which paper it came from–you’ll see the exact quote, highlighted in the original PDF. 🧵
6
35
197
Introducing SciArena, a platform for benchmarking models across scientific literature tasks. Inspired by Chatbot Arena, SciArena applies a crowdsourced LLM evaluation approach to the scientific domain. 🧵
12
64
407
@allen_ai @SemanticScholar is hiring an #ml #nlp #ai reasoning researcher for a Research Scientist, Agents for Science position with target start dates in 2025. Excited about developing AI systems with deep reasoning capabilities for science? Send an application our way!
1
10
21
Imagine AI doing science: reading papers, generating ideas, designing and running experiments, analyzing results… How many more discoveries can we reveal? 🧐 Meet CodeScientist, a promising next step toward autonomous scientific discovery. 🧵
6
97
369
Meet Ai2 Paper Finder, an LLM-powered literature search system. Searching for relevant work is a multi-step process that requires iteration. Paper Finder mimics this workflow — and helps researchers find more papers than ever 🔍
19
217
1K
We are looking for CS researchers to participate in a study exploring how AI can change the way we do literature reviews. 📚🧑🎓 Time: ~90 min, remote Compensation: $60 USD Sign up here: https://t.co/5P0hDpCUMQ
@dsweld @amyxzh @josephcc @marissa_rad @Siangliulue @turingmusician
docs.google.com
We are researchers from the University of Washington and AI2, and are currently recruiting participants for a user study that explores how AI tools can support scientific literature review. During...
0
12
28
We're seeking CS researchers to participate in a study on working with AI tools to come up with research project ideas! Recruitment survey: https://t.co/OTXhP71DVD Compensation: $50 Amazon gift card Time: 90 mins @dsweld @Hoper_Tom @its_sshahid @Siangliulue @rayrayfok
docs.google.com
We are a group of researchers from the Allen Institute for AI and University of Washington conducting a study to investigate how computer-science researchers work with AI tools to come up with...
2
8
28
We’re excited to share some updates to Ai2 ScholarQA: 🗂️ You can now sign in via Google to save your query history across devices and browsers. 📚 We added 108M+ paper abstracts to our corpus - expect to get even better responses! ✨ The backbone model has been updated to the
3
36
167
We took our most efficient model and made an open-source iOS app📱but why? As phones get faster, more AI will happen on device. With OLMoE, researchers, developers, and users can get a feel for this future: fully private LLMs, available anytime. Learn more from @soldni👇
47
106
653
🔬Research ideation is hard: After the spark of a brilliant initial idea, much work is still needed to further develop it into a well-thoughtout project by iteratively expanding and refining the initial idea and grounding it to relevant literature. How can we better support this?
11
83
646
Here is Tülu 3 405B 🐫 our open-source post-training model that surpasses the performance of DeepSeek-V3! The last member of the Tülu 3 family demonstrates that our recipe, which includes Reinforcement Learning from Verifiable Rewards (RVLR) scales to 405B - with performance on
151
372
2K
"Schedule send," an email feature that saves me hours of anxious email tweaking. Instead of fretting over the perfect words, "schedule send" it in the next hour. You can still edit for a while if you want to. The gain is the peace of mind.
0
0
0
Can AI really help with literature reviews? 🧐 Meet Ai2 ScholarQA, an experimental solution that allows you to ask questions that require multiple scientific papers to answer. It gives more in-depth, detailed, and contextual answers with table comparisons, expandable sections
14
73
221
🤔Giving complex tasks to AI agents is easy—getting them to do exactly what you want isn’t. How can human-AI collaboration give us more reliable & steerable agents? 🍫Introducing Cocoa, our new interaction paradigm to balance human & AI agency in complex human-AI workflows. 🧵
1
19
69
@allen_ai @SemanticScholar is hiring #nlproc #hci #ml #ai researchers for a Research Engineer position with target start dates in 2025, apply by *Jan 17, 2025*! Apply: https://t.co/oM9bAa0s7S
0
4
10
This is work with @yoonjoo_le2, @arnaik19, @Siangliulue, @rayrayfok, @imjuhokim, @dsweld, @josephcc, and @kylelostat conducted at @SemanticScholar @allen_ai @uwcse @kaist Code & Data: https://t.co/Jcfa7HGju1 Paper:
aclanthology.org
Benjamin Newman, Yoonjoo Lee, Aakanksha Naik, Pao Siangliulue, Raymond Fok, Juho Kim, Daniel S Weld, Joseph Chee Chang, Kyle Lo. Proceedings of the 2024 Conference on Empirical Methods in Natural...
0
2
4