
Alexandre L.-Piché
@alexpiche_
Followers
1K
Following
15K
Media
18
Statuses
126
Searching for Q* at @ServiceNowRSRCH
Montreal, Qc
Joined October 2011
Introducing ReSearch: An iterative self-reflection algorithm that enhances LLM's self-restraint abilities:. • Encouraging abstention when uncertain.• Producing accurate, informative content when confident. Result: Significant accuracy boost for Llama2 7B Chat and Mistral 7B! 🚀
1
45
102
RT @GabrielHuang9: As #ICML2025 kicks off in Vancouver, our AI talent is being quietly pushed out. 🇨🇦. We've been waiting 28 months for per….
0
10
0
RT @MassCaccia: 🎉 Our paper “𝐻𝑜𝑤 𝑡𝑜 𝑇𝑟𝑎𝑖𝑛 𝑌𝑜𝑢𝑟 𝐿𝐿𝑀 𝑊𝑒𝑏 𝐴𝑔𝑒𝑛𝑡: 𝐴 𝑆𝑡𝑎𝑡𝑖𝑠𝑡𝑖𝑐𝑎𝑙 𝐷𝑖𝑎𝑔𝑛𝑜𝑠𝑖𝑠” got an 𝐨𝐫𝐚𝐥 at next week’s 𝗜𝗖𝗠𝗟 𝗪𝗼𝗿𝗸𝘀𝗵𝗼𝗽 𝗼𝗻 𝗖𝗼𝗺𝗽𝘂𝘁𝗲𝗿….
0
50
0
RT @DBahdanau: I am excited to open-source PipelineRL - a scalable async RL implementation with in-flight weight updates. Why wait until yo….
0
115
0
RT @alex_lacoste_: @AnthropicAI Early results with Claude 3.5 sonnet for our new paper. We're probably not even using it right yet and its….
0
7
0
RT @DjDvij: I am also hiring for my new team at @ServiceNowRSRCH, please reach out if you are at the conference and interested in building….
0
6
0
RT @DjDvij: The dominant paradigm in AI alignment is to learn from human feedback. But what form should this feedback take? A simple thumbs….
0
12
0
RT @DBahdanau: 🚨 New agent framework! 🚨. My team at @ServiceNowRSRCH is releasing TapeAgents: a holistic framework for agent development a….
0
40
0
RT @alexandredrouin: Interested in time series forecasting and LLMs?. We are looking for visiting researchers to work on context-aided fore….
0
21
0
RT @alex_lacoste_: Most of our team is at #ICML2024 , reach out if you want to meet. We'll be presenting WorkArena and BrowserGym:.Poster….
arxiv.org
We study the use of large language model-based agents for interacting with software via web browsers. Unlike prior work, we focus on measuring the agents' ability to perform tasks that span the...
0
16
0
RT @rosieyzh: In our new work on evaluating optimizers for LLM training, we perform a series of experiments to investigate the role of adap….
0
31
0
RT @alexpiche_: We can tweak the target accuracy to obtain different behaviors. High target accuracy: ReSearch is very cautious and produce….
0
1
0