Hayley Ross @HayleyRossLing X Profile

Hayley Ross

@HayleyRossLing

Followers

57

Following

19

Media

5

Statuses

10

PhD Student at @Harvard, working on computational semantics and LLM interpretability

Joined October 2010

Don't wanna be here? Send us removal request.

Hayley Ross

@HayleyRossLing

4 months

The tool-calling decision-making dataset I developed during my internship at NVIDIA is now out! Catch me presenting it at NAACL with @ameyasm1154 or see the 🧵 for details.

Yoshi Suhara

@suhara

4 months

Check out a new dataset When2Call for training and evaluating LLMs on decision making about "when (not) to call" functions!. 📄 Paper: 🤗 HF Dataset Hub: 💾 GitHub: #NAACL2025.

0

4

Hayley Ross

@HayleyRossLing

5 months

You can check out our paper at We'll also have a poster at NENLP on April 11th where you can come and ask questions 😊.

arxiv.org

Recent work (Ross et al., 2025, 2024) has argued that the ability of humans and LLMs respectively to generalize to novel adjective-noun combinations shows that they each have access to a...

0

Grok

@grok

2 days

What do you want to know?.

50

23

205

Hayley Ross

@HayleyRossLing

5 months

This suggests that humans and LLMs really do solve this task using composition, if we're willing to accept behavioral evidence as evidence of compositionality—see Kate McCurdy's excellent compositionality survey

aclanthology.org

Kate McCurdy, Paul Soulos, Paul Smolensky, Roland Fernandez, Jianfeng Gao. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. 2024.

1

0

Hayley Ross

@HayleyRossLing

5 months

We build a computational model of analogical reasoning using word embedding similarity, including Llama 3 embeddings and also run an experiment asking humans to reason by analogy. We find that not all of our data can be handled by analogy!

1

0

Hayley Ross

@HayleyRossLing

5 months

Many linguists assume that humans compute the meaning and inferences of multi-word phrases by composition of meaning. But working with LLMs raises the possibility that LLMs (and humans!) are actually solving this task by analogy, like so:

1

0

Hayley Ross

@HayleyRossLing

5 months

New paper with @najoungkim and @TeaAnd_OrCoffee! .We previously found that humans and LLMs generalize to novel adjective-noun inferences (e.g. "Is a homemade currency still a currency?") .Turns out analogy isn't enough to generalize, so it's likely evidence of composition! 🧵

1

9

Hayley Ross

@HayleyRossLing

10 months

We'll also be presenting this work at GenBench (co-located with EMNLP) on 11/16, so come catch us then!.

0

3

Hayley Ross

@HayleyRossLing

10 months

New paper with @najoungkim and @TeaAnd_OrCoffee testing if LLMs can draw adjective-noun inferences like humans! Turns out they often can, and even generalize to unseen combinations. But they're more optimistic about "artificial intelligence" than humans.

1

13

Hayley Ross

@HayleyRossLing

1 year

New preprint with @najoungkim & @TeaAnd_OrCoffee on fake reefs and other cases of (novel) adjective-noun composition: Whether a fake N is an N or not depends on noun + context, but people handle novel AN pairs just fine 🙂.Stay tuned for results on LLMs!

0

1

6