
Emilia Wiśnios
@wisnios_emilia
Followers
133
Following
524
Media
10
Statuses
92
On the lookout for PhD adventures 🚀
Warsaw, Poland
Joined February 2022
🚀 We just released our open-source codebase and a public leaderboard for evaluating reasoning and progress sense of AI models! Big things are coming - don’t miss out! 👀 thanks to: @pfbudzianowski, @gracjan_goral, @wisnios_emilia, @ihorbeaver, @viktor_vrp and @KrzysztofTWalas
We are happy to share our v1 version of the temporal progress leaderboard ( https://t.co/wD6s9uhGEf). Visual temporal reasoning is an essential characteristic that VLM/VLA should exhibit natively for proper data curation and adaptation for robotics tasks.
0
1
7
We are happy to share our v1 version of the temporal progress leaderboard ( https://t.co/wD6s9uhGEf). Visual temporal reasoning is an essential characteristic that VLM/VLA should exhibit natively for proper data curation and adaptation for robotics tasks.
huggingface.co
2
2
9
now, you can send us your vlm or dataset, and we'll tell you how good it is. seriously, send it. thnaks to: @pfbudzianowski, @wisnios_emilia, @ihorbeaver, @viktor_vrp, @mtyrolski and @KrzysztofTWalas
We are happy to share our v1 version of the temporal progress leaderboard ( https://t.co/wD6s9uhGEf). Visual temporal reasoning is an essential characteristic that VLM/VLA should exhibit natively for proper data curation and adaptation for robotics tasks.
1
1
7
If you're in Vienna for ACL, @adhiraj_ghosh98 and I will be presenting our work on benchmarking language and vision-language models in the post-dataset era on Wednesday from 11 am in Hall 4/5. Come say hi! Project page 🔗 https://t.co/1SVlmJZ27S
bethgelab.github.io
ONEBench: a new paradigm for open-ended benchmarking and evaluation of foundation models, aggregating sample-level tests across datasets.
0
4
12
Instead of complaining that peer review is dead, take a positive step to improve it today. The reviewers are not aliens, they are us! - Revise your review and make it clear. Identify the crucial points that impacted your score negatively and positively. - If the paper is
10
14
153
I'm so happy when VLM spatial folks look into robotics to find new pre-training tasks! @gracjan_goral @Emilia Wiśnios did awesome job bringing up generative task completion (from @JasonMa2020 and co.) with open-source models. 1/2
4
11
64
🚨 One week left to submit contributed talks, posters, and tutorials to the ML in PL Conference 2025! Don't miss your chance to present at this year's conference: 👉 Main Conference talks and posters (October 15-17, 2025) 👉 Tutorials (October 18, 2025) ✓ Talk and poster
0
7
16
Check out both of my papers: - Wait, that's not an option: LLMs Robustness with Incorrect Multiple-Choice Options ( https://t.co/837C7dJn6q) - Behind Closed Words: Creating and Investigating the forePLay Annotated Dataset for Polish Erotic Discourse ( https://t.co/UzTSqeiADY)
0
0
0
Presenting two posters at the same poster session with a sprained ankle might be challenging, but I hope I'll manage. See you on Monday, July 28, at 11 a.m. I will be the one with the orthopaedic crutch 😂
what if instructions didn't just tell you what to do, but validated your inner reasoning? with @wisnios_emilia, @piotrsankowski, and @pfbudzianowski, we explored this idea. excited to present our work at #ACL2025 in Vienna next week! paper: https://t.co/A8k3wRlpcp
1
0
4
so some time ago we asked: what’s the threshold between following instructions and accuracy in llms; like: what is the capital of austria? reply only a or b; a. an apple b. euler we know the answer and will share it at #ACL2025 check it out: https://t.co/A8k3wRlpcp
1
2
9
W informatyce bardzo lubimy konferencje oznaczać skrótami. A ACL to coroczne spotkanie stowarzyszenia lingwistyki obliczeniowej (Annual Meeting of the Association for Computational Linguistics) i jest to jedna z konferencji o randze A*. Miło mi poinformować, że nasza praca
4
14
177
wow guys; in 3 days, our perspective datasets have over 200 downloads from hf (in total); i know; that isn't much; but it is honest work; thanks! datasets: https://t.co/CGFSnAyuKx let me know if you want another blender-perspective dataset to test VLMs and humans;
0
1
7
people need to take perspective (emotional and geometrical); robots too (I think); emotional is hard to measure (at least for me lol); geometrical slightly easier;
1
4
8
Take a look at our new paper!
Quick poll: What's 2 + 2? [ ] 3 [ ] 5 [ ] 22 [ ] 🍎 Feeling confused? That's the point! Our new study explores how AI (and humans) handle questions with no correct answers. 🧠🤖 #LLM #alignment
0
0
5
An ability to reason about the truth when shown different options is fundamental for both llms and robots. See more on our paper website https://t.co/V8Yj5FGG8A Congrats @gracjan_goral and @wisnios_emilia on leading this!
sites.google.com
Read
Quick poll: What's 2 + 2? [ ] 3 [ ] 5 [ ] 22 [ ] 🍎 Feeling confused? That's the point! Our new study explores how AI (and humans) handle questions with no correct answers. 🧠🤖 #LLM #alignment
2
5
27
Sunday parameters: Max-pooling before hitting dropout 1.0 🤖🏖️ Your CNN needs a break, so do you.
0
1
3
Należy zwrócić uwagę na pokorę @piotrsankowski który dla większego dobra znosi tę dziecinadę polityczną. Ego i pokora tym się właśnie różnią.
11
25
651
Dziękuję moim studentom z @IDEAS_NCBR za skuteczne dowiezienie dziś publikacji na ICLR. Mimo niesprzyjających okoliczności. Tak się robi AI Panie ......... i Panie .....
5
11
224
Z 7 osób w mojej grupie badawczej w @IDEAS_NCBR , 3 już rozważają odejście, a 2 nowe które miały dołączyć nie dołączą - przyjmą inne oferty. Odechciewa się. @OficjalneZero, @Przegaa, @JemielniakD @wina_Mazurka, @sylvcz, @TrajektoriaAI, @annawitten , @K_Stanowski ,
Przyjęłam dzisiaj zaproszenie Roberta Mazurka i @OficjalneZero do debaty o AI o 20 (w kontekście fatalnej sytuacji powstałej w @IDEAS_NCBR, którego dorobek jest zaprzepaszczany). W debacie uczestniczyć będzie @jarokrolewski oraz Minister @m_gdula - do zobaczenia. @piotrsankowski
35
99
506