Helsinki-NLP
@HelsinkiNLP
Followers
1K
Following
76
Media
25
Statuses
198
Natural Language Processing and Language Technology research at University of Helsinki
Helsinki, Finland
Joined September 2018
More information and details about the call:
ellisinstitute.fi
Call for new PIs in artificial intelligence and machine learning
0
0
0
Come and join us to work on robust, efficient and trustworthy AI across languages and domains. You will be part of the ELLIS Institute Finland as one of the prestigious PIs in their top-level research unit. Application deadline is January 12:
ellisinstitute.fi
ELLIS Institute Finland PI positions (second call, winter 2025–26)
1
1
2
It's happening now. Our HPLT v2 dataset language coverage is awesome, provides competitive and stable results and complements other data beautifully. We are at @aclmeeting, come and say hi! #hplt #datasets
0
5
10
We are happy to announce the second release of HPLT bilingual datasets: - 50 English-centric language pairs = 380M parallel sentences (HPLT) 🤩 - 1,275 non-English-centric language pairs = 16.7B parallel sentences (MultiHPLT). Available at the HPLT dataset catalogue and OPUS.
0
13
16
Postdoc, research fellow and PhD positions: the call to join the Finnish Center for #ArtificialIntelligence is now open. Check out our research areas and supervisors and apply by Feb 2, 2025: https://t.co/unn4sNUe4p
0
15
16
New funding from the @ERC_Research for "Multilingual Assets and Resources for Modular Translation" (MARMoT) https://t.co/yhxBpZ029r
0
1
4
The 18th MT Marathon will be organized in beautiful Helsinki at the end of August 2025. We invite you to a week-long gathering of researchers, developers and students with lectures, labs and hacking projects. More information will come - stay tuned!
1
7
23
Work from @realzihaolee, with the help of @shaoxiongji @Vsegonne and @TiedemannJoerg: how good is machine translation as a multilingual pretraining objective? Looking forward to the #EMNLP2024 poster session, Nov 12, 14h00!
I'll present our work "A Comparison of Language Modeling and Translation as Multilingual Pretraining Objectives" at #EMNLP2024, Nov 12 14-15:30, Riverfront Hall https://t.co/zKmERcESs8 cc @linguistickus
1
3
7
The new ELLIS Institute is excellent news for Europe and Finland. A significant investment in the future of Europe: its growth and its secure supply of expertise. A visionary act by the Finnish Government, the Ministry of Education and Culture, and Peter Sarlin!
ELLIS Institute Finland is launching with public (@okmfi) and private support (@petersarlin). A significant boost for AI R&D and a signal that Finland is investing in AI to attract the best talents and fuel growth & expertise in Europe. https://t.co/aS7YYNM7FG
@ELLISforEurope
0
8
42
Two weeks until the paper submission deadline (21 October). We have reached the submission month and we're hoping to see all your work submitted. Happy writing everyone!! More info here: https://t.co/dJw3W0rQ3c
#NLProc
0
2
4
The next big release of data is out from our HPLT project!
INTRODUCING THE LATEST HPLT MONOLINGUAL DATASETS! TL;DR: - 4.5 PB of web crawls - 21 billion documents - careful extraction, dedup, annotation and cleaning - 193 languages! Explore and download the new HPLT Monolingual Datasets NOW! https://t.co/Kj5XNjfjFQ
#HPLT
0
0
8
Links related to EMMA-500: [Paper](https://t.co/r6D1XyMAsw) [Model](https://t.co/VzGtC92Bih) [Data](https://t.co/3gOjJ2CTqv)
1
0
10
Excited to introduce EMMA-500! A multilingual model continue-trained on 546 languages, enhancing coverage for low-resource languages. With the MaLA corpus and Llama 2 7B, we're pushing boundaries in cross-lingual transfer. Check it out:
huggingface.co
2
25
102
Today @josephnlp and I presented KD4MT at the MT Marathon in Prague organized by @ufal_cuni, one of my favourite events of the year! @HelsinkiNLP @hplt_eu
2
4
28
A new round of fully funded PhD positions in AI in Finland!
Applications are open for the doctoral program in AI! Fully funded PhD positions across 10 Finnish universities 🇫🇮 Watch the video with @arnosolin below. Apply here by Sept. 9: https://t.co/G4MkmpB152
0
0
6
We are pleased to announce that SHROOM has been selected as the Best Task Paper for SemEval 2024! 🥳🤯 https://t.co/Jmrs4r5PGq Thank you to all our participants for making this shared task a success!
semeval.github.io
The 18th International Workshop on Semantic Evaluation
0
3
10
@LrecColing has arrived! We will be presenting our work on how we built the HPLT datasets!
Friday 24th of May, 9.20h-9.40h, Room Londra, Session D3-S1-R3 - Multilinguality, Machine Translation, and Translation Aids II
We will be presenting the HPLT datasets HOW-TO and insights at @LrecColing in Torino. Paper already in https://t.co/JiXooVNzvz:
https://t.co/iPlRX7Cfj5.
1
5
9
Will you be at @LrecColing next week? HPLT will! 🥳 Don't miss: - our poster on Thursday 23, 15:30, about FastSpell, one of the langID technologies of our dataset pipeline. (paper 1571) - our presentation on Friday 24, 9:20 for all details about HPLT massive dataset (paper 2199)
0
5
7
Call for Workshop Proposals NoDaLiDa/Baltic-HLT 2025. 2 and 5 March, before and after the main conference. Options are full or half-day workshops. Deadline: 26 August 2024. Find more information on our website: https://t.co/R8tKHXoTAr
0
3
10
NoDaLiDa/Baltic-HLT 2025 *call for papers* is up! Paper Deadline: *21 October 2024*. Find more information on our brand new website: https://t.co/dJw3W0ridE 165/166 days until deadline (0.6%) #NLProc #nodalida_baltichlt
0
8
13