davidlassner
@davidlassner
Followers
93
Following
130
Media
3
Statuses
123
Our Humanistic NLP dream team from left to right: @davidlassner @apjanco, moi, Natasha Ermolaev (locked out of Twitter — apparently “a long story” 😀) & @thatbudakguy. We loved presenting our @DARIAHeu + @PrincetonDH collab at the #DH2023 Language Models sessions earlier today.
0
3
39
Now online 😍 ! My book "From Handwriting to Footprinting: Text and Heritage in the Age of Climate Crisis" is now published and can be downloaded at:
openbookpublishers.com
Integrating historical, archival and environmental perspectives, From Handwriting to Footprinting illuminates the impact that digitisation has had on the dissemination and preservation of textual...
1
7
11
Woooo! 😳 Our #DraCor team just won the 2022 #Rahtz Prize for #TEI Ingenuity. What! An! Honor! Thanks to @TEIconsortium – and thanks to all the "DraCorians" out there.
🏆And the winner of the 2022 Rahtz Prize for #TEI Ingenuity is: 🏅DraCor – Drama Corpora Project: https://t.co/Egz3GiDTR9.
2
7
41
📢to all historians looking for digital tools to analyze historical sources: CorDeep, a #ML based web application, is able to extract 4 different classes of visual elements within historical sources from the 15th to the 17th centuries. 👉 https://t.co/m2QrelpSNi
0
8
21
I am delighted to share that "Domain-Specific Word Embeddings with Structure Prediction" has been accepted to TACL. This is joint work with @davidlassner @AnneBaillot & Shin Nakajima @coastalcph & @bifoldberlin. Preprint here: https://t.co/pfqZJO8jnR. Code follows soon! TL;DR:
arxiv.org
Complementary to finding good general word embeddings, an important question for representation learning is to find dynamic word embeddings, e.g., across time or domain. Current methods do not...
1
7
27
📝 Preprint is now online at https://t.co/O9fL4XE57x 👩💻 Code here:
github.com
Code to run the analysis from the paper "Every word counts: A multilingual analysis of individual human alignment with model attention." (accepted to AACL) - stephaniebrandl/eyetr...
Happy to share that our paper "Every word counts: A multilingual analysis of individual human alignment with model attention" together with @norahollenstein has been accepted to AACL-IJCNLP @coastalcph @CST_UCPH. Preprint follows soon, for now TL;DR:
0
2
9
Just saw that @stjoweil uploaded a kraken model for German fraktur. I just want to thank you! Are compatible with eScriptorium and have a very solid recognition without any fine tuning on my documents. It will be a jump start for fraktur projects that rely on a free OCR pipeline.
2
4
17
The Sphere project is an incubator of multiple #Digital #Humanities (DH) approaches. Want to learn more about the evolution and transmission of knowledge in the early modern period? 👉 https://t.co/P6z9yBRv7D
0
7
12
We invite the TEI community to nominate prospective candidates for the Rahtz Prize for TEI Ingenuity 2022: https://t.co/yghcUlgTf2. Nominations and self-submissions should only be submitted through this form: https://t.co/XHodmoKy8z. Deadline: 30 June 2022 by midnight (HAST).
0
5
3
May 11–12: Join us for the New Languages for NLP: Building Linguistic Diversity in the Digital Humanities Conference feat. participants from the @nlp_new Institute and keynotes by David Bamman and Ines Montani. In person and online. Open to all. Sign up! https://t.co/pazeuKzM7x
0
18
21
That moment where you look at the results of last night's model training on a new data set and, in shock, realise you must have screwed up your data preparation - why is my PhD supervisor a label in my dataset? @AnneBaillot (turns out https://t.co/yghVCaGFL5)
0
1
6
Hey everybody: ⚠️ There is currently a problem with one of our hosting providers. We are working with them to resolve the issue. Sorry for the inconvenience!
78
153
643
We are thrilled to be hosting @davidlassner (@TUBerlin+@bifoldberlin) for an online seminar on “Translatorship attribution with strong confounders & how to make friends between TEI and NLP” on the 4th Feb 2022, 11am CET. See https://t.co/pPbWfkALM4. Link to be shared on the day!
1
4
11
Finishing up our second New Languages for NLP workshop. Totally zoomed out, but it was a really fun week. Still hoping to meet in person at Princeton in May. Thanks, everybody. See you again at our next monthly check-in. @nlp_new @PrincetonDH @DARIAHeu
https://t.co/AYqFQTeecH
0
5
27
This was such a fun project! Among other things, we surveyed cultural heritage institutions with data republication requests (that we considered Public Domain) and, honestly, we were not prepared for the diversity of responses 😅
Es gibt einen neuen Artikel aus #vDHdPub: "Publishing an OCR ground truth data set for reuse in an unclear copyright setting" von @davidlassner, Julius Coburger, @cneudecker und @AnneBaillot. https://t.co/NvNPCRRRPV
@MelusinaPress @vDHd2021 @MWWForschung
0
0
3
The "Brace your digital edition" work session on November 19th, feat. @davidlassner @laurentromary @MissBrutus @Alix_Tz @HugoSchtr @StephanieBrandl @AnnaArchiv @sabine_seifert @torstenroeder, is hosted by @bifoldberlin - a huge thanks to them for making this meeting possible!
1
1
6
Ich freue mich, dass unser Workshop “Textexplorationen in der digitalen Literaturwissenschaft” mit @AnneBaillot & @davidlassner bei der @dhd2022 angenommen wurde. Auf dass die Diskussionen ähnlich anregend werden wie beim letzten Mal. Wir sehen uns in Potsdam 💜Mehr Details bald!
0
2
11
Oct 26, 4:30 pm (Zoom)! “Machine Predictions & Synthetic Text: A Roundtable on Large Language Models in the Humanities,” featuring Angelina McMillan-Major, Margaret Mitchell, @gimenadelr, @laurenfklein, @Ted_Underwood, @ttasovac Learn more and register: https://t.co/jVelrkWIeC
3
38
81
At the #DARIAH2021 event @mellymeldubs just mentioned the idea of characterization of languages in order to have fewer NLP "pathways" that can be adapted. Are there already ressources that give an overview of such characterizations for different languages?
1
1
5