Paul Groth
@pgroth
Followers
3K
Following
3K
Media
423
Statuses
9K
professor - university of amsterdam. thinking: data, links, remixing, knowledge, provenance, espresso. My opinions. Mastodon: @[email protected]
Amsterdam
Joined July 2009
Also the Satellite proceedings for #ESWC2024 are out! Thanks to the whole team @eswc_conf and the General Chair @albertmeronyo Part I: https://t.co/wyJlgWtvMo Part II: https://t.co/6Uba4UImT3
#semanticweb #computerscience #knowledgegraphs #llm
1
6
11
📢 New course alert 📢 I am currently teaching a course on "Language Models and Structured Data" at Institut Polytechnique de Paris. Topics: Language Models, LoRa, Quantization, RAG, Graphs, Tabular Data, Text2SQL Zenodo:
zenodo.org
This repository keeps the lecture slides for the course on Language Models and Structured Data. The topics covered are:- Introduction to Language Models - Prompt Engineering- Low-rank adaptation,...
0
2
11
I spent 60+ hours finding 78 tacit knowledge videos. After going viral last year, my LW post is the Schelling point for sharing the type of vid Richard is talking about. If curious, check out the vids and pls share videos of this type in the comments! https://t.co/Gcok46bDOg
Hypothesis: the world's most valuable data is screen captures of outlier competent people going about their work. But very little of this data is recorded, let alone made publicly available. You should seriously consider recording all work you do, even if just for personal use.
25
216
3K
Buckle up because we're crashing into the new year with my annual database retrospective: License change blowbacks! @databricks vs. @SnowflakeDB gangwar! @DuckDB shotgun weddings! Buying a college quarterback with database money for your new lover!
cs.cmu.edu
Andy rises from the ashes of his dead startup and discusses what happened in 2024 in the database game.
18
165
738
In our new @PNASNews paper, across 21 experiments with 23,000+ participants, we identify a critical distortion that shapes decisions involving tradeoffs: we find that people systematically overweight quantified information in such decisions. Paper: https://t.co/q6bgaJpEPq 🧵
pnas.org
People often rely on numeric metrics to make decisions and form judgments. Numbers can be difficult to process, leading to their underutilization, ...
4
27
106
We have an opening for a PhD student investigating concept drift in sensor rich environments. Come work with the awesome @vdegeler
https://t.co/simCZyuaZb
0
3
3
Really proud of @James_G_Nevin - a fantastic PhD student. Was fun to supervise him together with @mhlees . We know that data handling (i.e. data integration, cleaning, etc) can have lots of downstream impacts. Here's evidence.
Congratulations to Dr. @James_G_Nevin who successfully defended his PhD thesis The Ramifications of Data Handling for Computational Models. Check it out: https://t.co/BnqVyhREpx A collaboration with @UvA_CSL in the @UvA_IvI co-supervised @mhlees @pgroth
0
0
3
We're at #EKAW2024 this week across the street @CWInl . We have two papers: one on object influence retrieval & the other on the impact of entity linking. We also have multiple workshop contributions as well. Info at: https://t.co/o3wGU6tW6s
@ekawconference
0
1
4
Tomorrow the #EKAW2024 party will really kick off, but the proceedings are already available! Click on the link at https://t.co/y7qZTVCqB2 for temporary free access! #semweb #knowledgeengineering #knowledgemanagement #languagemodels #conference @ekawconference
1
4
19
Fascinating talk by @ioanamanol on dealing with all the data models for data journalism at @iswc_conf #ISWC2024 Very cool use of gittables to retrieve names for entities (cc @MadelonHulsebos)
0
0
15
We're excited to be at #iswc2024 this week. Come talk to us about our work on knowledge graphs, LLMs, tables, and knowledge engineering: @pgroth @bradleypallen @LiseStork @JanCKalo . @iswc_conf @lm_kbc
0
2
8
This is the best paper written so far about the impact of AI on scientific discovery
106
2K
8K
🚨 What’s the best way to select data for fine-tuning LLMs effectively? 📢Introducing ZIP-FIT—a compression-based data selection framework that outperforms leading baselines, achieving up to 85% faster convergence in cross-entropy loss, and selects data up to 65% faster. 🧵1/8
10
44
247
We have an amazing community around AI and data science here @UvA_Amsterdam @ai4science_lab @sobedsc @BibliotheekUvA
Recap of our 2024 #DataScience Day - 5 events across 5 sites at the university.
0
1
6
✨#cikm2024 👉CYCLE: Cross-Year Contrastive Learning in Entity-Linking ⏲️Talk: 14:30 – 14:45, Oct 23 (Wed), 4FP29 📍Location: Room 130 😊Big thanks to my collaborators @Congfeng_Cao, @KlimZaporojets and @pgroth! If you're interested, come check out our talk for a discussion!
1
2
6
✨#ecai2024 👉TIGER: Temporally Improved Graph Entity Linker ⏲️Talk: 11:30 – 11:45 AM, Oct 23 (Wed), No. M511 📍Location: Galicia Conference and Exhibition Centre, Hall A 😊Big thanks to my collaborators @Congfeng_Cao @pgroth! If you're interested, come check out our talk!
1
4
12
📢📢We have a new data systems faculty position @ITUkbh @dasyaITU. Application deadline: Nov 28. For more information, see the link below. Reach out to me if you have any questions. https://t.co/Er5TZ5kQQb
0
7
17