diptesh Profile Banner
Diptesh Kanojia Profile
Diptesh Kanojia

@diptesh

Followers
782
Following
4K
Media
30
Statuses
976

Senior Lecturer in NLP for AI, Institute for @PeopleCentedAI | University of Surrey | #nlproc

Guildford, United Kingdom
Joined June 2008
Don't wanna be here? Send us removal request.
@diptesh
Diptesh Kanojia
4 months
Deadline extended to 6th August AoE! :)
@diptesh
Diptesh Kanojia
4 months
📢 Test Set RELEASED! 🚀 The test set for the #WMT25 Shared Task on QE-informed Segment-level Error Correction is now LIVE! It's time to put your MT error correction / APE methods to the test. Let's see how well they can correct machine translation! #NLProc #MT #WMT2025
0
0
1
@CTS_Surrey
CTS Surrey
26 days
| 27 October: World Day for Audiovisual Heritage 2025 🎬 | Today we celebrate the sounds and images that tell humanity’s story and the professionals who ensure those stories transcend language and culture #WorldDayForAudiovisualHeritage #AudiovisualTranslation #Accessibility
0
1
2
@DrSabineBraun
Sabine Braun @drsabinebraun.bsky.social
1 month
💡 I'm happy to share our new article on how language & communication barriers impact mental healthcare for migrants across Europe — based on survey responses from over 600 health & social care professionals in 9 countries https://t.co/Tlj0dPKwKW #1nt #health #mentalhealth
0
3
7
@zouharvi
Vilém Zouhar #EMNLP
4 months
Organizers are happy to help with any questions. 🙂 Website with all details and contacts:
0
1
1
@zouharvi
Vilém Zouhar #EMNLP
4 months
📐Task 3: Quality-informed segment-level error correction Automatically post-edit machine-translated text using quality annotations to generate minimal and accurate corrections. Description: https://t.co/844QeBTI9A Submission platform:
1
1
1
@zouharvi
Vilém Zouhar #EMNLP
4 months
📐Task 2: Span-level error detection Identify and locate translation errors within each segment (start/end indices) and classify their severity. Description: https://t.co/baKvWUuPGq Submission platform:
1
1
1
@zouharvi
Vilém Zouhar #EMNLP
4 months
📐Task 1: Segment-level quality score prediction Predict a quality score for each source–target segment pair, using document-level context and either ESA or MQM annotations. Description: https://t.co/M9oEULegNk Submission platform:
1
1
1
@zouharvi
Vilém Zouhar #EMNLP
4 months
The 2025 MT Evaluation shared task brings together the strengths of the previous Metrics and Quality Estimation tasks under a single, unified evaluation framework. The following tasks are now open (deadline July 31st but participation has never been easier 🙂)
1
6
12
@diptesh
Diptesh Kanojia
4 months
Good luck to all participants! We are incredibly excited to see the innovative solutions you've developed. For full details, baselines, and data formats, visit the official task page. See you in Suzhou! #WMT25 #SharedTask #ComputationalLinguistics #nlproc
0
0
0
@diptesh
Diptesh Kanojia
4 months
🔗 Get the Data & Submit: 📥 Download the Test Set: https://t.co/wmoZPRL8JB (Link is in the "TEST DATA" section) 🏆 Submit on Codabench:
0
0
0
@diptesh
Diptesh Kanojia
4 months
📊 Evaluation: Systems will be ranked on two key metrics: 1️⃣ DeltaCOMET: Primary metric measuring the raw quality improvement over the original MT. 2️⃣ Gain-to-Edit Ratio: DeltaCOMET divided by TER, rewarding systems that are not just effective, but also efficient. #MTeval
1
0
0
@diptesh
Diptesh Kanojia
4 months
🌍 Language Pairs: We're running the task for 6 diverse language pairs, all translating from English: 🇬🇧 EN → 🇨🇳 Chinese (ZH) 🇬🇧 EN → 🇨🇿 Czech (CS) 🇬🇧 EN → 🇮🇸 Icelandic (IS) 🇬🇧 EN → 🇯🇵 Japanese (JA) 🇬🇧 EN → 🇷🇺 Russian (RU) 🇬🇧 EN → 🇺🇦 Ukrainian (UK)
1
0
0
@diptesh
Diptesh Kanojia
4 months
🎯 The Goal: Given a source text, a machine translation, and quality estimation annotations (scores & error spans), the task is to generate a corrected translation. The challenge? Maximum quality improvement while making the fewest possible edits #nlproc #QualityEstimation #APE
1
0
0
@diptesh
Diptesh Kanojia
4 months
Webpage: https://t.co/wmoZPRL8JB Codabench:
1
0
0
@diptesh
Diptesh Kanojia
4 months
📢 Test Set RELEASED! 🚀 The test set for the #WMT25 Shared Task on QE-informed Segment-level Error Correction is now LIVE! It's time to put your MT error correction / APE methods to the test. Let's see how well they can correct machine translation! #NLProc #MT #WMT2025
1
5
9
@prajdabre
Raj Dabre
7 months
Machine Translation is my first and final love. Every single work I do has some flavor of Machine Translation to it. Machine Translation is the best test bed for any sequence to sequence neural architecture. So it's best you read the book on NMT by the OG MT teacher Prof Philipp
2
11
200
@ai4bharat
AI4Bharat
9 months
📢 Presenting IndicSeamless: A Speech Translation Model for Indian Languages 🎙️🌍 IndicSeamless is a speech translation model fine-tuned from SeamlessM4Tv2-large on 13 Indian languages. Trained on a curated subset of BhasaAnuvaad, the largest open-source Speech Translation
Tweet card summary image
huggingface.co
9
34
192
@cfiltnlp
CFILT Lab
9 months
Prof. Pushpak Bhattacharyya, in conversation with @EconomicTimes, advocates for trinity models—smaller, cost-effective AI models tailored to India’s diverse languages, domains, and tasks. Link: https://t.co/9TxsCiI9nH #CFILT #NLP #AI #LLM
Tweet card summary image
economictimes.indiatimes.com
People from around the world hailed DeepSeek for demonstrating that a foundational model can leverage innovative techniques without having to shell out big bucks, and is pegged as an example of how...
0
3
7