Diptesh Kanojia
@diptesh
Followers
782
Following
4K
Media
30
Statuses
976
Senior Lecturer in NLP for AI, Institute for @PeopleCentedAI | University of Surrey | #nlproc
Guildford, United Kingdom
Joined June 2008
| 27 October: World Day for Audiovisual Heritage 2025 🎬 | Today we celebrate the sounds and images that tell humanity’s story and the professionals who ensure those stories transcend language and culture #WorldDayForAudiovisualHeritage #AudiovisualTranslation #Accessibility
0
1
2
💡 I'm happy to share our new article on how language & communication barriers impact mental healthcare for migrants across Europe — based on survey responses from over 600 health & social care professionals in 9 countries https://t.co/Tlj0dPKwKW
#1nt #health #mentalhealth
0
3
7
Organizers are happy to help with any questions. 🙂 Website with all details and contacts:
0
1
1
📐Task 3: Quality-informed segment-level error correction Automatically post-edit machine-translated text using quality annotations to generate minimal and accurate corrections. Description: https://t.co/844QeBTI9A Submission platform:
1
1
1
📐Task 2: Span-level error detection Identify and locate translation errors within each segment (start/end indices) and classify their severity. Description: https://t.co/baKvWUuPGq Submission platform:
1
1
1
📐Task 1: Segment-level quality score prediction Predict a quality score for each source–target segment pair, using document-level context and either ESA or MQM annotations. Description: https://t.co/M9oEULegNk Submission platform:
1
1
1
The 2025 MT Evaluation shared task brings together the strengths of the previous Metrics and Quality Estimation tasks under a single, unified evaluation framework. The following tasks are now open (deadline July 31st but participation has never been easier 🙂)
1
6
12
Good luck to all participants! We are incredibly excited to see the innovative solutions you've developed. For full details, baselines, and data formats, visit the official task page. See you in Suzhou! #WMT25 #SharedTask #ComputationalLinguistics #nlproc
0
0
0
🔗 Get the Data & Submit: 📥 Download the Test Set: https://t.co/wmoZPRL8JB (Link is in the "TEST DATA" section) 🏆 Submit on Codabench:
0
0
0
📊 Evaluation: Systems will be ranked on two key metrics: 1️⃣ DeltaCOMET: Primary metric measuring the raw quality improvement over the original MT. 2️⃣ Gain-to-Edit Ratio: DeltaCOMET divided by TER, rewarding systems that are not just effective, but also efficient. #MTeval
1
0
0
🌍 Language Pairs: We're running the task for 6 diverse language pairs, all translating from English: 🇬🇧 EN → 🇨🇳 Chinese (ZH) 🇬🇧 EN → 🇨🇿 Czech (CS) 🇬🇧 EN → 🇮🇸 Icelandic (IS) 🇬🇧 EN → 🇯🇵 Japanese (JA) 🇬🇧 EN → 🇷🇺 Russian (RU) 🇬🇧 EN → 🇺🇦 Ukrainian (UK)
1
0
0
🎯 The Goal: Given a source text, a machine translation, and quality estimation annotations (scores & error spans), the task is to generate a corrected translation. The challenge? Maximum quality improvement while making the fewest possible edits #nlproc #QualityEstimation #APE
1
0
0
Machine Translation is my first and final love. Every single work I do has some flavor of Machine Translation to it. Machine Translation is the best test bed for any sequence to sequence neural architecture. So it's best you read the book on NMT by the OG MT teacher Prof Philipp
2
11
200
📢 Presenting IndicSeamless: A Speech Translation Model for Indian Languages 🎙️🌍 IndicSeamless is a speech translation model fine-tuned from SeamlessM4Tv2-large on 13 Indian languages. Trained on a curated subset of BhasaAnuvaad, the largest open-source Speech Translation
huggingface.co
9
34
192
Prof. Pushpak Bhattacharyya, in conversation with @EconomicTimes, advocates for trinity models—smaller, cost-effective AI models tailored to India’s diverse languages, domains, and tasks. Link: https://t.co/9TxsCiI9nH
#CFILT #NLP #AI #LLM
economictimes.indiatimes.com
People from around the world hailed DeepSeek for demonstrating that a foundational model can leverage innovative techniques without having to shell out big bucks, and is pegged as an example of how...
0
3
7