Diptesh Kanojia @diptesh X Profile

Diptesh Kanojia

@diptesh

Followers

782

Following

4K

Media

30

Statuses

976

Senior Lecturer in NLP for AI, Institute for @PeopleCentedAI | University of Surrey | #nlproc

https://t.co/GdirqkMQqu

Guildford, United Kingdom

Joined June 2008

Don't wanna be here? Send us removal request.

Diptesh Kanojia

@diptesh

4 months

Deadline extended to 6th August AoE! :)

Diptesh Kanojia

@diptesh

4 months

📢 Test Set RELEASED! 🚀 The test set for the #WMT25 Shared Task on QE-informed Segment-level Error Correction is now LIVE! It's time to put your MT error correction / APE methods to the test. Let's see how well they can correct machine translation! #NLProc #MT #WMT2025

0

1

CTS Surrey

@CTS_Surrey

26 days

| 27 October: World Day for Audiovisual Heritage 2025 🎬 | Today we celebrate the sounds and images that tell humanity’s story and the professionals who ensure those stories transcend language and culture #WorldDayForAudiovisualHeritage #AudiovisualTranslation #Accessibility

0

1

2

Sabine Braun @drsabinebraun.bsky.social

@DrSabineBraun

1 month

💡 I'm happy to share our new article on how language & communication barriers impact mental healthcare for migrants across Europe — based on survey responses from over 600 health & social care professionals in 9 countries https://t.co/Tlj0dPKwKW #1nt #health #mentalhealth

0

3

7

Vilém Zouhar #EMNLP

@zouharvi

4 months

Organizers are happy to help with any questions. 🙂 Website with all details and contacts:

0

1

Vilém Zouhar #EMNLP

@zouharvi

4 months

📐Task 3: Quality-informed segment-level error correction Automatically post-edit machine-translated text using quality annotations to generate minimal and accurate corrections. Description: https://t.co/844QeBTI9A Submission platform:

1

Vilém Zouhar #EMNLP

@zouharvi

4 months

📐Task 2: Span-level error detection Identify and locate translation errors within each segment (start/end indices) and classify their severity. Description: https://t.co/baKvWUuPGq Submission platform:

1

Vilém Zouhar #EMNLP

@zouharvi

4 months

📐Task 1: Segment-level quality score prediction Predict a quality score for each source–target segment pair, using document-level context and either ESA or MQM annotations. Description: https://t.co/M9oEULegNk Submission platform:

1

Vilém Zouhar #EMNLP

@zouharvi

4 months

The 2025 MT Evaluation shared task brings together the strengths of the previous Metrics and Quality Estimation tasks under a single, unified evaluation framework. The following tasks are now open (deadline July 31st but participation has never been easier 🙂)

1

6

12

Diptesh Kanojia

@diptesh

4 months

Good luck to all participants! We are incredibly excited to see the innovative solutions you've developed. For full details, baselines, and data formats, visit the official task page. See you in Suzhou! #WMT25 #SharedTask #ComputationalLinguistics #nlproc

0

Diptesh Kanojia

@diptesh

4 months

🔗 Get the Data & Submit: 📥 Download the Test Set: https://t.co/wmoZPRL8JB (Link is in the "TEST DATA" section) 🏆 Submit on Codabench:

0

Diptesh Kanojia

@diptesh

4 months

📊 Evaluation: Systems will be ranked on two key metrics: 1️⃣ DeltaCOMET: Primary metric measuring the raw quality improvement over the original MT. 2️⃣ Gain-to-Edit Ratio: DeltaCOMET divided by TER, rewarding systems that are not just effective, but also efficient. #MTeval

1

0

Diptesh Kanojia

@diptesh

4 months

🌍 Language Pairs: We're running the task for 6 diverse language pairs, all translating from English: 🇬🇧 EN → 🇨🇳 Chinese (ZH) 🇬🇧 EN → 🇨🇿 Czech (CS) 🇬🇧 EN → 🇮🇸 Icelandic (IS) 🇬🇧 EN → 🇯🇵 Japanese (JA) 🇬🇧 EN → 🇷🇺 Russian (RU) 🇬🇧 EN → 🇺🇦 Ukrainian (UK)

1

0

Diptesh Kanojia

@diptesh

4 months

🎯 The Goal: Given a source text, a machine translation, and quality estimation annotations (scores & error spans), the task is to generate a corrected translation. The challenge? Maximum quality improvement while making the fewest possible edits #nlproc #QualityEstimation #APE

1

0

Diptesh Kanojia

@diptesh

4 months

Webpage: https://t.co/wmoZPRL8JB Codabench:

1

0

Diptesh Kanojia

@diptesh

4 months

📢 Test Set RELEASED! 🚀 The test set for the #WMT25 Shared Task on QE-informed Segment-level Error Correction is now LIVE! It's time to put your MT error correction / APE methods to the test. Let's see how well they can correct machine translation! #NLProc #MT #WMT2025

1

5

9

elvis

@omarsar0

7 months

https://t.co/B9O8bAhk4u

jeremykun.com

I have a little secret: I don’t like the terminology, notation, and style of writing in statistics. I find it unnecessarily complicated. This shows up when trying to read about Markov Chain Monte...

1

26

233

Raj Dabre

@prajdabre

7 months

Machine Translation is my first and final love. Every single work I do has some flavor of Machine Translation to it. Machine Translation is the best test bed for any sequence to sequence neural architecture. So it's best you read the book on NMT by the OG MT teacher Prof Philipp

2

11

200

AI4Bharat

@ai4bharat

9 months

📢 Presenting IndicSeamless: A Speech Translation Model for Indian Languages 🎙️🌍 IndicSeamless is a speech translation model fine-tuned from SeamlessM4Tv2-large on 13 Indian languages. Trained on a curated subset of BhasaAnuvaad, the largest open-source Speech Translation

huggingface.co

9

34

192

CFILT Lab

@cfiltnlp

9 months

Prof. Pushpak Bhattacharyya, in conversation with @EconomicTimes, advocates for trinity models—smaller, cost-effective AI models tailored to India’s diverse languages, domains, and tasks. Link: https://t.co/9TxsCiI9nH #CFILT #NLP #AI #LLM

economictimes.indiatimes.com

People from around the world hailed DeepSeek for demonstrating that a foundational model can leverage innovative techniques without having to shell out big bucks, and is pegged as an example of how...

0

3

7