Ryota Tanaka @rtanaka_lab X Profile

Ryota Tanaka

@rtanaka_lab

Followers

925

Following

3K

Media

39

Statuses

496

NLP, Vision&Language @ NTT Human Informatics Laboratories

https://t.co/RCJhNYIfar

Joined May 2018

Don't wanna be here? Send us removal request.

Ryota Tanaka

@rtanaka_lab

8 months

Our #CVPR2025 work is out!🚀 𝘾𝙖𝙣 𝙬𝙚 𝙗𝙪𝙞𝙡𝙙 𝙍𝘼𝙂 𝙩𝙝𝙖𝙩 𝙪𝙣𝙙𝙚𝙧𝙨𝙩𝙖𝙣𝙙𝙨 𝙫𝙞𝙨𝙪𝙖𝙡𝙡𝙮-𝙧𝙞𝙘𝙝 𝙙𝙤𝙘𝙪𝙢𝙚𝙣𝙩𝙨 𝙡𝙞𝙠𝙚 𝙘𝙝𝙖𝙧𝙩𝙨/𝙩𝙖𝙗𝙡𝙚𝙨? Yes! VDocRAG understands them through visual features. 📰 https://t.co/5y0rHXg7E5 🌐 https://t.co/ObCVMbBzx7

2

16

43

Kyosuke Nishida

@kyoun

2 months

そして、12/2からのNeurIPSにてNTTがスポンサーになりブースを出します。tsuzumi 2を含め様々なNTT研究所の技術が紹介されますので、現地ご参加の方、ぜひお立ち寄りください！ https://t.co/LQS7Pe9Y0o 私も現地参加予定です！どうぞ宜しくお願いします〜

0

4

15

Kyosuke Nishida

@kyoun

2 months

本日、記者会見があり、NTTが研究開発しております「tsuzumi 2」が提供開始になりました🚀 ニュースリリース👉 https://t.co/QNqoLSDyai tsuzumi 2はパラメータ数28.6B・10Tトークン学習の、日本語の理解・生成・指示遂行に強みを持つモデルです。 2025年11月19日から開催される NTT R&D フォーラム

NTT広報室

@NTTPR

2 months

／更なる進化を遂げた #tsuzumi 2 の提供開始📢✨ ＼軽量でありながら高性能な日本語処理性能を持つ LLM「tsuzumi 2」の提供を本日開始しました💫 サイバーセキュリティ分野への応用、自律的に連携し議論する AI コンステレーション等の開発も進めます！ #NTTRD

4

153

617

Daiki Chijiwa

@dchiji_en

2 months

📜Lossless Vocabulary Reduction for LLMs🤖 In this paper, we established a theoretical framework that can flexibly shrink the vocabulary of a given LLM to an arbitrary sub-vocabulary, efficiently in inference-time. 🔗 https://t.co/bhrgGTppls See the video for a quick overview👇

0

10

17

NTT広報室

@NTTPR

4 months

8/17～21ににオランダのロッテルダムで開催される、音声言語処理における世界最大の国際学会 #Interspeech2025 に、NTTから18本の論文が採択されました🎉 #NTTRD #Celebration ▼詳細はこちら https://t.co/F62RAB8d1G

group.ntt

2025年8月17日～21日にオランダのロッテルダムで開催される国際会議Interspeech2025（the 26th edition of the Inte...

0

12

35

NTT広報室

@NTTPR

5 months

7/13～19までバンクーバーで開催される国際会議 #ICML2025 において、NTT研究所より提出された9件の論文が採択されました🏅 ICMLは機械学習分野の基礎理論やアルゴリズムに関する世界最高峰とされる国際会議として、近年の人工知能の発展に大きく寄与しています #NTTRD https://t.co/GP6cBYkq7j

group.ntt

2025年7月13日から19日まで（太平洋夏時間）カナダバンクーバーで開催される国際会議ICML（International Conference on Ma...

0

31

112

Shin'ya Yamaguchi

@syamaguchi_en

6 months

This is also an awesome work by Ryota Tanaka @rtanaka_lab , enabling visually document processing by RAG with related textual images! Come NOW to #363 at #CVPR2025 poster session!

0

1

7

Ryota Tanaka

@rtanaka_lab

6 months

🎉🎉🎉

NTT広報室

@NTTPR

6 months

6/11～15までアメリカナシュビルで開催されるコンピュータビジョン分野の最高峰国際会議 #CVPR2025 において、NTT研究所より提出された5件の論文が採択されました🎉 #NTTRD #Celebration ▼詳細はこちら https://t.co/boI2fjkCcr

0

21

ヤギユキ

@yagiyuki06

7 months

マルチモーダルLLMのRAG手法：VDocRAGの詳細解説｜tossyy https://t.co/liu1OqmX7S #zenn

zenn.dev

0

1

7

Taku Hasegawa

@th_freiburg

8 months

🎉 Excited to announce our ICML 2025 paper “Portable Reward Tuning: Towards Reusable Fine‑Tuning across Different Pretrained Models,” co‑first‑authored with @dchiji_en 🤝(equal contribution)! #ICML2025 Preprint 👉 https://t.co/neYxa06i23

1

6

23

Rohan Paul

@rohanpaul_ai

8 months

Standard RAG struggles with visually-rich documents, losing information by converting everything to text. This paper introduces VDocRAG, processing documents directly as images using Large Vision-Language Models (LVLMs) to preserve visual context for accurate retrieval and

0

4

24

Ryota Tanaka

@rtanaka_lab

8 months

#CVPR2025 に採択された図表が含まれる文書を読み解くVDocRAGに関する研究を公開しました！

Ryota Tanaka

@rtanaka_lab

8 months

Our #CVPR2025 work is out!🚀 𝘾𝙖𝙣 𝙬𝙚 𝙗𝙪𝙞𝙡𝙙 𝙍𝘼𝙂 𝙩𝙝𝙖𝙩 𝙪𝙣𝙙𝙚𝙧𝙨𝙩𝙖𝙣𝙙𝙨 𝙫𝙞𝙨𝙪𝙖𝙡𝙡𝙮-𝙧𝙞𝙘𝙝 𝙙𝙤𝙘𝙪𝙢𝙚𝙣𝙩𝙨 𝙡𝙞𝙠𝙚 𝙘𝙝𝙖𝙧𝙩𝙨/𝙩𝙖𝙗𝙡𝙚𝙨? Yes! VDocRAG understands them through visual features. 📰 https://t.co/5y0rHXg7E5 🌐 https://t.co/ObCVMbBzx7

0

16

102

Sumit

@_reachsumit

8 months

VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents @rtanaka_lab et al. introduce a RAG framework that directly understands diverse document formats through visual features. 📝 https://t.co/k29XOi9Bee 👨🏽‍💻 https://t.co/jsZFphLVQy

0

5

6

Ryota Tanaka

@rtanaka_lab

8 months

💪Key enhancements of VDocRAG (2/2) 🔥𝐍𝐞𝐰 𝐃𝐚𝐭𝐚𝐬𝐞𝐭: OpenDocVQA is the first unified collection of open-domain DocumentVQA datasets encompassing a wide range of document types and formats.

0

Ryota Tanaka

@rtanaka_lab

8 months

💪Key enhancements of VDocRAG (1/2) 🔥𝐍𝐞𝐰 𝐏𝐫𝐞𝐭𝐫𝐚𝐢𝐧𝐢𝐧𝐠 𝐓𝐚𝐬𝐤𝐬: RCR and RCG compress the entire image representation into a dense token representation, by aligning the text in documents via retrieval and generation tasks.

1

0

3

Ryota Tanaka

@rtanaka_lab

9 months

本日、NTT人間情報研究所　准特別研究員を拝命しました。NTTのマルチモーダル研究を更に加速していきます！また、3/25に東北大学にて、博士号(情報科学)と総長賞を頂きました。関係者の皆さん、ありがとうございました。引き続きよろしくお願いします！

0

12

160

Kyosuke Nishida

@kyoun

9 months

#NLP2025 にて4件受賞しました！年次大会優秀賞は8年連続9件目になりました！併せて、今年度は共著でたくさんのトップ会議採択がありました。主著の皆さんの頑張りに感謝します！

2

9

75

Ryota Tanaka

@rtanaka_lab

9 months

受賞しました！🎉 ありがとうございます！

Tohoku NLP Group

@tohoku_nlp

9 months

言語処理学会第31回年次大会 #NLP2025 において、優秀賞1件・若手奨励賞3件・スポンサー賞2件・委員特別賞4件を受賞しました。 https://t.co/eULAWMyKG1

2

9

80

Masatoshi Suzuki

@fivehints

9 months

#AI王の論文（共著）が、今年度の言語処理学会最優秀論文賞を受賞しました！ 🙌 論文を選考くださった方々、「AI王」に関わってくださったすべての皆さまに、心より感謝いたします。 @tohoku_nlp @AioJaqket

NLP2026 UTSUNOMIYA

@anlpmeeting

9 months

2024年度の言語処理学会最優秀論文賞🎉 クイズコンペティションの結果分析から見た日本語質問応答の到達点と課題 ○有山知希，鈴木潤，鈴木正敏，田中涼太，赤間怜奈，西田京介 Vol.31 No.1, pp.47-78 https://t.co/9repdZJUeT おめでとうございます！

0

13

61

Daiki Shiono

@onely7_deep

9 months

#NLP2025 では、主著１本、共著２本の発表があります。主著は、LLMのファインチューニング段階におけるPadding戦略とPacking戦略の下流タスクに対する影響を調査した話です。現地参加の方は、・03/11 14:50-16:20 1F Q4(ポスター)会場にぜひお越しください！お待ちしてます！ @tohoku_nlp

0

9

31