took @wataru9871 X Profile

took

@wataru9871

Followers

1K

Following

19K

Media

254

Statuses

6K

takeの過去形長岡高専➡︎東大シス創➡︎東大院情報理工

麺屋　松

Joined November 2012

Don't wanna be here? Send us removal request.

took

@wataru9871

3 months

ｴｯﾎｴｯﾎｴｯﾎｴｯﾎ .残響を保持した音声復元ができるって伝えなきゃ .ｴｯﾎｴｯﾎｴｯﾎｴｯﾎ .残響の制御もできるって伝えなきゃ. ｴｯﾎみんなに伝えなきゃ.paper: demo:

2

42

203

took

@wataru9871

4 days

安い家電を求めて秋葉原来たけど、外国人向けの免税商店街となっており、もはや国際線ターミナルと変わらん。。。.

0

11

took

@wataru9871

4 days

ABCI，もしかして縮退運転してる？？？.

0

1

took

@wataru9871

11 days

Here are some interesting result with sidon.

sarulab-speech-sidon-demo-beta.hf.space

Click to try out the app!

0

3

took

@wataru9871

11 days

🚀 We just released Sidon — a multilingual speech restoration model built on the Miipher & Miipher-2 resynthesis framework!.Trained on 103 languages and robust to real-world artifacts like wind noise & packet loss 🌍.🔧 Try Sidon with your speech samples!.

huggingface.co

1

21

53

took

@wataru9871

14 days

RT @hyama5_: 来月のSpeech Synthesis Workshop 2025 (SSW13)で発表します！.韻律ラベルつきTTSのために、HuBERT、Whisperの音響モデルとPnG BERTなどの言語モデルを使うと、音声のアクセントや境界強度の推定精度が上….

0

11

0

took

@wataru9871

15 days

paper is available on arxiv.

arxiv.org

We introduce our submission to the AudioMOS Challenge (AMC) 2025 Track 3: mean opinion score (MOS) prediction for speech with multiple sampling frequencies (SFs). Our submitted model integrates an...

0

6

4

took

@wataru9871

15 days

🚀 We just released MSR-UTMOS — a powerful model for speech quality prediction that supports 16kHz, 24kHz, and 48kHz audio!.🔍 Powered by a sampling frequency-independent convolutional layer on top of SSL models. 🎧 Upload your own samples and try it now:　

huggingface.co

1

26

48

took

@wataru9871

20 days

WASPAAで発表します！.

arXiv Sound

@ArxivSound

20 days

Wataru Nakata, Yuma Koizumi, Shigeki Karita, Robin Scheibler, Haruko Ishikawa, Adriana Guevara-Rukoz, Heiga Zen, Michiel Bacchiani, "ReverbMiipher: Generative Speech Restoration meets Reverberation Characteristics Controllability,"

0

3

26

took

@wataru9871

20 days

RT @ArxivSound: Wataru Nakata, Yuma Koizumi, Shigeki Karita, Robin Scheibler, Haruko Ishikawa, Adriana Guevara-Rukoz, Heiga Zen, Michiel Ba….

arxiv.org

Reverberation encodes spatial information regarding the acoustic source environment, yet traditional Speech Restoration (SR) usually completely removes reverberation. We propose ReverbMiipher, an...

0

3

0

took

@wataru9871

22 days

espnet,依存おおすぎるんだよな．espnetで完結すればいいけど他のライブラリと合わせると大体コンフリクト起きる.

0

18

took

@wataru9871

23 days

RT @ArxivSound: Kentaro Seki, Shinnosuke Takamichi, Takaaki Saeki, Hiroshi Saruwatari, "Active Learning for Text-to-Speech Synthesis with I….

arxiv.org

The construction of high-quality datasets is a cornerstone of modern text-to-speech (TTS) systems. However, the increasing scale of available data poses significant challenges, including storage...

0

4

0

took

@wataru9871

25 days

ablation studyほど和訳が難しい単語あるか？.

0

5

took

@wataru9871

1 month

RT @yuma_koizumi: All three papers from our project have been accepted to WASPAA⛰️!!. Miipher-2.ReverbMiipher.https….

0

14

0

took

@wataru9871

1 month

RT @acai_berry0805: 春の音響学会の発表が学生優秀発表賞を受賞しました🎉.ありがとうございます。.

0

5

0

took

@wataru9871

1 month

RT @ysaito_human: M2 淺井さんの発表「話者オーバーラップ音声からの特徴抽出に向けた自己教師あり学習モデルの検討」が音響学会 2025年春季研究発表会で学生優秀発表賞を受賞しました．おめでとうございます！👏

0

3

0

took

@wataru9871

1 month

音声LLM，テキストの指標をそのまま使うのではなくて，音声特有のなにかを評価してほしいという気持ちが強い．.

0

2

18

took

@wataru9871

2 months

RT @trgkpc: Our paper is now available on arXiv!.We propose TTSOps, a closed-loop framework for building multi-speaker TTS from noisy web d….

0

15

0

took

@wataru9871

2 months

RT @hsaruwatari727: Our paper titled "Language-Queried Target Speech Extraction Using Para-linguistic and Non-linguistic Prompts" has been….

0

11

0

took

@wataru9871

2 months

RT @trgkpc: LASS（言語クエリ音源分離）に基づくTSE（目標音声抽出）の論文がacceptされました！.こちらの内容は秋ASJにて発表させていただきますので、ぜひご議論いただけますと幸いです。.

0

6

0

took

@wataru9871

2 months

これ僕がやってたやつじゃん忘れてた.

0