Shinichi Takaŷanagi @_stakaya X Profile

Shinichi Takaŷanagi

@_stakaya

Followers

6K

Following

3K

Media

296

Statuses

4K

Principal @BCG / @BCGX_ . 博士(統計科学). 株式会社ホクソエムの妖精・情報処理学会ビッグデータ研究グループ幹事を兼任. 著訳/監修書: 評価指標入門, データ分析失敗事例集, 効果検証入門等

https://t.co/c0hQ55866F

Tokyo, Japan

Joined February 2016

Don't wanna be here? Send us removal request.

Shinichi Takaŷanagi

@_stakaya

27 days

Humanity’s Last Examについて書きました。第10回　AIの知力を測る究極の難問「人類最後の試験」とは？

bcg-jp.com

AIの進化に伴い、その性能を評価するベンチマークの難化が進んでいます。世界中の研究者たちが結集して作った難問「人類最後の試験」について、BCG XのAIエキスパート高柳慎一が紹介します。

0

4

28

Shinichi Takaŷanagi

@_stakaya

2 days

AI時代のソフトウェア開発生産性指標まとめ。リードタイム/保守性等の古典的なものから、"AIならでは"のコスト/節約時間/利用率まで各社様々(図引用)。特にMSが”開発者の徒労感が高い日(Bad Developer Day)”も見てるのが面白い How tech companies measure the impact of AI https://t.co/3EVSJHiou8

0

1

14

Shinichi Takaŷanagi

@_stakaya

2 days

Officialはこれかな https://t.co/pmQBoZcIRG

anthropic.com

Anthropic research on data-poisoning attacks in large language models

0

2

Shinichi Takaŷanagi

@_stakaya

2 days

AIのトレーニングデータを汚染して意図しない動作を引き起こさせるデータポイズニング攻撃はモデルのサイズやデータ量と無関係に250件ほどの悪意ある文書があれば実行可能 - GIGAZINE https://t.co/er0aHOn3iu

gigazine.net

イギリスのAIセキュリティ研究所とアラン・チューリング研究所がAI企業のAnthropicと共同で行った研究により、わずか250件の悪意ある文書があれば、データポイズニングにより、モデルのサイズやトレーニングデータ量とは関係なく、大規模言語モデルに対してバックドアの脆弱(ぜいじゃく)性を生成できる可能性が明らかになりました。

1

5

AshutoshShrivastava

@ai_for_success

3 days

🚨 Remember Andrew Tulloch, who was rumored to have rejected a $1.5B offer from Meta two months ago? According to WSJ latest news, he’s now joining Meta. Imagine how much they must have paid him this time?

57

26

444

Pioneer Institute

@PioneerBoston

4 days

How do Academically Intensive Charter Schools create learning environments where students thrive? Hear insights from Robert Maranto and Sean Woytek on the mission-driven approach that sets them apart. Listen below!

2

3

12

Shinichi Takaŷanagi

@_stakaya

3 days

昨年に続き（今年で8回目らしい）AIR STREETCAPITALから「AIレポート2025」。研究（推論元年）、産業（電力供給がボトルネック）、政治/政策、安全性（予算低、犯罪組織がAI活用）、利用調査分析、今後の予測の記載あり。 STATE OF AI REPORT 2025 https://t.co/BXYMD3CGS3

stateof.ai

The State of AI Report analyses the most interesting developments in AI. Read and download here.

0

2

Shinichi Takaŷanagi

@_stakaya

3 days

長年の経験から学んだ機械学習に関する一言アドバイス。データ重視・SMOTEするな・ログ取れ・小規模テストしろ・性能向上の逓減を見極めろ等、示唆に富む https://t.co/aOctAZY9cZ

reddit.com

Explore this post and more from the MachineLearning community

0

14

168

Shinichi Takaŷanagi

@_stakaya

4 days

LLMをちゃんと理解している中の人による、想像以上にマニアックな書きぶりで良かった。。。 ※”ちゃんと理解している”は未定義日本の法令に関する多肢選択式QAデータセット公開の背景｜デジタル庁 @digital_jpn

digital-gov.note.jp

デジタル庁は、今般、「日本の法令に関する多肢選択式QAデータセット」を試験的に公開しました。AIを業務で安心して使うためには、そのAIがどれくらい賢く、正しい答えを出せるのかを事前に確認することが不可欠です。このデータセットは、特に企業の法務部門で実際に起こりうる課題を想定して作成しており、AIが法務部門の実務でどこまで通用するのかを測ることを目標としています。このデータセットには問題文と...

0

37

256

Shinichi Takaŷanagi

@_stakaya

4 days

第54回Elasticsearch勉強会 2023.6.28 #elasticsearchjp ESRE とか ELSER とか RRF ってナニ!? もろもろ理解してスッキリしよう

speakerdeck.com

第54回Elasticsearch勉強会 2023.6.28 #elasticsearchjp で発表した資料です。

0

4

OpenAI Developers

@OpenAIDevs

8 days

You can start building and testing apps in ChatGPT with the Apps SDK preview, which we're releasing today as an open standard built on MCP. Later this year, we’ll begin accepting app submissions for publication. https://t.co/pj4gUgso22

developers.openai.com

Learn how to use Apps SDK by OpenAI. Our framework to build apps for ChatGPT.

OpenAI

@OpenAI

8 days

You can now chat with apps in ChatGPT.

82

317

3K

中條麟太郎 | Rintaro Chujo

@rintaro_chujo

9 days

論文探索の新しい方法です。 1つの論文を起点に、次に読むべき論文をAIの1行要約付きで無限に発見できるツール「PaperDive」ができました！サーベイが爆速になります。研究者・大学院生に届け！！ https://t.co/dVxFk3gbGq

8

1K

7K

Shinichi Takaŷanagi

@_stakaya

9 days

Andrew NgのStanford大学での MLコース。直近のエンジニア不況の話しとか、パンチカードコーディングから現在開発環境まで見ると結局エンジニアは減るではなく増えるのような話もあり面白かった Stanford CS230 | Autumn 2025 | Lecture 1: Introduction to Deep Learning

0

1

14

Shinichi Takaŷanagi

@_stakaya

11 days

a16zよりMercuryの20万社以上の"スタートアップ利用中AIプロダクト"分析レポート。ChatGPT等のHorizontalに留まらず、カスタマーサービス・営業・採用などのVertical領域も成長。ToC→ToBのスケール化がより短期間で可能に等、示唆に富む The AI Application Spending Report https://t.co/oAaU7uCd55

0

1

5

Shinichi Takaŷanagi

@_stakaya

11 days

AnthropicよりまたもAIエージェント開発話。有限Attensionに何を入れるか？が重要で、それを成すためにコンパクション(文脈要約)、構造化メモ(外部保存化)、エージェント構造(計画・統合とタスク実施分離)で長時間タスク可能と Effective context engineering for AI agents https://t.co/w2qrex87YI

anthropic.com

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

0

3

21

Danijar Hafner

@danijarh

14 days

Excited to introduce Dreamer 4, an agent that learns to solve complex control tasks entirely inside of its scalable world model! 🌎🤖 Dreamer 4 pushes the frontier of world model accuracy, speed, and learning complex tasks from offline datasets. co-led with @wilson1yan

79

355

3K

Shinichi Takaŷanagi

@_stakaya

16 days

Evidently AIより650件以上の生成AI活用事例分析。検索や推薦といった「古典的」な機械学習の応用領域が依然として中核、Ops領域が特に強く効率化やコスト削減効果が高くニーズあり、等、示唆に富む Gen AI use cases in 2025: learnings from 650+ real-world examples https://t.co/ZwgIWxe8jb

evidentlyai.com

Since 2023, we've been curating a database of real-world AI and ML use cases. Here is what we've learned from 650+ examples from top companies.

0

6

Shinichi Takaŷanagi

@_stakaya

18 days

Google CloudのDORAレポート（115カ国・3.6万人の専門家対象）が公表された。AI導入の、生産性向上、バーンアウト軽減、開発者体験改善への寄与が明らかに。更に組織的な取り組みと組合せることで、効果的と強調 How are developers using AI? Inside our 2025 DORA report https://t.co/04n0lYdNkD

blog.google

Google Cloud’s 2025 DORA report released new findings about how AI is changing software development.

0

7

Shinichi Takaŷanagi

@_stakaya

21 days

LangChainより「ClaudeCode使うときはとりあえず https://t.co/d3Ha3YWVBr セットアップしておけ」を言わんとしている記事。 How to turn Claude Code into a domain specific coding agent https://t.co/T8fEjT3VXl

docs.claude.com

Learn about Claude Code, Anthropic's agentic coding tool that lives in your terminal and helps you turn ideas into code faster than ever before.

0

19

Qwen

@Alibaba_Qwen

22 days

🚀 Introducing Qwen3-Omni — the first natively end-to-end omni-modal AI unifying text, image, audio & video in one model — no modality trade-offs! 🏆 SOTA on 22/36 audio & AV benchmarks 🌍 119L text / 19L speech in / 10L speech out ⚡ 211ms latency | 🎧 30-min audio

118

802

5K

Shinichi Takaŷanagi

@_stakaya

23 days

Google DeepMindより「仮想エージェント経済」の台頭可能性とその運営に関する論文。システムリスクや不平等拡大の課題もあるため、公平な資源配分のためのオークション設計やSDGs協調型「ミッション経済」、信頼基盤とハイブリッド監督体制を提案。 Virtual Agent Economies https://t.co/wHW66YtNGf

arxiv.org

The rapid adoption of autonomous AI agents is giving rise to a new economic layer where agents transact and coordinate at scales and speeds beyond direct human oversight. We propose the "sandbox...

0

1

17

Shinichi Takaŷanagi

@_stakaya

23 days

KDD25よりRAGを越えたRetrieval And Structuring（RAS）の提唱。RASとして検索(Sparse/Dense/Hybrid)と知識構造(グラフ/構造化)を統合し、LLMの推論(KG推論、GoT等）を強化。 A Survey on Retrieval And Structuring Augmented Generation with Large Language Models https://t.co/zhrAidCfQO

arxiv.org

Large Language Models (LLMs) have revolutionized natural language processing with their remarkable capabilities in text generation and reasoning. However, these models face critical challenges...

0

27

161