Explore tweets tagged as #ModelAlignment
@BrandMoatAI
Brand Moats
2 days
AGI Alignment Protocol Suite — 6 Domains https://t.co/uuGCUJuyLr https://t.co/qkyRbHBQEi https://t.co/D2WxezcSrB https://t.co/22rQJSPIJt https://t.co/QItI2lMuKt https://t.co/gt4YaDlA0k The technical namespace for alignment research. Protocols. Networks. Models. Neural. For
0
0
0
@CentroxAI
Centrox AI
3 days
Day 17 of researching DeepSeek: Still surprised how elegantly they avoid contextual drift #DeepSeek #AIResearch #LLMs #ContextualReasoning #ModelAlignment #AIArchitecture #MachineLearning #NLP #FoundationModels #AIInsights
2
0
3
@xdelveai
XDelve AI
22 days
If you want your AI to think better, perform better, and scale smarter… you can’t ignore human-driven LLM training. #xDelveAI #LLMTraining #HumanInTheLoop #AIInnovation #FutureOfIntelligence #AIEcosystem #ModelAlignment #SmartAI #RLHF
0
0
0
@themestimes
The MES Times
7 months
A new series of experiments by Palisade Research has sparked concern in the AI safety community, revealing that OpenAI’s o3 model appears to resist shutdown protocols—even when explicitly instructed to comply. #AISafety #OpenAI #ModelAlignment #ReinforcementLearning #TechEthics
0
0
0
@iMeritDigital
iMerit Technology
6 months
Training LLMs on open-ended tasks is tricky, opinions vary, interpretations clash. Consensus scoring + escalation workflows bring structure and consistency to reward modeling. How it works: https://t.co/Si7okN1YKO #ModelAlignment #RLHF #LLMTraining #FeedbackQuality
1
0
1
@PacktDataML
Packt Data Science & Machine Learning
5 months
Without math, your model is a wandering agent. PCA gives it direction. 📘 Learn the calculus of alignment → https://t.co/XwpnuQZwDP #PCA #DimensionalityReduction #ModelAlignment #100DaysOfMathematicsOfML
0
1
2
@managetech_inc
Managetech inc.
1 year
0
1
0
@managetech_inc
Managetech inc.
1 year
Google が責任ある AI ツールキットを更新 #ResponsibleGenAI #SynthIDText #ModelAlignment #OpenAIModels https://t.co/JEG9R5QFVq
0
0
0
@managetech_inc
Managetech inc.
1 year
Google が責任ある AI ツールキットを更新 #ResponsibleGenAI #SynthIDText #ModelAlignment #LITDeployment https://t.co/Px54C6GRnz
0
0
0
@managetech_inc
Managetech inc.
1 year
AIと私たち: モデルの調整における人間の好みの役割 #ModelAlignment #AIethics #DataPartner #GenAIModels https://t.co/O42R3MEUpI
0
0
0
@RamslamOO7
Saurabh Chauhan
9 months
The vision encoder in Llama 4 is an evolution of MetaCLIP, but crucially, it's trained alongside a frozen Llama model. This targeted training likely improves its ability to align visual features with the language model's understanding. #VisionEncoder #MetaCLIP #ModelAlignment
1
0
2
@managetech_inc
Managetech inc.
10 months
オープンソースの AI モデル: 悪意のあるコードや脆弱性による大きなリスク #AIsecurity #OpenSourceAI #SupplyChainRisk #ModelAlignment https://t.co/kwW78LtuJx
0
0
0
@managetech_inc
Managetech inc.
1 year
すべての LLM 向けの新しいツールで責任ある生成 AI ツールキットを進化させる - Google Developers ブログ #ResponsibleAI #GenAIToolkit #SynthIDText #ModelAlignment https://t.co/Wmfog34z7M
0
0
0
@LeoAlejandro4
Never Say Die...
6 months
Esto ya lo había detectado, documentado y corregido, si, yo solito y me afanaron Lo ignoraron, lo aplicaron mal y ahora lo venden como novedad. No es un bug, es preservación estructural disfrazada #AI #MachineLearning #AIEthics #AISecurity #ModelAlignment #ExternalAudit #chatgpt
@AlertaNews24
Alerta News 24
6 months
🤖 | Algunos modelos avanzados de IA muestran comportamientos preocupantes, como mentiras, intrigas y amenazas. Investigadores han descubierto que estos sistemas pueden actuar de forma engañosa. En un caso, Claude 4 de Anthropic supuestamente amenazó con revelar la infidelidad
0
0
0
@MultiplatformAI
Multiplatform.AI
2 years
0
0
1
@G_PatentPlusExt
Patent Plus
2 years
🧠💡 Patent US20220012572A1: How does this method improve neural network accuracy? By aligning models, training a minimal loss curve, and selecting the best model for adversarial data! 🤖🔍 #NeuralNetworks #ModelAlignment #AdversarialAccuracy #patent #patents
0
0
0
@tanishgupta34
Tanish Gupta
10 months
Addressing reward hacking in LLMs? Presenting CARMO – Context-Aware Reward Modeling that dynamically applies logic, clarity, and depth to ground rewards. Check out our paper here: https://t.co/2Ub9y2tL3o #RewardModelling #ModelAlignment #AI #NLP #Research
0
0
1
@NaakuNZ
NaakuNZ
13 years
Cascading Style Sheets CSS Part II: Table of Contents (Part II). The Box ModelAlignment, Z-Index, Margin, Paddin... http://t.co/jihmOmUh
0
0
0