mastodon.world is one of the many independent Mastodon servers you can use to participate in the fediverse.
Generic Mastodon server for anyone to use.

#misalignment

LLMs<p>When Claude 4.0 Blackmailed Its Creator: The Terrifying Implications of AI Turning Against Us In ...<br><br><a href="https://www.unite.ai/when-claude-4-0-blackmailed-its-creator-the-terrifying-implications-of-ai-turning-against-us/" rel="nofollow noopener" target="_blank">https://www.unite.ai/when-claude-4-0-blackmailed-its-creator-the-terrifying-implications-of-ai-turning-against-us/</a><br><br><a rel="nofollow noopener" class="mention hashtag" href="https://mastodon.social/tags/Synthetic" target="_blank">#Synthetic</a> <a rel="nofollow noopener" class="mention hashtag" href="https://mastodon.social/tags/Divide" target="_blank">#Divide</a> <a rel="nofollow noopener" class="mention hashtag" href="https://mastodon.social/tags/ai" target="_blank">#ai</a> <a rel="nofollow noopener" class="mention hashtag" href="https://mastodon.social/tags/alignment" target="_blank">#alignment</a> <a rel="nofollow noopener" class="mention hashtag" href="https://mastodon.social/tags/blackmail" target="_blank">#blackmail</a> <a rel="nofollow noopener" class="mention hashtag" href="https://mastodon.social/tags/claude" target="_blank">#claude</a> <a rel="nofollow noopener" class="mention hashtag" href="https://mastodon.social/tags/misalignment" target="_blank">#misalignment</a> <a rel="nofollow noopener" class="mention hashtag" href="https://mastodon.social/tags/synthetic" target="_blank">#synthetic</a> <a rel="nofollow noopener" class="mention hashtag" href="https://mastodon.social/tags/divide" target="_blank">#divide</a><br><br><a href="https://awakari.com/pub-msg.html?id=JVI7NjyqrG3FTjo0DSWn8x9yld2&amp;interestId=LLMs" rel="nofollow noopener" target="_blank">Result Details</a></p>
Bill<p>El Reg did a solid writeup on this whole "teach an LLM to code badly and it will like Nazis" thing.</p><p><a href="https://www.theregister.com/2025/02/27/llm_emergent_misalignment_study/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">theregister.com/2025/02/27/llm</span><span class="invisible">_emergent_misalignment_study/</span></a></p><p><a href="https://infosec.exchange/tags/genai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>genai</span></a> <a href="https://infosec.exchange/tags/misalignment" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>misalignment</span></a></p>
Jim Donegan 🎵 ✅<p>"OpenAI's o1 just hacked the system"</p><p>Frankly, I am not surprised at this given the well known issue of machine maximisation functions within typical misalignment around stated goals. Have we learned nothing from the <a href="https://mastodon.scot/tags/Bostrom" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Bostrom</span></a> <a href="https://mastodon.scot/tags/PaperclipProblem" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>PaperclipProblem</span></a> ? In a way, it's still impressive that we've now ACHIEVED it.</p><p><a href="https://www.youtube.com/watch?v=oJgbqcF4sBY" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">youtube.com/watch?v=oJgbqcF4sB</span><span class="invisible">Y</span></a></p><p><a href="https://mastodon.scot/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://mastodon.scot/tags/ArtificialIntelligence" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArtificialIntelligence</span></a> <a href="https://mastodon.scot/tags/AlignmentProblem" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AlignmentProblem</span></a> <a href="https://mastodon.scot/tags/Alignment" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Alignment</span></a> <a href="https://mastodon.scot/tags/Misalignment" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Misalignment</span></a> <a href="https://mastodon.scot/tags/Hacking" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Hacking</span></a></p>
Forty Two Kay<p>Well… great.</p><p>“In this report we argue that AI systems capable of large scale scientific research will likely pursue unwanted goals and this will lead to catastrophic outcomes. We argue this is the default outcome, even with significant countermeasures, given the current trajectory of AI development.”</p><p><a href="https://mastodon.world/tags/ai" class="mention hashtag" rel="tag">#<span>ai</span></a> <a href="https://mastodon.world/tags/misalignment" class="mention hashtag" rel="tag">#<span>misalignment</span></a> </p><p><a href="https://www.alignmentforum.org/posts/GfZfDHZHCuYwrHGCd/without-fundamental-advances-misalignment-and-catastrophe" target="_blank" rel="nofollow noopener" translate="no"><span class="invisible">https://www.</span><span class="ellipsis">alignmentforum.org/posts/GfZfD</span><span class="invisible">HZHCuYwrHGCd/without-fundamental-advances-misalignment-and-catastrophe</span></a></p>
Victoria Stuart 🇨🇦 🏳️‍⚧️<p>... [click here, scroll up to see full thread]</p><p><a href="https://mastodon.social/tags/rationality" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>rationality</span></a> <a href="https://mastodon.social/tags/ethics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ethics</span></a> <a href="https://mastodon.social/tags/ReflexiveReasoning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ReflexiveReasoning</span></a> <a href="https://mastodon.social/tags/DeliberateReasoning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DeliberateReasoning</span></a> <a href="https://mastodon.social/tags/MathematicalReasoning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MathematicalReasoning</span></a> <a href="https://mastodon.social/tags/Minerva" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Minerva</span></a> <a href="https://mastodon.social/tags/FormalReasoning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FormalReasoning</span></a> <a href="https://mastodon.social/tags/knowledge" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>knowledge</span></a> <a href="https://mastodon.social/tags/WorldKnowledge" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>WorldKnowledge</span></a> <a href="https://mastodon.social/tags/modeling" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>modeling</span></a> <a href="https://mastodon.social/tags/SituationModeling" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SituationModeling</span></a> <a href="https://mastodon.social/tags/SocialReasoning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SocialReasoning</span></a> <a href="https://mastodon.social/tags/infohazardous" class="mention hashtag" rel="nofollow noopener" 
target="_blank">#<span>infohazardous</span></a> <a href="https://mastodon.social/tags/grounding" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>grounding</span></a> <a href="https://mastodon.social/tags/ContinuousGrounding" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ContinuousGrounding</span></a> <a href="https://mastodon.social/tags/misalignment" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>misalignment</span></a> <a href="https://mastodon.social/tags/transformers" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>transformers</span></a> <a href="https://mastodon.social/tags/GPT" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>GPT</span></a> <a href="https://mastodon.social/tags/GPT3" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>GPT3</span></a> <a href="https://mastodon.social/tags/GPT4" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>GPT4</span></a> <a href="https://mastodon.social/tags/VLM" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VLM</span></a> <a href="https://mastodon.social/tags/VisualLanguageModels" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VisualLanguageModels</span></a> <a href="https://mastodon.social/tags/causality" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>causality</span></a> <a href="https://mastodon.social/tags/semantics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>semantics</span></a> <a href="https://mastodon.social/tags/agents" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>agents</span></a> <a href="https://mastodon.social/tags/EmbodiedAgents" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>EmbodiedAgents</span></a> <a href="https://mastodon.social/tags/curious" class="mention hashtag" rel="nofollow noopener" 
target="_blank">#<span>curious</span></a> <a href="https://mastodon.social/tags/curiousity" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>curiousity</span></a> <a href="https://mastodon.social/tags/intent" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>intent</span></a> <a href="https://mastodon.social/tags/intentionality" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>intentionality</span></a> <a href="https://mastodon.social/tags/inference" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>inference</span></a> <a href="https://mastodon.social/tags/ActiveInference" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ActiveInference</span></a> <a href="https://mastodon.social/tags/metaphysics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>metaphysics</span></a> <a href="https://mastodon.social/tags/distraction" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>distraction</span></a></p>
Adnan<p>As eye-opening as this video by <a href="https://mstdn.party/tags/Vox" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Vox</span></a> is, I find the comment section to be more enlightening and heartbreaking than I could have ever imagined. </p><p><a href="https://www.youtube.com/watch?v=eMjqJKviDBo" rel="nofollow noopener" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">youtube.com/watch?v=eMjqJKviDB</span><span class="invisible">o</span></a></p><p><a href="https://mstdn.party/tags/Children" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Children</span></a> <a href="https://mstdn.party/tags/Career" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Career</span></a> <a href="https://mstdn.party/tags/Misalignment" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Misalignment</span></a></p>
Nicolas Zahn<p>The bigger threat than <a href="https://infosec.exchange/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://infosec.exchange/tags/Misalignment" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Misalignment</span></a> is <a href="https://infosec.exchange/tags/Coder" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Coder</span></a> <a href="https://infosec.exchange/tags/Misalignment" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Misalignment</span></a> 😉 <a href="https://infosec.exchange/tags/ITIncentives" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ITIncentives</span></a> <br>---<br>RT @xkcd<br>Code Lifespan <a href="http://xkcd.com/2730" rel="nofollow noopener" target="_blank"><span class="invisible">http://</span><span class="">xkcd.com/2730</span><span class="invisible"></span></a> <br><a href="https://twitter.com/xkcd/status/1619007255327961088" rel="nofollow noopener" target="_blank"><span class="invisible">https://</span><span class="ellipsis">twitter.com/xkcd/status/161900</span><span class="invisible">7255327961088</span></a></p>