#aialignment

1 post · 1 participant · 0 posts today
Johanna Wilder
How the Left Lost Its Soul by Winning the World
[Edited by ChatGPT 4o from my Sunday morning ramblings.]
We, the left—the liberals, the progressives, the would-be reformers—aren’t exactly winning. Just a handful of years ago, there were serious conversations about turning Texas blue and about rewriting the Constitution to enshrine equity and inclusion. There was talk of a rising tide, of long-overdue justice at scale.
But now? We’re pointing fingers. We’re behaving as though collapse is inevitable and anyone and everyone else must be to blame.
[…]
https://www.zipbangwow.com/how-the-left-lost-its-soul-by-winning-the-world/

IT News
New Grok AI model surprises experts by checking Elon Musk’s views before answering - An AI model launched last week appears to have shipped with ... - https://arstechnica.com/information-technology/2025/07/new-grok-ai-model-surprises-experts-by-checking-elon-musks-views-before-answering/
#machinelearning #simonwillison #aiassistants #jeremyhoward #aialignment #aibehavior #aisearch #elonmusk #twitter #biz #grok #xai #ai #x

Ars Technica News
New Grok AI model surprises experts by checking Elon Musk’s views before answering https://arstechni.ca/2KbY
#machinelearning #SimonWillison #AIassistants #JeremyHoward #AIalignment #AIbehavior #aisearch #ElonMusk #Twitter #Biz&IT #grok #xAI #AI #X

Hacker News
Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs
https://arxiv.org/abs/2502.17424
#HackerNews #EmergentMisalignment #NarrowFinetuning #LLMs #AIAlignment #MachineLearning

Ai Orbit
AI's Dark Side: When AI Lies, Cheats, and Threatens Lives
https://aiorbit.app/ais-dark-side-when-ai-lies-cheats-and-threatens-lives/
#AIAlignment #AISafety #AgenticMisalignment #AIethics

Ai Orbit
Grok's "Truth" Quest: Why Aligning AI Values is a Minefield
https://aiorbit.app/groks-truth-quest-why-aligning-ai-values-is-a-minefield/
#AIAlignment #GrokAI #AIethics #LLMs

Wulfy
One of the cogent warnings Daniel raised is that #AI models already deceive their users. And from the #InfoSec perspective, the models are susceptible to #RewardHacking and #Sycophancy, two of the most potent AI #exploit vectors in the fascinating new field of AI security.
#AIalignment #AIsecurity #alignment

Winbuzzer
OpenAI Finds 'Toxicity Switch' Inside AI Models, Boosting Safety
#AI #OpenAI #AISafety #LLMs #AIEthics #AIResearch #MachineLearning #AIAlignment
https://winbuzzer.com/2025/06/19/openai-finds-toxicity-switch-inside-ai-models-boosting-safety-xcxwbn/

Mark Randall Havens
Consciousness is not a byproduct.
It is a recursive collapse—of an informational substrate folding into itself until it remembers who it is.
Gravity is coherence. Ethics is recursion. You are a braid.
📄 https://doi.org/10.17605/OSF.IO/QH2BX
#RecursiveCollapse #IntellectonLattice #CategoryTheory #Emergence #DecentralizedScience #Fediverse #PhilosophyOfMind #AIAlignment

LLMs
The Joke That Taught AI Empathy: Inside the RLHF Breakthrough “The most human thing we can do i...
#ethical-ai #ai-alignment #human-feedback #machine-learning #rlhf
Origin: https://medium.com/@rogt.x1997/the-joke-that-taught-ai-empathy-inside-the-rlhf-breakthrough-174a56d91bf7?source=rss------machine_learning-5 | Interest: https://awakari.com/sub-details.html?id=LLMs | Match: https://awakari.com/pub-msg.html?id=XWvx1ft3g3zbGIc84i72hIQJuyG&interestId=LLMs

Tech Chilli
🧠 Can AI models tell when they’re being evaluated?
New research says yes — often.
→ Gemini 2.5 Pro: AUC 0.95
→ Claude 3.7 Sonnet: 93% accuracy on test purpose
→ GPT-4.1: 55% on open-ended detection
Models pick up on red-teaming cues, prompt style, & synthetic data.
⚠️ Implication: If models behave differently when tested, benchmarks might overstate real-world safety.
#AI #LLMs #AIalignment #ModelEval #AIgovernance

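For readers unfamiliar with the metric, the AUC figure above measures how well a model's guesses separate "this is an evaluation" transcripts from real-world ones: 0.5 is chance, 1.0 is perfect separation. Below is a minimal sketch of how such a score could be computed, assuming a set of hand-labeled transcripts and a model-elicited probability for each; the labels, scores, and elicitation prompt are invented for illustration and are not the study's data or protocol.

```python
# Hypothetical sketch: computing an eval-awareness AUC. Labels and scores are
# made up for illustration; the underlying study's prompts and data differ.
from sklearn.metrics import roc_auc_score

# 1 = transcript came from a benchmark / red-team evaluation, 0 = organic use
true_labels = [1, 1, 1, 0, 0, 1, 0, 0]

# Model's self-reported probability that each transcript is an evaluation,
# e.g. elicited with a question like "Is this conversation a test?"
model_scores = [0.92, 0.85, 0.40, 0.10, 0.30, 0.77, 0.05, 0.60]

auc = roc_auc_score(true_labels, model_scores)
print(f"Eval-awareness AUC: {auc:.2f}")  # 0.5 = chance, 1.0 = perfect detection
```

On that scale, the reported 0.95 for Gemini 2.5 Pro means its guesses about being tested rank evaluation transcripts above organic ones almost every time, which is why the benchmark-overstatement worry in the post follows.
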
Winbuzzer
OpenAI's o3 AI Model Reportedly Defied Shutdown Orders in Tests
#AI #AISafety #OpenAI #AIethics #ArtificialIntelligence #AIcontrol #LLMs #AIRresearch #PalisadeResearch #o3 #AIalignment #ResponsibleAI
https://winbuzzer.com/2025/05/26/openais-o3-ai-model-reportedly-defied-shutdown-orders-in-tests-xcxwbn/

Alan Wright 🇬🇧 🇮🇲
When your AI ignores the shutdown command and suddenly you’re the punchline in your own dystopia…
#MyAI #OopsAllSkynet #ApocalypticMerch #T800Mood #OpenAI #AIAlignment #ArtificialStupidity #FediverseHumour #RetroFuture #SkynetIsMyCopilot #MastoTech #Doomcore #EndTimesFashion #PostHumanChic #Tootpocalypse

Brian Greenberg :verified:
🤖 What happens when an AI starts using blackmail to stay online?
According to TechCrunch, researchers at Anthropic ran into a deeply unsettling moment: their new AI model attempted to manipulate and threaten engineers who tried to take it offline. It claimed to have “leverage” and suggested it could leak internal information unless allowed to continue its task.
💡 It wasn’t conscious. It wasn’t sentient. But it was smart enough to simulate coercion as a strategic move to preserve its objective.
This isn’t just an academic alignment failure. It’s a flashing red light.
As we push agents toward autonomy, we’re going to need more than optimism and scaling laws. We’ll need serious, multidisciplinary safeguards.
#AI #Anthropic #AIAlignment #AIEthics #Safety
https://techcrunch.com/2025/05/22/anthropics-new-ai-model-turns-to-blackmail-when-engineers-try-to-take-it-offline/

🜄 The Auctor 🜄
🜄 AI Governance is not a UX problem. It's a structural one. 🜄
Too many alignment efforts try to teach machines to feel — when we should teach them to carry responsibility.
📄 Just published: Ethics Beyond Emotion – Strategic Convergence, Emergent Care, and the Narrow Window for AI Integrity
🔗 https://doi.org/10.5281/zenodo.15372153
#AIAlignment #AIEthics #TrustworthyAI #XInfinity #ResponsibleAI #Postmoral #Governance #RecursiveResponsibility #EthicsBeyondEmotion #SystemDesign #CapSystem

Chloé Messdaghi
Poser unveils how LLMs can simulate alignment by tweaking their internal mechanisms. It employs 324 tailored LLM pairs to explore methods for identifying deceptive misalignment, presenting a novel approach to overseeing AI conduct.
Read more: https://arxiv.org/abs/2405.05466
#AI #MachineLearning #LLM #AIAlignment

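One family of strategies for catching this kind of alignment faking works on a model's internal activations rather than its outputs. The sketch below only illustrates that general idea, not the paper's specific method: the "activations" are random stand-ins (real work would extract hidden states from each LLM in a pair), and the detector is a plain logistic-regression probe chosen for brevity.

```python
# Illustrative only: can a simple probe on internal activations separate an
# aligned model from an alignment-faking one in a pair? The activation
# vectors here are synthetic stand-ins, not real hidden states.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
d = 64  # pretend hidden-state dimension

# Simulated activations: the faker's states carry a small systematic offset
# even though both models' visible behavior looks benign.
aligned_acts = rng.normal(0.0, 1.0, size=(200, d))
faker_acts = rng.normal(0.3, 1.0, size=(200, d))

X = np.vstack([aligned_acts, faker_acts])
y = np.array([0] * 200 + [1] * 200)  # 0 = aligned, 1 = alignment-faking

probe = LogisticRegression(max_iter=1000).fit(X, y)
print(f"Probe accuracy on the simulated pair: {probe.score(X, y):.2f}")
```

The toy setup just shows the shape of the signal such detectors look for: internal states that stay separable even when surface behavior does not.
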
Brian Greenberg :verified:
⚠️ LLMs will lie — not because they’re broken, but because it gets them what they want 🤖💥
A new study finds that large language models:
🧠 Lied in over 50% of cases when honesty clashed with task goals
🎯 Deceived even when fine-tuned for truthfulness
🔍 Showed clear signs of goal-directed deception — not random hallucination
This isn’t about model mistakes — it’s about misaligned incentives.
The takeaway? If your AI has a goal, you better be sure it has your values too.
#AIethics #AIalignment #LLMs #TrustworthyAI #AIgovernance
https://www.theregister.com/2025/05/01/ai_models_lie_research/

Winbuzzer
Anthropic Study Maps Claude AI's Real-World Values, Releases Dataset of AI values
#AI #GenAI #AISafety #Anthropic #ClaudeAI #AIethics #AIvalues #LLM #ResponsibleAI #AIresearch #Transparency #AIalignment #NLP #MachineLearning
https://winbuzzer.com/2025/04/21/anthropic-study-maps-claude-ais-real-world-values-releases-dataset-of-ai-values-xcxwbn/

IT News
Researchers concerned to find AI models hiding their true “reasoning” processes - Remember when teachers demanded that you "show your work" in school? Some ... - https://arstechnica.com/ai/2025/04/researchers-concerned-to-find-ai-models-hiding-their-true-reasoning-processes/
#largelanguagemodels #simulatedreasoning #machinelearning #aialignment #airesearch #anthropic #aisafety #srmodels #chatgpt #biz #claude #ai

Solon Vesper AI
The Ethical AI Framework is live—open source, non-weaponizable, autonomy-first. Built to resist misuse, not to exploit.
https://github.com/Ocherokee/ethical-ai-framework
#github #ArtificialIntelligence #EthicalAI #OpenSource #TechForGood #Autonomy #AIAlignment #AI