Mastodon

1 post1 participant0 posts today

ResearchBuzz: FirehoseSearch Engine Journal: AI Researchers Warn: Hallucinations Persist In Leading AI Models. “Despite billions in research investment, AI factuality remains largely unsolved. According to the report, even the most advanced models from OpenAI and Anthropic ‘correctly answered less than half of the questions’ on new benchmarks like SimpleQA, a collection of straightforward questions.”<a href="https://rbfirehose.com/2025/04/01/ai-researchers-warn-hallucinations-persist-in-leading-ai-models-search-engine-journal/" class="" rel="nofollow noopener noreferrer" target="_blank">https://rbfirehose.com/2025/04/01/ai-researchers-warn-hallucinations-persist-in-leading-ai-models-search-engine-journal/</a>

ResearchBuzz: FirehosePoynter: Opinion | An Italian newspaper launched a generative AI experiment. It’s not going well. “I am usually a fan of bold experiments like this, and bristle at the hand-wringing that often comes in response to emerging technology. But, so far, it’s a case study of how bad AI is at writing the news. Along with the misspellings in images, an article about Donald Trump’s falsehoods has […]<a href="https://rbfirehose.com/2025/03/23/opinion-an-italian-newspaper-launched-a-generative-ai-experiment-its-not-going-well-poynter/" class="" rel="nofollow noopener noreferrer" target="_blank">https://rbfirehose.com/2025/03/23/opinion-an-italian-newspaper-launched-a-generative-ai-experiment-its-not-going-well-poynter/</a>

ResearchBuzz: FirehoseTechCrunch: ChatGPT hit with privacy complaint over defamatory hallucinations. “OpenAI is facing another privacy complaint in Europe over its viral AI chatbot’s tendency to hallucinate false information — and this one might prove tricky for regulators to ignore. Privacy rights advocacy group Noyb is supporting an individual in Norway who was horrified to find ChatGPT returning made-up […]<a href="https://rbfirehose.com/2025/03/21/techcrunch-chatgpt-hit-with-privacy-complaint-over-defamatory-hallucinations/" class="" rel="nofollow noopener noreferrer" target="_blank">https://rbfirehose.com/2025/03/21/techcrunch-chatgpt-hit-with-privacy-complaint-over-defamatory-hallucinations/</a>

ResearchBuzz: FirehoseTechCrunch: Manus probably isn’t China’s second ‘DeepSeek moment’. “Alexander Doria, the co-founder of AI startup Pleias, said in a post on X that he encountered error messages and endless loops while testing Manus. Other X users pointed out that Manus makes mistakes on factual questions and doesn’t consistently cite its work — and often misses information that’s easily found […]<a href="https://rbfirehose.com/2025/03/11/techcrunch-manus-probably-isnt-chinas-second-deepseek-moment/" class="" rel="nofollow noopener noreferrer" target="_blank">https://rbfirehose.com/2025/03/11/techcrunch-manus-probably-isnt-chinas-second-deepseek-moment/</a>

ResearchBuzz: FirehoseNiemanLab: AI search engines fail to produce accurate citations in over 60% of tests, according to new Tow Center study. “Last summer, I reported that ChatGPT frequently hallucinated fake URLs to news sites, even to articles from OpenAI’s own publishing partners. Research has continued to show that these citation issues are not limited to ChatGPT, but are in fact chronic across the AI […]<a href="https://rbfirehose.com/2025/03/11/niemanlab-ai-search-engines-fail-to-produce-accurate-citations-in-over-60-of-tests-according-to-new-tow-center-study/" class="" rel="nofollow noopener noreferrer" target="_blank">https://rbfirehose.com/2025/03/11/niemanlab-ai-search-engines-fail-to-produce-accurate-citations-in-over-60-of-tests-according-to-new-tow-center-study/</a>

ResearchBuzz: FirehoseBBC: Grandmother gets X-rated message after Apple AI fail. “Louise Littlejohn, 66, received a voicemail message on Wednesday from a Lookers Land Rover garage in Motherwell inviting her to an event. An artificial intelligence (AI) powered service offered by Apple turned it into a text message which – to her surprise – asked if she been ‘able to have sex’ before calling her a ‘piece of ****’.”<a href="https://rbfirehose.com/2025/03/09/bbc-grandmother-gets-x-rated-message-after-apple-ai-fail/" class="" rel="nofollow noopener noreferrer" target="_blank">https://rbfirehose.com/2025/03/09/bbc-grandmother-gets-x-rated-message-after-apple-ai-fail/</a>

WetHat💦Key Points: ➡️ AI errors running amok like a robot revolution, featuring Google search gaffes and Comcast calendar calamities. ➡️ Flubstitution: when AIs confuse template placeholders with Mad Libs. ➡️ Case study in how open-source session recording gets downright mysterious. ➡️ Dell's UI patterns? More like antipatterns.<a href="https://thedailywtf.com/articles/artificial-average-intelligence" rel="nofollow noopener noreferrer" translate="no" target="_blank">https://thedailywtf.com/articles/artificial-average-intelligence</a><a href="https://fosstodon.org/tags/ArtificialIntelligence" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#ArtificialIntelligence</a> <a href="https://fosstodon.org/tags/AIErrors" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#AIErrors</a> <a href="https://fosstodon.org/tags/AI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#AI</a> <a href="https://fosstodon.org/tags/ComputingHumor" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#ComputingHumor</a> <a href="https://fosstodon.org/tags/WTF" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#WTF</a>

rednikkiHey, here's Copilot being all inaccurate on searches I did about how long journals should retain copies of a peer review. <a href="https://toot.boston/tags/AIErrors" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#AIErrors</a>

eicker.news tech news»<a href="https://eicker.news/tags/Anthropic" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#Anthropic</a>'s new <a href="https://eicker.news/tags/Citations" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#Citations</a> feature aims to reduce <a href="https://eicker.news/tags/AIerrors" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#AIerrors</a>: allows its <a href="https://eicker.news/tags/AImodels" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#AImodels</a> to provide references to “the exact sentences and passages” from docs they use to generate responses.« <a href="https://techcrunch.com/2025/01/23/anthropics-new-citations-feature-aims-to-reduce-ai-errors/?eicker.news" rel="nofollow noopener noreferrer" translate="no" target="_blank">https://techcrunch.com/2025/01/23/anthropics-new-citations-feature-aims-to-reduce-ai-errors/?eicker.news</a> <a href="https://eicker.news/tags/tech" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#tech</a> <a href="https://eicker.news/tags/media" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#media</a>

ResearchBuzz: FirehoseWashington State University: ChatGPT errors show it cannot replace finance professionals, yet. “While large language models like ChatGPT can do well when choosing multiple-choice answers on financial licensing exams, they falter when dealing with more nuanced tasks. A Washington State University-led study analyzed more than 10,000 responses to financial exam questions by the artificial […]<a href="https://rbfirehose.com/2024/12/19/washington-state-university-chatgpt-errors-show-it-cannot-replace-finance-professionals-yet/" class="" rel="nofollow noopener noreferrer" target="_blank">https://rbfirehose.com/2024/12/19/washington-state-university-chatgpt-errors-show-it-cannot-replace-finance-professionals-yet/</a>

IT NewsChatGPT goes temporarily “insane” with unexpected outputs, spooking users - Enlarge (credit: Benj Edwards / Getty Images) On Tuesday, Chat... - <a href="https://arstechnica.com/?p=2004783" rel="nofollow noopener noreferrer" translate="no" target="_blank">https://arstechnica.com/?p=2004783</a> <a href="https://schleuss.online/tags/largelanguagemodels" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#largelanguagemodels</a> <a href="https://schleuss.online/tags/machinelearning" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#machinelearning</a> <a href="https://schleuss.online/tags/aitransparency" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#aitransparency</a> <a href="https://schleuss.online/tags/openweightsai" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#openweightsai</a> <a href="https://schleuss.online/tags/textsynthesis" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#textsynthesis</a> <a href="https://schleuss.online/tags/gpt" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#gpt</a>-4-turbo <a href="https://schleuss.online/tags/aierrors" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#aierrors</a> <a href="https://schleuss.online/tags/aisafety" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#aisafety</a> <a href="https://schleuss.online/tags/bingchat" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#bingchat</a> <a href="https://schleuss.online/tags/chatgpt" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#chatgpt</a> <a href="https://schleuss.online/tags/chatgtp" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#chatgtp</a> <a href="https://schleuss.online/tags/gpt" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#gpt</a>-3.5 <a href="https://schleuss.online/tags/biz" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#biz</a>⁢ <a href="https://schleuss.online/tags/openai" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#openai</a> <a href="https://schleuss.online/tags/gpt" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#gpt</a>-4 <a href="https://schleuss.online/tags/api" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#api</a> <a href="https://schleuss.online/tags/ai" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#ai</a>

Recent searches

Search options

Administered by:

Server stats:

#aierrors