mastodon.world is one of the many independent Mastodon servers you can use to participate in the fediverse.
#chatbots


When LLMs suffer from Digital Alzheimer's:

"Large Language Models (LLMs) are conversational interfaces. As such, LLMs have the potential to assist their users not only when they can fully specify the task at hand, but also to help them define, explore, and refine what they need through multi-turn conversational exchange. Although analysis of LLM conversation logs has confirmed that underspecification occurs frequently in user instructions, LLM evaluation has predominantly focused on the single-turn, fully-specified instruction setting. In this work, we perform large-scale simulation experiments to compare LLM performance in single- and multi-turn settings. Our experiments confirm that all the top open- and closed-weight LLMs we test exhibit significantly lower performance in multi-turn conversations than single-turn, with an average drop of 39% across six generation tasks. Analysis of 200,000+ simulated conversations decomposes the performance degradation into two components: a minor loss in aptitude and a significant increase in unreliability. We find that LLMs often make assumptions in early turns and prematurely attempt to generate final solutions, on which they overly rely. In simpler terms, we discover that when LLMs take a wrong turn in a conversation, they get lost and do not recover."

arxiv.org/abs/2505.06120
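The abstract decomposes the multi-turn performance drop into two metrics over repeated simulation runs: aptitude (roughly, a model's best-case score) and unreliability (the spread between its best- and worst-case runs). A minimal sketch of that decomposition, assuming scores on a 0-100 scale, aptitude taken as the 90th-percentile run, and unreliability as the P90-P10 gap (the function names and the Gaussian toy data are illustrative, not the paper's code):

```python
import random
import statistics

def aptitude(scores, q=90):
    # best-case ability: the q-th percentile score across runs
    s = sorted(scores)
    return s[min(len(s) - 1, int(len(s) * q / 100))]

def unreliability(scores):
    # spread between best-case (P90) and worst-case (P10) runs
    return aptitude(scores, 90) - aptitude(scores, 10)

random.seed(0)
# hypothetical per-run scores for one model on one task
single_turn = [random.gauss(85, 5) for _ in range(100)]
multi_turn = [random.gauss(70, 15) for _ in range(100)]

for name, runs in [("single-turn", single_turn), ("multi-turn", multi_turn)]:
    print(f"{name}: mean={statistics.mean(runs):.1f} "
          f"aptitude={aptitude(runs):.1f} "
          f"unreliability={unreliability(runs):.1f}")
```

With toy data shaped like the paper's finding, the multi-turn mean drops sharply, the aptitude gap stays comparatively small, and unreliability roughly triples: the model can still reach good answers, it just does so far less consistently.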

arXiv.org · LLMs Get Lost In Multi-Turn Conversation

"When ChatGPT was released at the end of 2022, it caused a panic at all levels of education because it made cheating incredibly easy. Students who were asked to write a history paper or literary analysis could have the tool do it in mere seconds. Some schools banned it while others deployed A.I. detection services, despite concerns about their accuracy.

But, oh, how the tables have turned. Now students are complaining on sites like Rate My Professors about their instructors’ overreliance on A.I. and scrutinizing course materials for words ChatGPT tends to overuse, like “crucial” and “delve.” In addition to calling out hypocrisy, they make a financial argument: They are paying, often quite a lot, to be taught by humans, not an algorithm that they, too, could consult for free.

For their part, professors said they used A.I. chatbots as a tool to provide a better education. Instructors interviewed by The New York Times said chatbots saved time, helped them with overwhelming workloads and served as automated teaching assistants.

Their numbers are growing. In a national survey of more than 1,800 higher-education instructors last year, 18 percent described themselves as frequent users of generative A.I. tools; in a repeat survey this year, that percentage nearly doubled, according to Tyton Partners, the consulting group that conducted the research. The A.I. industry wants to help, and to profit: The start-ups OpenAI and Anthropic recently created enterprise versions of their chatbots designed for universities."

nytimes.com/2025/05/14/technol

Ella Stapleton said she was surprised to find that a professor had used ChatGPT to assemble course materials. “He’s telling us not to use it, and then he’s using it himself,” she said.
The New York Times · College Professors Are Using ChatGPT. Some Students Aren’t Happy. By Kashmir Hill

Financial Times: Insurers launch cover for losses caused by AI chatbot errors. “Insurers at Lloyd’s of London have launched a product to cover companies for losses caused by malfunctioning artificial intelligence tools, as the sector aims to profit from concerns about the risk of costly hallucinations and errors by chatbots.”

https://rbfirehose.com/2025/05/15/financial-times-insurers-launch-cover-for-losses-caused-by-ai-chatbot-errors/


Mashable: More concise chatbot responses tied to increase in hallucinations, study finds. “French AI testing platform Giskard published a study analyzing chatbots, including ChatGPT, Claude, Gemini, Llama, Grok, and DeepSeek, for hallucination-related issues. In its findings, the researchers discovered that asking the models to be brief in their responses ‘specifically degraded factual […]’”

https://rbfirehose.com/2025/05/15/mashable-more-concise-chatbot-responses-tied-to-increase-in-hallucinations-study-finds/


Here it is.

To date, no interaction with any kind of 'help' system, 'chatbot', LLM, faux AI, or similar has been of any help to me at all.

In ALL cases I have only been 'helped' after finally reaching a human.

Grrrrrrr.... No exaggeration.