gtbarry<p>A jargon-free explanation of how AI large language models work</p><p>Word vectors - Humans represent words with letters. Language models use a long list of numbers</p><p>Each layer of an LLM is a transformer - Each layer takes a sequence of inputs—each word—and adds information</p><p>Feed-forward layers predict the next word</p><p><a href="https://mastodon.social/tags/LLM" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LLM</span></a> <a href="https://mastodon.social/tags/neuralnetwork" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>neuralnetwork</span></a> <a href="https://mastodon.social/tags/artificialintelligence" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>artificialintelligence</span></a> <a href="https://mastodon.social/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a> <a href="https://mastodon.social/tags/generativeAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>generativeAI</span></a> <a href="https://mastodon.social/tags/WordVectors" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>WordVectors</span></a> <a href="https://mastodon.social/tags/transformer" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>transformer</span></a> <a href="https://mastodon.social/tags/FeedForward" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FeedForward</span></a> <a href="https://mastodon.social/tags/data" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>data</span></a> <a href="https://mastodon.social/tags/bigdata" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>bigdata</span></a> <a href="https://mastodon.social/tags/tech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>tech</span></a> <a href="https://mastodon.social/tags/innovation" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>innovation</span></a></p><p><a href="https://arstechnica.com/science/2023/07/a-jargon-free-explanation-of-how-ai-large-language-models-work/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">arstechnica.com/science/2023/0</span><span class="invisible">7/a-jargon-free-explanation-of-how-ai-large-language-models-work/</span></a></p>