Mastodon.world admins @mwadmin

Continued thread

**➴➴➴Æ🜔Ɲ.Ƈꭚ⍴𝔥єɼ** @AeonCypher@lgbtqia.space · Jul 16

➴➴➴Æ🜔Ɲ.Ƈꭚ⍴𝔥єɼ @AeonCypher@lgbtqia.space

Pre #2020: #Factorizing Tools

These #AI wre #DeepLearning breakthroughs. #Word2Vec, #DeepDream and #AlphaGo solved novel, previously unsolvable, problems.

If you weren't in the field, you might not think these were AI, and #GPT 2 might have surprised you.

**Habr** @habr@zhub.link · Jun 22

Jun 22

Habr @habr@zhub.link

[Перевод] Простой механизм поиска с нуля

Мы с Крисом недавно «с нуля» буквально за пару часов создали механизм поиска для моего блога. Основную часть проделал именно Крис, так как до этого с word2vec я был знаком лишь отдалённо. Разработанный нами поисковик основывается на векторных представлениях (эмбеддингах) слов. Принцип здесь следующий. Функция получает слово и отображает его в N-мерное пространство (в данном случае N=300 ), где каждое измерение отражает определённый оттенок смысла. Вот хорошая статья (англ.) о том, как обучить собственную модель word2vec, и её внутреннем устройстве. Суть работы созданного нами поиска заключается в преобразовании моих статей, а точнее составляющих их слов, в эмбеддинги, сохраняемые в общем пространстве. Затем при выполнении конкретного поиска текст его запроса преобразуется аналогичным образом и сопоставляется с векторами статей. В результате этого сопоставления, используя метрику косинусного сходства , мы ранжируем статьи по их релевантности запросу. Уравнение ниже может показаться пугающим, но в нём говорится, что косинусное сходство, представляющее косинус угла между двух векторов cos(theta) , определяется в виде скалярного произведения, поделённого на произведение величин каждого вектора. Разберём всё это подробнее.

https://habr.com/ru/companies/ruvds/articles/920174/

ХабрПростой механизм поиска с нуляМы с Крисом недавно «с нуля» буквально за пару часов создали механизм поиска для моего блога. Основную часть проделал именно Крис, так как до этого с word2vec я был знаком лишь отдалённо....

#ruvds_перевод #поиск #word2vec

cs @cs@mastodon.sdf.org · May 20

May 20

cs @cs@mastodon.sdf.org

This was shared by someone recently, but I can't find it now. Otherwise, I would have boosted or h/t the original post.
A simple search engine from scratch
#search #word2vec
https://bernsteinbear.com/blog/simple-search/

Max Bernstein · May 20A simple search engine from scratch*

More from

Max B

**Habr** @habr@zhub.link · May 13

May 13

Habr @habr@zhub.link

История развития языковых моделей: ч. 2, от ChatGPT до рассуждающего режима

Доброго времени суток, «Хабр»! В предыдущей части мы рассмотрели историю языковых моделей от робких шагов Маркова до долгой краткосрочной памяти. Сегодня мы продолжим, пройдемся по ключевым архитектурам последних лет и разберём, как модели научились интерпретировать контекст, предсказывать и даже спорить логически. Пристегните токены — вход в зону трансформаций!

https://habr.com/ru/companies/bothub/articles/909100/

ХабрИстория развития языковых моделей: ч. 2, от ChatGPT до рассуждающего режимаДоброго времени суток, «Хабр»! В предыдущей части мы рассмотрели историю языковых моделей от робких шагов Маркова до долгой краткосрочной памяти. Сегодня мы продолжим, пройдемся по ключевым...

#word2vec #трансформер #токенизация

**LLMs** @LLMs@activitypub.awakari.com · May 13

May 13

LLMs @LLMs@activitypub.awakari.com

История развития языковых моделей: ч. 2, от ChatGPT до рас...

https://habr.com/ru/companies/bothub/articles/909100/?utm_source=habrahabr&utm_medium=rss&utm_campaign=909100

#word2vec #трансформер #токенизация #bert #chatgpt #t5

Result Details

**Harald Sack** @lysander07@sigmoid.social · May 11

May 11

Harald Sack @lysander07@sigmoid.social

Next stop in our NLP timeline is 2013, the introduction of low dimensional dense word vectors - so-called "word embeddings" - based on distributed semantics, as e.g. word2vec by Mikolov et al. from Google, which enabled representation learning on text.

T. Mikolov et al. (2013). Efficient Estimation of Word Representations in Vector Space.
https://arxiv.org/abs/1301.3781

#NLP #AI #wordembeddings #word2vec #ise2025 #historyofscience @fiz_karlsruhe @fizise @tabea @sourisnumerique @enorouzi

**Habr** @habr@zhub.link · Apr 7

Apr 7

Habr @habr@zhub.link

Ведущий разработчик ChatGPT и его новый проект — Безопасный Сверхинтеллект

Многие знают об Илье Суцкевере только то, что он выдающийся учёный и программист, родился в СССР, соосновал OpenAI и входит в число тех, кто в 2023 году изгнал из компании менеджера Сэма Альтмана. А когда того вернули, Суцкевер уволился по собственному желанию в новый стартап Safe Superintelligence («Безопасный Сверхинтеллект»). Илья Суцкевер действительно организовал OpenAI вместе с Маском, Брокманом, Альтманом и другими единомышленниками, причём был главным техническим гением в компании. Ведущий учёный OpenAI сыграл ключевую роль в разработке ChatGPT и других продуктов. Сейчас Илье всего 38 лет — совсем немного для звезды мировой величины.

https://habr.com/ru/companies/ruvds/articles/892646/

#Илья_Суцкевер #Ilya_Sutskever #OpenAI #10x_engineer #AlexNet #Safe_Superintelligence #ImageNet #неокогнитрон #GPU #GPGPU #CUDA #компьютерное_зрение #LeNet #Nvidia_GTX 580 #DNNResearch #Google_Brain #Алекс_Крижевски #Джеффри_Хинтон #Seq2seq #TensorFlow #AlphaGo #Томаш_Миколов #Word2vec #fewshot_learning #машина_Больцмана #сверхинтеллект #GPT #ChatGPT #ruvds_статьи

ХабрВедущий разработчик ChatGPT и его новый проект — Безопасный СверхинтеллектМногие знают об Илье Суцкевере только то, что он выдающийся учёный и программист, родился в СССР, соосновал OpenAI и входит в число тех, кто в 2023 году изгнал из компании менеджера Сэма Альтмана. А...

Replied in thread

**Steve Leach** @stevenaleach@sigmoid.social · Jan 30

Jan 30

Steve Leach @stevenaleach@sigmoid.social

@futurebird @krozruch That might be very interesting trained on large enough corpus... Who, for instance is exactly between A and B in "thought-space" and can they understand both sides and help bridge communication? Who is in the space further along the distance from group A and B but more extreme in both directions? Think #Word2Vec math... with writers voices and thinking. Then feed #politicians words in and see where they are relative to public thoughtspace, etc..

**mthpvg** @mthpvg@mastodon.social · Dec 22, 2024

Dec 22, 2024

mthpvg @mthpvg@mastodon.social

https://fchollet.substack.com/p/how-i-think-about-llm-prompt-engineering

"Word2vec enabled you to do basic things like plural(cat) → cats or male_to_female(king) → queen. Meanwhile LLMs can do pure magic — things like write_this_in_style_of_shakespeare(“…your poem…”) → “…new poem…”. And they contain millions of such programs."

Sparks in the Wind · Oct 9, 2023How I think about LLM prompt engineeringBy François Chollet

#AI #LLM #word2vec

**Hacker News** @ycombinator@rss-mstdn.studiofreesia.com · Sep 19, 2024

**Joachim Scharloth** @josch@chaos.social · Sep 12, 2024

Sep 12, 2024

Joachim Scharloth @josch@chaos.social

#WordEmbeddings - läuft...

#word2vec #ki #ai

**Hacker News** @ycombinator@rss-mstdn.studiofreesia.com · Aug 7, 2024

Continued thread

**Steve Leach** @stevenaleach@sigmoid.social · Jul 21, 2024

Jul 21, 2024

Steve Leach @stevenaleach@sigmoid.social

By vector math, recall #word2vec and nearest word to "king" - "man" + "woman" = "queen".

Learn a #vector representation for writers, philosophers, #political leaders and voices.

Map them and show a visualization of where major figures fall on this vector-space of their words.

Plug in writers who's ideas you like, average them and find the vector at the center and find others in that thought space.

Do the reverse - map voices you hate and see where they are and who's in between, etc.

**Tushar Chauhan** @tchauhan@mastodon.mit.edu · Jul 8, 2024

Jul 8, 2024

Tushar Chauhan @tchauhan@mastodon.mit.edu

LLMs and puzzles. An interesting read.

https://pivot-to-ai.com/2024/07/06/llms-can-solve-any-word-problem-as-long-as-they-can-crib-the-answer/

Apparently biology and intelligence doesn't bend to the will of storytellers.

Meanwhile, the word2vec yarn-spinning continues...

Pivot to AI · Jul 6, 2024LLMs can solve any word problem! As long as they can crib the answerAI companies keep claiming that LLMs do reasoning. The AI can think! The AI can figure out word puzzles! The AI can pass the LSAT! You’re fired! So our good friend Diz tested this by throwing logic…

#llm #word2vec #ai

**Habr** @habr@zhub.link · Jun 7, 2024

Jun 7, 2024

Habr @habr@zhub.link

Что такое векторизация текста в NLP и какая она бывает: One-hot encoding, Bag of words, TF-IDF, Word2Vec, BERT и другие

Привет, Хабр! Меня зовут Александр Троицкий, я автор канала AI для чайников , и в этой статье я расскажу про разные способы векторизации текстов. Всем привет! Вдохновившись прикольной и понятной статьей на английском языке, и не найдя сходу чего-то похожего в русскоязычном сегменте интернета, решил написать о том, как обрабатывается текст перед тем, как на нем начинают применять разные модели ИИ. Эту статью я напишу нетехническим языком, потому что сам не технарь и не математик. Надеюсь, что она поможет узнать о NLP тем, кто не сталкивается с AI в продуктах на ежедневной основе. О чем эта статья:

https://habr.com/ru/articles/820159/

#Onehot_encoding #Bag_of_words #TFIDF #Word2Vec #BERT #NLP #nlp_(natural_language_processing) #nlpмодели

ХабрЧто такое векторизация текста в NLP и какая она бывает: One-hot encoding, Bag of words, TF-IDF, Word2Vec, BERT и другиеПривет, Хабр! Меня зовут Александр Троицкий, я автор канала AI для чайников , и в этой статье я расскажу про разные способы векторизации текстов. Всем привет! Вдохновившись прикольной и...

**GripNews** @GripNews@mastodon.social · Apr 19, 2024

Apr 19, 2024

GripNews @GripNews@mastodon.social

插圖化的Word2vec - Jay Alammar - 一次性視覺化機器學習一個概念。
➤ 詞向量的概念和應用詳細解說
✤ https://jalammar.github.io/illustrated-word2vec/
插圖化的Word2vec，由Jay Alammar講解，透過例子和圖表解釋詞向量的概念，展示其在機器學習中的重要性和應用。
+ 解釋清晰，對於機器學習新手來說很有幫助。
+ 文中的插圖和例子很生動，有助於加深對Word2vec的理解。
#機器學習 #Word2vec #詞向量 #人格特質

jalammar.github.ioThe Illustrated Word2vec Discussions: Hacker News (347 points, 37 comments), Reddit r/MachineLearning (151 points, 19 comments) Translations: Chinese (Simplified), French, Korean, Portuguese, Russian “There is in all things a pattern that is part of our universe. It has symmetry, elegance, and grace - those qualities you find always in that which the true artist captures. You can find it in the turning of the seasons, in the way sand trails along a ridge, in the branch clusters of the creosote bush or the pattern of its leaves. We try to copy these patterns in our lives and our society, seeking the rhythms, the dances, the forms that comfort. Yet, it is possible to see peril in the finding of ultimate perfection. It is clear that the ultimate pattern contains it own fixity. In such perfection, all things move toward death.” ~ Dune (1965) I find the concept of embeddings to be one of the most fascinating ideas in machine learning. If you’ve ever used Siri, Google Assistant, Alexa, Google Translate, or even smartphone keyboard with next-word prediction, then chances are you’ve benefitted from this idea that has become central to Natural Language Processing models. There has been quite a development over the last couple of decades in using embeddings for neural models (Recent developments include contextualized word embeddings leading to cutting-edge models like BERT and GPT2). Word2vec is a method to efficiently create word embeddings and has been around since 2013. But in addition to its utility as a word-embedding method, some of its concepts have been shown to be effective in creating recommendation engines and making sense of sequential data even in commercial, non-language tasks. Companies like Airbnb, Alibaba, Spotify, and Anghami have all benefitted from carving out this brilliant piece of machinery from the world of NLP and using it in production to empower a new breed of recommendation engines. In this post, we’ll go over the concept of embedding, and the mechanics of generating embeddings with word2vec. But let’s start with an example to get familiar with using vectors to represent things. Did you know that a list of five numbers (a vector) can represent so much about your personality?

**Hacker News** @ycombinator@rss-mstdn.studiofreesia.com · Feb 24, 2024

Feb 13, 2024

Feb 13, 2024

ithoughthecamewithyou.com @ithoughthecamewithyou.com@ithoughthecamewithyou.com

Better related posts with word2vec (C#) #code #software #word2vec #ithcwy #ml How to use word2vec to create a vector representation of a blog post and then use the cosine distance between posts to select improved related posts.

https://fed.brid.gy/r/https://ithoughthecamewithyou.com/post/better-related-posts-with-word2vec-c

I Thought He Came With YouBetter related posts with word2vec (C#)How to use word2vec to create a vector representation of a blog post and then use the cosine distance between posts to select improved related posts.