
**Computo** @computo@mathstodon.xyz · Jul 15

Summer read: a new paper on model-based clustering just appeared in Computo!

Julien Jacques and Brendan Thomas Murphy publish a new method for clustering multivariate count data. The method combines feature selection and clustering, and is based on conditionally independent Poisson mixture models and Poisson generalized linear models.

In simulations, the Adjusted Rand Index (ARI) of the model with selected variables is close to the optimal ARI obtained with the true clustering variables.
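For readers unfamiliar with the ARI mentioned above, here is a minimal pure-Python sketch of the standard pair-counting formula. It is not the paper's R code; the function name `adjusted_rand_index` and the toy label lists are illustrative assumptions.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_a, labels_b):
    """Chance-corrected agreement between two partitions of the same items."""
    if len(labels_a) != len(labels_b):
        raise ValueError("both labelings must cover the same items")
    n = len(labels_a)
    if n < 2:
        return 1.0  # no pairs to disagree on

    # Count item pairs that fall together in both, in A only, and in B only.
    together_both = sum(comb(c, 2) for c in Counter(zip(labels_a, labels_b)).values())
    together_a = sum(comb(c, 2) for c in Counter(labels_a).values())
    together_b = sum(comb(c, 2) for c in Counter(labels_b).values())

    expected = together_a * together_b / comb(n, 2)  # agreement expected by chance
    maximum = (together_a + together_b) / 2
    if maximum == expected:
        return 1.0  # degenerate case, e.g. a single cluster in both partitions
    return (together_both - expected) / (maximum - expected)


# Hypothetical labelings: the cluster names differ, but the grouping is identical.
true_labels = [0, 0, 1, 1]
found_labels = [1, 1, 0, 0]
score = adjusted_rand_index(true_labels, found_labels)
```

A score of 1 means the two partitions agree up to relabeling, values near 0 mean agreement no better than chance, and negative values mean less agreement than chance.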

The paper and accompanying R code are available at https://computo-journal.org/published-202507-jacques-count-data/

#machineLearning #clustering #Rstats

**Minecraft** @Minecraft@activitypub.awakari.com · Jul 10


How to Combine Junk Laptops into a Supercomputer

Imagine stepping into your garage, finding a trio of old, almost forgotten family sedans from the early 2000s, each one a bit rusty, each one a who...

#Gadgets #Hardware #hardware #limitations #Minecraft #servers #proxmox #server #clustering #software #configuration


Frank's World of Data Science & AI · Jul 10 · How to Combine Junk Laptops into a Supercomputer

Imagine stepping into your garage, finding a trio of old, almost forgotten family sedans from the early 2000s, each one a bit rusty, each one a who-knows-when-it-last-ran kind of machine, and decidin…

**Kubernetes** @Kubernetes@activitypub.awakari.com · Jul 6


Machine Learning Fundamentals: clustering

Clustering in Production Machine Learning Systems: A Deep Dive

1. Introduction

In Q3 2023, a critical anomaly in our fraud detection system at FinTe...

#machinelearning #ai #clustering


DEV Community · Machine Learning Fundamentals: clustering

**2rZiKKbOU3nTafniR2qMMSE0gwZ** @2rZiKKbOU3nTafniR2qMMSE0gwZ@activitypub.awakari.com · Jun 26

How to Set Up High Availability with SafeLine WAF

The SafeLine WAF High Availability solution ensures business continuity and guarantees the availability of SafeLine. This tutorial introduces how t...

#ha #failover #clustering #web-application-security


#webapplicationsecurity

**Enabla** @enabla@mathstodon.xyz · Jun 11


Don't miss the insightful new lecture from Dr. Alejandro Rodriguez Garcia of the Abdus Salam International Centre for Theoretical Physics (ICTP)!

In this one, Alex provides a comprehensive overview of various clustering methods, including flat, fuzzy, and hierarchical approaches. His lecture not only discusses the mathematical foundations of techniques like k-means and k-medoids but also highlights their practical applications across fields such as image recognition and data classification.

This lecture is an excellent opportunity to deepen your understanding of unsupervised learning and engage critically with advanced clustering methods.

Join Enabla to watch the lecture and interact with Dr. Rodriguez Garcia for free! Ask questions and spark discussions with both him and the rest of the Enabla community: https://enabla.com/pub/1109/about

#UnsupervisedLearning #MachineLearning #DataScience

**Edchart** @edchartcare@mastodon.social · May 31


Clustering Machine Learning Certification

Take the exam online: https://www.edchart.com/certificate/clustering-machine-learning-certification-exam-free-test
Get your verified digital credential: https://www.credly.com/org/edchart-technologies/badge/edchart-certified-clustering-machine-learning-subje

EdChart now offers the Clustering Machine Learning Certification, recognized globally and trusted by professionals. Take the online exam from anywhere in the world, and pay only if you pass.

#MachineLearning #Clustering #DataScience

**Semantic-Search** @Semantic-Search@activitypub.awakari.com · May 29


How To Automate SEO Keyword Clustering By Search Intent With Python

There’s a lot to know about...

https://zephyrnet.com/how-to-automate-seo-keyword-clustering-by-search-intent-with-python/

#SEO #automate #clustering #intent #keyword #Python #Search #seo


**LLMs** @LLMs@activitypub.awakari.com · May 23


Unveiling Hidden Patterns: An Advanced Generalized Framework for Automated Insight Discovery in...

https://medium.com/@sanchitsatija55/unveiling-hidden-patterns-an-advanced-generalized-framework-for-automated-insight-discovery-in-5fd945d88dfe?source=rss------machine_learning-5

#llm #frequent-itemset-mining #machine-learning #clustering


Medium · May 23 · Unveiling Hidden Patterns: An Advanced Generalized Framework for Automated Insight Discovery in Tabular Data · By Sanchitsatija

#frequentitemsetmining #machinelearning

**LLMs** @LLMs@activitypub.awakari.com · May 13


Embeddings are underrated

https://technicalwriting.dev/ml/embeddings/overview.html

#embeddings #machinelearning #AI #nlp #clustering #similarity #llm


technicalwriting.dev · Embeddings are underrated

**Benjamin Rosemann** @b2m@mastodon.social · May 7


A long-held wish of mine: using custom #Clustering methods in #OpenRefine.

Available since version 3.9.0, and since 3.9.3 it also works with #Jython and #Clojure.

Here is a usage guide on the #FDMLab blog.

https://fdmlab.landesarchiv-bw.de/workshop/openrefine-fortgeschrittene/19-erweitertes-clustering/

FDMLab@LABW · May 5 · Workshop - Advanced Clustering | FDMLab@LABW · We use custom clustering methods in OpenRefine to standardize spellings.

#LandesarchivBW

**MottG** @mottg@researchbuzz.masto.host · May 3


The Clustering Workbench of the Carrot2 search engine is working now. It can cluster search results with three algorithms: Lingo, STC, or k-means. STC (Suffix Tree Clustering) is a fast, phrase-based method that groups documents by the frequent phrases they share. The screenshot shows search results clustered with Lingo for the query "survey of AI tools for systematic reviews."

https://search.carrot2.org/#/workbench

#research #academia #Carrot2
#systematicReview
#clustering #Lingo #STC #k-means

**JMLR** @jmlr@sigmoid.social · Apr 25


'Curvature-based Clustering on Graphs', by Yu Tian, Zachary Lubberts, Melanie Weber.

http://jmlr.org/papers/v26/24-0781.html

#clustering #communities #clusters

**Kubernetes** @Kubernetes@activitypub.awakari.com · Apr 22


Kubernetes: how we deployed clusters in the absence of...

https://habr.com/ru/companies/slurm/articles/903186/?utm_source=habrahabr&utm_medium=rss&utm_campaign=903186

#kubernetes #кластер #классификация #cluster #clustering #кубернетес #кубернетес #мега


**LLMs** @LLMs@activitypub.awakari.com · Nov 28, 2023


Dataset in a day

A clustering-based approach to create deep learning datasets in a day

Introduct...

https://medium.com/bumble-tech/dataset-in-a-day-7f369de3b178?source=rss----6353b5325b1a---4

#data-science #machine-learning #clustering #deep-learning #dataset


#datascience #machinelearning #deeplearning

**Fabrice Tshimanga** @fabrice13@neuromatch.social · Mar 27


Exciting news, our paper is out!

"Behavioral Clusters and Lesion Distributions in Ischemic Stroke, Based on NIHSS Similarity Network" on Springer Journal of Healthcare Informatics Research https://rdcu.be/efgma

With my co-first-author Andrea Zanola and co-authors, we explore the relations between behavioral measures of impairment after stroke, and the underlying brain lesions.
Rather than focusing on covariances at the population level, we first cluster individual behavioral phenotypes, and then explore the typical and significant lesions of each cluster.

Our technique, Repeated Spectral Clustering, is performed on a similarity network (derived from the General Distance Measure, handy for ordinal scales!), and the partitions are statistically robust thanks to the aggregation of results from multiple random initializations.
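The post does not say how the repeated runs are aggregated. As a generic illustration only, one common way to combine partitions from several random initializations is a co-association (evidence accumulation) vote: two items end up in the same consensus cluster if they were grouped together in most runs. The sketch below assumes that scheme; the function name `consensus_partition`, the `threshold` parameter, and the toy runs are hypothetical and are not taken from the paper.

```python
from itertools import combinations


def consensus_partition(runs, threshold=0.5):
    """Merge items that share a cluster in more than `threshold` of the runs.

    `runs` is a list of label lists, one per clustering run (assumed input format).
    """
    if not runs:
        raise ValueError("need at least one clustering run")
    n = len(runs[0])
    parent = list(range(n))  # union-find forest over the items

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i, j in combinations(range(n), 2):
        together = sum(1 for labels in runs if labels[i] == labels[j])
        if together / len(runs) > threshold:
            parent[find(i)] = find(j)  # link the two groups

    # Relabel the groups as 0, 1, 2, ... in order of first appearance.
    group_ids = {}
    return [group_ids.setdefault(find(i), len(group_ids)) for i in range(n)]


# Three hypothetical runs over four items; label names differ between runs.
runs = [[0, 0, 1, 1], [1, 1, 0, 0], [0, 0, 0, 1]]
consensus = consensus_partition(runs)
```

Because the vote only looks at whether two items share a label within a run, it is insensitive to the arbitrary cluster numbering that differs from one initialization to the next.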

We end up with 5 clusters, 3 of which show well-known principal components of deficits (Left Motor, Right Motor, Language) and their associated lesions.

Interestingly, this multi-item, multimodal approach makes it possible to distinguish different etiologies for the same deficits, thanks to their different behavioral associations and the different lesions characterizing each cluster. Even when the single NIHSS measure is a bit "vague"...

We hope that popularizing the General Distance Measure, Repeated Spectral Clustering, and this clustering perspective alongside PCA / CCA studies can inspire multimodal approaches in other neuroscientific and biomedical domains!

Many thanks to our co-authors, Antonio Luigi Bisogno, Silvia Facchini, Lorenzo Pini, Manfredo Atzori and Maurizio Corbetta for data, analytic and medical insights, and their guidance throughout the whole process!

#stroke #neuroscience #clustering