mastodon.world is one of the many independent Mastodon servers you can use to participate in the fediverse.
Generic Mastodon server for anyone to use.

Server stats:

8.1K
active users

#datastructure

0 posts0 participants0 posts today

I recently made a very popular LinkedIn post about Simpson's Paradox, which resulted in an engaging conversation. Paul Julian made a great comment on the relationship between Mixed Effects Models and Simpson's Paradox that I wanted to share with you.

In the plot below (generated from reproducible code – thanks, Paul!), you can see how different models compare:

Original post: linkedin.com/posts/joachim-sch

Further details: eepurl.com/gH6myT

🟪 Harnessing the Power of Data Structure to Build Resilient Power Apps

Too often, Power Apps makers jump into app building—without giving enough thought to data structure. In Ep. 55 of #PowerTalks, Griffin Lickfeldt from Citizen Developer explains why getting this step right from the start is critical for building scalable, secure, and AI-ready apps.

💡 The Superpower of Dataverse
🔍 Strategic Table Structuring
⚖️ Future-Proof Your Apps
🔒 Security as a Cornerstone

▶︎ hubsite365.com/en-ww/citizen-d

Mean imputation is a common method for handling missing values in numerical data. It replaces missing values with the mean of the observed values, ensuring the data set remains complete and easy to use.

The image below illustrates the impact of mean imputation. The black line represents the original data distribution before imputation, while the red line shows the data distribution after imputation.

Tutorial: statisticsglobe.com/mean-imput

Newsletter: eepurl.com/gH6myT

Mean imputation is a straightforward method for handling missing values in numerical data, but it can significantly distort the relationships between variables.

For a detailed explanation of mean imputation, its drawbacks, and better alternatives, check out my full tutorial here: statisticsglobe.com/mean-imput

More details are available at this link: eepurl.com/gH6myT

gganimate is a powerful extension for ggplot2 that transforms static visualizations into dynamic animations. By adding a time dimension, it allows you to illustrate trends, changes, and patterns in your data more effectively.

The attached animated visualization, which I created with gganimate, showcases a ranked bar chart of the top 3 countries for each year based on inflation since 1980.

More information: statisticsglobe.com/online-cou

Visualizing gene structures in R? gggenes, an extension of ggplot2, simplifies the process of creating clear and informative gene diagrams, making genomic data easier to interpret and share.

Visualization: cran.r-project.org/web/package

Click this link for detailed information: statisticsglobe.com/online-cou

#ReleaseMonday — One of the recent (already very useful!) new package additions to #ThingUmbrella is:

thi.ng/leaky-bucket

Leaky buckets are commonly used in communication networks for rate limiting, traffic shaping and bandwidth control, but are equally useful in other domains requiring similar constraints.

A Leaky Bucket is a managed counter with an enforced maximum value (i.e. bucket capacity). The counter is incremented for each a new event to check if it can/should be processed. If the bucket capacity has already been reached, the bucket will report an overflow, which we can then handle accordingly (e.g. by dropping or queuing events). The bucket also has a configurable time interval at which the counter is decreasing (aka the "leaking" behavior) until it reaches zero again (i.e. until the bucket is empty). Altogether, this setup can be utilized to ensure both an average rate, whilst also supporting temporary bursting in a controlled fashion...

Related, I've also updated/simplified the rate limiter interceptor in thi.ng/server to utilize this new package...

thi.ng/leaky-bucketConfigurable, counter-based Leaky Bucket abstractions

I used to think that writing sophisticated R code meant using all the advanced features and chaining long functions together...

Fancy code can be fun, but clean code makes collaboration and debugging so much easier.

Stay informed on data science by joining my free newsletter. Check out this link for more details: eepurl.com/gH6myT

eepurl.comStatistics GlobeStatistics Globe Email Forms

Ordered map на Go

Omap — это пакет Golang для работы с потокобезопасными упорядоченными map. Упорядоченная map содержит map golang, list и mutex для выполнения функций упорядоченной map. Упорядоченная map— это map, которая запоминает порядок элементов. Map можно итерировать для извлечения элементов в том порядке, в котором они были добавлены.

habr.com/ru/articles/882828/

ХабрOrdered map на GoOmap — это пакет Golang для работы с потокобезопасными упорядоченными map. Упорядоченная map содержит map golang, list и mutex для выполнения функций упорядоченной map. Упорядоченная map— это map,...
#go#map#caching

In missing data imputation, it is crucial to compare the distributions of imputed values against the observed data to better understand the structure of the imputed values.

The visualization below can be generated using the following R code:

library(mice)
my_imp <- mice(boys)
densityplot(my_imp)

Take a look here for more details: statisticsglobe.com/online-wor

Avoiding text overlap in plots is essential for clarity, and R offers a great solution with the ggplot2 and ggrepel packages. By automatically repositioning labels, ggrepel keeps your plot clean and easy to interpret.

Video: youtube.com/watch?v=5lu4h_CPhi0
Website: statisticsglobe.com/avoid-over

Take a look here for more details: statisticsglobe.com/online-cou