mastodon.world is one of the many independent Mastodon servers you can use to participate in the fediverse.
Generic Mastodon server for anyone to use.

Server stats:

9.9K
active users

#hdfs

0 posts0 participants0 posts today
Doug Whitfield [Minneapolis]<p>so, gonna write some stuff on <a href="https://mastodon.social/tags/HDFS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>HDFS</span></a> <a href="https://mastodon.social/tags/MapReduce" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>MapReduce</span></a> <a href="https://mastodon.social/tags/yarn" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>yarn</span></a> and maybe clustering. Also, <a href="https://mastodon.social/tags/machinelearning" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>machinelearning</span></a> was suggested but I think that may be too broad of a topic for this. I did cover Machine Learning in a blog back in 2023, but this time is for KB, not blog: <a href="https://www.openlogic.com/blog/using-cassandra-kafka-and-spark-ai" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">openlogic.com/blog/using-cassa</span><span class="invisible">ndra-kafka-and-spark-ai</span></a></p><p>Hmm, perhaps some sort of ML performance (as in disk io, etc not accuracy) document would be good but still, where to even start.</p><p>If anyone has beginner resources, I'll likely be pointing folks to some resources</p>
Habr<p>Мой опыт эксплуатации кластера Trino</p><p>Trino — высокопроизводительный распределённый SQL-движок, с возможностью объединения данных из разнородных источников, таких как: реляционные БД, файловые хранилища, шины данных, inmemory-хранилища, облачные сервисы и тд. Архитектура ориентирована на выполнение аналитических запросов с минимальной задержкой. Т.е. с его помощью можно отправлять SQL-запросы в MongoDB и Kafka, например. Благодаря скорости, развитию, и удобству захватывает популярность у инженеров и аналитиков, работающих с bigdata. Я познакомился с Trino 1 год назад, за это время настроил с нуля кластер на baremetal и помог с проблемами в нескольких других. В этой статье делюсь краткой выжимкой опыта эксплуатации, накопленным за это время. Большая часть информации будет актуальна и для российского форка Trino: CedrusData .</p><p><a href="https://habr.com/ru/articles/863854/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="">habr.com/ru/articles/863854/</span><span class="invisible"></span></a></p><p><a href="https://zhub.link/tags/trino" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>trino</span></a> <a href="https://zhub.link/tags/trinosql" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>trinosql</span></a> <a href="https://zhub.link/tags/hdfs" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>hdfs</span></a> <a href="https://zhub.link/tags/iceberg" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>iceberg</span></a> <a href="https://zhub.link/tags/cedrusdata" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>cedrusdata</span></a> <a href="https://zhub.link/tags/presto" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>presto</span></a> <a href="https://zhub.link/tags/prestodb" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>prestodb</span></a></p>
Python for Data Science<p>The page for supporting (remote) file systems in Python has become much more comprehensive: <a href="https://www.python4data.science/en/latest/data-processing/file-systems.html" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">python4data.science/en/latest/</span><span class="invisible">data-processing/file-systems.html</span></a><br>And you are welcome to let us know if we have forgotten anything.<br><a href="https://mastodon.social/tags/Python" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Python</span></a> <a href="https://mastodon.social/tags/HDFS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>HDFS</span></a> <a href="https://mastodon.social/tags/S3" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>S3</span></a> <a href="https://mastodon.social/tags/GoogleDrive" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>GoogleDrive</span></a> <a href="https://mastodon.social/tags/HuggingFace" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>HuggingFace</span></a> <a href="https://mastodon.social/tags/Git" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Git</span></a></p>
LINUXexpert.org<p>The Hadoop ecosystem comprises various tools and frameworks designed to handle large-scale data processing and analytics. Let's discuss the core components, namely Hadoop, HBase, and Hive, along with other significant tools such as Pig, Sqoop, Flume, Oozie, and Zookeeper.</p><p><a href="https://linuxexpert.org/so-you-wanna-do-big-data/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">linuxexpert.org/so-you-wanna-d</span><span class="invisible">o-big-data/</span></a></p><p><a href="https://mastodon.social/tags/Hadoop" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Hadoop</span></a> <a href="https://mastodon.social/tags/HBase" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>HBase</span></a> <a href="https://mastodon.social/tags/Hive" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Hive</span></a> <a href="https://mastodon.social/tags/BigData" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BigData</span></a> <a href="https://mastodon.social/tags/HadoopEcosystem" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>HadoopEcosystem</span></a> <a href="https://mastodon.social/tags/HDFS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>HDFS</span></a> <a href="https://mastodon.social/tags/MapReduce" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>MapReduce</span></a> <a href="https://mastodon.social/tags/YARN" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>YARN</span></a> <a href="https://mastodon.social/tags/Pig" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Pig</span></a> <a href="https://mastodon.social/tags/Sqoop" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Sqoop</span></a> <a href="https://mastodon.social/tags/Flume" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Flume</span></a> <a href="https://mastodon.social/tags/Oozie" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Oozie</span></a> <a href="https://mastodon.social/tags/Zookeeper" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Zookeeper</span></a> <a href="https://mastodon.social/tags/DataProcessing" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DataProcessing</span></a> <a href="https://mastodon.social/tags/DataAnalytics" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DataAnalytics</span></a> <a href="https://mastodon.social/tags/DataWarehousing" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DataWarehousing</span></a> <a href="https://mastodon.social/tags/ETL" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ETL</span></a> <a href="https://mastodon.social/tags/DataIngestion" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DataIngestion</span></a> <a href="https://mastodon.social/tags/Security" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Security</span></a></p>
Patryk Krawaczyński<p>Przyśpieszanie odczytu zajętości dysków na Hadoop &lt; 2.8.X ( <a href="https://nfsec.pl/root/6233" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="">nfsec.pl/root/6233</span><span class="invisible"></span></a> ) <a href="https://infosec.exchange/tags/hadoop" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>hadoop</span></a> <a href="https://infosec.exchange/tags/hdfs" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>hdfs</span></a> <a href="https://infosec.exchange/tags/tuning" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>tuning</span></a> <a href="https://infosec.exchange/tags/twittermigration" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>twittermigration</span></a></p>
Peter Czanik<p>I did my usual syslog-ng git snapshot compiles today. RPM platforms were all right. On <a href="https://fosstodon.org/tags/FreeBSD" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>FreeBSD</span></a> I ran into a minor problem:</p><p><a href="https://github.com/syslog-ng/syslog-ng/issues/4515" rel="nofollow noopener noreferrer" target="_blank"><span class="invisible">https://</span><span class="ellipsis">github.com/syslog-ng/syslog-ng</span><span class="invisible">/issues/4515</span></a></p><p><a href="https://fosstodon.org/tags/Java" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Java</span></a> modules fail. My guess is that nobody is really affected, unless someone uses the <a href="https://fosstodon.org/tags/HDFS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>HDFS</span></a> destination of <a href="https://fosstodon.org/tags/syslog_ng" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>syslog_ng</span></a>.</p>
Big Data analytics News<p>Hadoop admin interview questions <a href="https://buff.ly/2nB1KSM" target="_blank" rel="nofollow noopener noreferrer" translate="no"><span class="invisible">https://</span><span class="">buff.ly/2nB1KSM</span><span class="invisible"></span></a> <a href="https://mastodon.world/tags/hdfs" class="mention hashtag" rel="tag">#<span>hdfs</span></a> <a href="https://mastodon.world/tags/hive" class="mention hashtag" rel="tag">#<span>hive</span></a> <a href="https://mastodon.world/tags/hbase" class="mention hashtag" rel="tag">#<span>hbase</span></a> <a href="https://mastodon.world/tags/mapr" class="mention hashtag" rel="tag">#<span>mapr</span></a> <a href="https://mastodon.world/tags/database" class="mention hashtag" rel="tag">#<span>database</span></a> <a href="https://mastodon.world/tags/nosql" class="mention hashtag" rel="tag">#<span>nosql</span></a> <a href="https://mastodon.world/tags/bigdata" class="mention hashtag" rel="tag">#<span>bigdata</span></a> <a href="https://mastodon.world/tags/ai" class="mention hashtag" rel="tag">#<span>ai</span></a></p>
Big Data analytics News<p>40 Best Free and Open Source <a href="https://mastodon.world/tags/NoSQL" class="mention hashtag" rel="tag">#<span>NoSQL</span></a> Databases <a href="https://buff.ly/3b6LQag" target="_blank" rel="nofollow noopener noreferrer" translate="no"><span class="invisible">https://</span><span class="">buff.ly/3b6LQag</span><span class="invisible"></span></a> <a href="https://mastodon.world/tags/bigdata" class="mention hashtag" rel="tag">#<span>bigdata</span></a> <a href="https://mastodon.world/tags/hadoop" class="mention hashtag" rel="tag">#<span>hadoop</span></a> <a href="https://mastodon.world/tags/HDFS" class="mention hashtag" rel="tag">#<span>HDFS</span></a> <a href="https://mastodon.world/tags/Hive" class="mention hashtag" rel="tag">#<span>Hive</span></a> <a href="https://mastodon.world/tags/IoT" class="mention hashtag" rel="tag">#<span>IoT</span></a> <a href="https://mastodon.world/tags/Fintech" class="mention hashtag" rel="tag">#<span>Fintech</span></a> <a href="https://mastodon.world/tags/MapR" class="mention hashtag" rel="tag">#<span>MapR</span></a> <a href="https://mastodon.world/tags/ArtificialIntelligence" class="mention hashtag" rel="tag">#<span>ArtificialIntelligence</span></a> <a href="https://mastodon.world/tags/AI" class="mention hashtag" rel="tag">#<span>AI</span></a> <a href="https://mastodon.world/tags/ML" class="mention hashtag" rel="tag">#<span>ML</span></a> <a href="https://mastodon.world/tags/DataScience" class="mention hashtag" rel="tag">#<span>DataScience</span></a> <a href="https://mastodon.world/tags/DataScientists" class="mention hashtag" rel="tag">#<span>DataScientists</span></a> <a href="https://mastodon.world/tags/CodeNewbies" class="mention hashtag" rel="tag">#<span>CodeNewbies</span></a> <a href="https://mastodon.world/tags/Tech" class="mention hashtag" rel="tag">#<span>Tech</span></a> <a href="https://mastodon.world/tags/deeplearning" class="mention hashtag" rel="tag">#<span>deeplearning</span></a> <a href="https://mastodon.world/tags/CyberSecurity" class="mention hashtag" rel="tag">#<span>CyberSecurity</span></a> <a href="https://mastodon.world/tags/Python" class="mention hashtag" rel="tag">#<span>Python</span></a> <a href="https://mastodon.world/tags/Coding" class="mention hashtag" rel="tag">#<span>Coding</span></a> <a href="https://mastodon.world/tags/javascript" class="mention hashtag" rel="tag">#<span>javascript</span></a> <a href="https://mastodon.world/tags/rstats" class="mention hashtag" rel="tag">#<span>rstats</span></a> <a href="https://mastodon.world/tags/100DaysOfCode" class="mention hashtag" rel="tag">#<span>100DaysOfCode</span></a> <a href="https://mastodon.world/tags/programming" class="mention hashtag" rel="tag">#<span>programming</span></a> <a href="https://mastodon.world/tags/Linux" class="mention hashtag" rel="tag">#<span>Linux</span></a> <a href="https://mastodon.world/tags/IoT" class="mention hashtag" rel="tag">#<span>IoT</span></a> <a href="https://mastodon.world/tags/IIoT" class="mention hashtag" rel="tag">#<span>IIoT</span></a> <a href="https://mastodon.world/tags/BigData" class="mention hashtag" rel="tag">#<span>BigData</span></a></p>
Volkan Özçelik 🦄<p>SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of file</p><p><a href="https://github.com/seaweedfs/seaweedfs" rel="nofollow noopener noreferrer" target="_blank"><span class="invisible">https://</span><span class="">github.com/seaweedfs/seaweedfs</span><span class="invisible"></span></a></p><p><a href="https://hachyderm.io/tags/Kubernetes" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Kubernetes</span></a> <a href="https://hachyderm.io/tags/DistributedSystems" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DistributedSystems</span></a> <a href="https://hachyderm.io/tags/S3Compatible" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>S3Compatible</span></a> <a href="https://hachyderm.io/tags/ErasureCoding" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ErasureCoding</span></a> <a href="https://hachyderm.io/tags/HDFS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>HDFS</span></a> <a href="https://hachyderm.io/tags/Hadoop" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Hadoop</span></a> <a href="https://hachyderm.io/tags/Fuse" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Fuse</span></a> <a href="https://hachyderm.io/tags/Posix" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Posix</span></a> <a href="https://hachyderm.io/tags/infra" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>infra</span></a> <a href="https://hachyderm.io/tags/SeaweedFS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SeaweedFS</span></a></p>