mastodon.world is one of the many independent Mastodon servers you can use to participate in the fediverse.
Generic Mastodon server for anyone to use.

Server stats:

8.1K
active users

#datasets

1 post1 participant0 posts today

New-to-me, from Micah Lee: TeleMessage Explorer: a new open source research tool. “I’ve spent the last week or two writing code to make sense of the massive hack of data from TeleMessage, the comically insecure company that makes a modified Signal app that Trump’s former national security advisor Mike Waltz was caught using. I’ve decided to publish my code as open source in the hopes that other […]

https://rbfirehose.com/2025/08/11/telemessage-explorer-a-new-open-source-research-tool-micah-lee/

ResearchBuzz: Firehose | Individual posts from ResearchBuzz · TeleMessage Explorer: a new open source research tool (Micah Lee) | ResearchBuzz: Firehose
More from ResearchBuzz: Firehose

PsyPost: Secret changes to major U.S. health datasets raise alarms. “A new study in the medical journal The Lancet reports that more than 100 United States government health datasets were altered this spring without any public notice. The investigation shows that nearly half of the files examined underwent wording changes while leaving the official change logs blank. The authors warn that […]

https://rbfirehose.com/2025/07/19/psypost-secret-changes-to-major-u-s-health-datasets-raise-alarms/

ResearchBuzz: Firehose | Individual posts from ResearchBuzz · PsyPost: Secret changes to major U.S. health datasets raise alarms | ResearchBuzz: Firehose
More from ResearchBuzz: Firehose

The Conversation: Vanishing data in the U.S. undermines good public policy, with global implications. “As researchers focused on data management (Kristi) and behavioural sciences (Albert) and whose work tackles the significance of research with open access data, we have been concerned about how the data sets that scholars around the world rely on have been vanishing from U.S. government […]

https://rbfirehose.com/2025/07/19/the-conversation-vanishing-data-in-the-u-s-undermines-good-public-policy-with-global-implications/

ResearchBuzz: Firehose | Individual posts from ResearchBuzz · The Conversation: Vanishing data in the U.S. undermines good public policy, with global implications | ResearchBuzz: Firehose
More from ResearchBuzz: Firehose

NextGov: Inside efforts to capture federal data after ‘the big takedown’. “[Denice] Ross, who’s now a senior fellow at the Federation of American Scientists, is one of the people behind America’s Data Index, an attempt to get a better view of changes across the government data ecosystem.”

https://rbfirehose.com/2025/07/14/nextgov-inside-efforts-to-capture-federal-data-after-the-big-takedown/

ResearchBuzz: Firehose | Individual posts from ResearchBuzz · NextGov: Inside efforts to capture federal data after ‘the big takedown’ | ResearchBuzz: Firehose
More from ResearchBuzz: Firehose

ERIC: Using LEGO® Brick Data to Teach SQL and Relational Database Concepts. “This paper introduces the LEGO® Database, a large natural dataset that can be used to teach Structured Query Language (SQL) and relational database concepts. This dataset is well-suited for introductory and advanced database assignments and end-of-semester group projects. The data is freely available from […]

https://rbfirehose.com/2025/07/11/eric-using-lego-brick-data-to-teach-sql-and-relational-database-concepts/

ResearchBuzz: Firehose | Individual posts from ResearchBuzz · ERIC: Using LEGO® Brick Data to Teach SQL and Relational Database Concepts | ResearchBuzz: Firehose
More from ResearchBuzz: Firehose

Data Descriptor: Coronavirus research topics, tracking twenty years of research . “To explore research trends and innovations in this space, we developed a pipeline using natural language processing techniques. This pipeline systematically catalogues and synthesises the vast array of research articles, leading to the creation of a dataset with more than eight hundred thousand articles from […]

https://rbfirehose.com/2025/06/27/data-descriptor-coronavirus-research-topics-tracking-twenty-years-of-research/

ResearchBuzz: Firehose | Individual posts from ResearchBuzz · Data Descriptor: Coronavirus research topics, tracking twenty years of research | ResearchBuzz: Firehose
More from ResearchBuzz: Firehose

Gallaudet News: Gallaudet experts drive accessibility of speech tech for deaf voices . “Some people use their voices to control tech, from cell phones and remote controls to home appliances and in transportation. Voice command capabilities are made possible through training AI and machine learning. The Speech Accessibility Project is creating datasets of more diverse speech patterns, which […]

https://rbfirehose.com/2025/06/27/gallaudet-news-gallaudet-experts-drive-accessibility-of-speech-tech-for-deaf-voices/

ResearchBuzz: Firehose | Individual posts from ResearchBuzz · Gallaudet News: Gallaudet experts drive accessibility of speech tech for deaf voices | ResearchBuzz: Firehose
More from ResearchBuzz: Firehose

Howard University: Howard University and Google Research Enhance A.I. Speech Recognition of African American English. “Researchers collected 600 hours of data from users of different [African American English] dialects in an effort to address implicit barriers to improving [automatic speech recognition] performance. Thirty-two states are represented in the dataset.”

https://rbfirehose.com/2025/06/26/howard-university-howard-university-and-google-research-enhance-a-i-speech-recognition-of-african-american-english/

ResearchBuzz: Firehose | Individual posts from ResearchBuzz · Howard University: Howard University and Google Research Enhance A.I. Speech Recognition of African American English | ResearchBuzz: Firehose
More from ResearchBuzz: Firehose

Data Rescue Project: Data Rescue Project Launches New Portal. “The Data Rescue Project (DRP) is excited to announce the launch of the DRP Portal—a milestone in our collective effort to protect and preserve at-risk public information. … The Portal makes it easy to discover rescued datasets by government offices sharing the data, topic, and more.”

https://rbfirehose.com/2025/06/25/data-rescue-project-data-rescue-project-launches-new-portal/

ResearchBuzz: Firehose | Individual posts from ResearchBuzz · Data Rescue Project: Data Rescue Project Launches New Portal | ResearchBuzz: Firehose
More from ResearchBuzz: Firehose

My prediction is that we won’t ever get public release of early OpenAI, Google, or even Anthropic #training #datasets.

Why? There are too many rich hard-right conservative backers who need all the misogyny, racism & hate speech to stay there.

We could have just & equal #AI, but we won’t. There’s too much money & power to be made of injustice.

Scientific Data: City-Defined Neighborhood Boundaries in the United States . ” Researchers lack widespread but locally-sourced data on neighborhoods, and instead often adopt widely available but arbitrary Census geographies as neighborhood proxies. … We address this tension between scale and precision by collecting, cleaning, and providing to researchers a new dataset of city-defined […]

https://rbfirehose.com/2025/06/21/scientific-data-city-defined-neighborhood-boundaries-in-the-united-states/

ResearchBuzz: Firehose | Individual posts from ResearchBuzz · Scientific Data: City-Defined Neighborhood Boundaries in the United States | ResearchBuzz: Firehose
More from ResearchBuzz: Firehose

Data Rescue Project: Why We’re Starting a New Federal Data Forum. “[Population Reference Bureau] recently launched the Federal Data Forum—a centralized online community designed to unite public data stakeholders in defense of America’s statistical infrastructure. The initiative builds on PRB’s previous work as data intermediaries, including our American Community Survey Online Community, […]

https://rbfirehose.com/2025/06/18/data-rescue-project-why-were-starting-a-new-federal-data-forum/

ResearchBuzz: Firehose | Individual posts from ResearchBuzz · Data Rescue Project: Why We’re Starting a New Federal Data Forum | ResearchBuzz: Firehose
More from ResearchBuzz: Firehose

Harvard Library: Institutional Books 1.0: A 242B Token Dataset from Harvard Library’s Collections, Refined for Accuracy and Usability. “The rapid development and adoption of LLMs of varying quality has brought into focus the scarcity of publicly available, high-quality training data and revealed an urgent need to ground the stewardship of these datasets in sustainable practices with clear […]

https://rbfirehose.com/2025/06/13/institutional-books-1-0-a-242b-token-dataset-from-harvard-librarys-collections-refined-for-accuracy-and-usability-harvard-library/

ResearchBuzz: Firehose | Individual posts from ResearchBuzz · Institutional Books 1.0: A 242B Token Dataset from Harvard Library’s Collections, Refined for Accuracy and Usability (Harvard Library) | ResearchBuzz: Firehose
More from ResearchBuzz: Firehose

NASA: NASA’s Ready-to-Use Dataset Details Land Motion Across North America. “NASA is collaborating with the Alaska Satellite Facility in Fairbanks to create a powerful web-based tool that will show the movement of land across North America down to less than an inch. The online portal and its underlying dataset unlock a trove of satellite radar measurements that can help anyone identify […]

https://rbfirehose.com/2025/06/08/nasa-nasas-ready-to-use-dataset-details-land-motion-across-north-america/

The Conversation: How remembering railway accidents from 100 years ago can make the industry safer today. “The Railway Work, Life & Death project has added nearly 70,000 cases of worker accidents in England and Wales to its database of staff accidents from before 1939. Until now the records have been available only in hard copy. But digital access via the project website will mean insights […]

https://rbfirehose.com/2025/06/05/the-conversation-how-remembering-railway-accidents-from-100-years-ago-can-make-the-industry-safer-today/

ResearchBuzz: Firehose | Individual posts from ResearchBuzz · The Conversation: How remembering railway accidents from 100 years ago can make the industry safer today | ResearchBuzz: Firehose
More from ResearchBuzz: Firehose