I still have a bunch of computers running ArchiveTeam Warrior and here are the totals for what I've downloaded so far...
Telegram 107 GiB
Voice of America 94 GiB
US Government 44 GiB
Goo-gl 450 MiB
Twitch 135 MiB
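For a sense of scale, the figures above can be added up with a short Python sketch. It assumes the listed values are exact and uses binary units (1 GiB = 1024 MiB); the names `totals_mib` and `grand_total_gib` are made up for illustration.

```python
# Per-project totals as listed in the post, converted to MiB.
# Assumption: binary units, 1 GiB = 1024 MiB.
MIB_PER_GIB = 1024

totals_mib = {
    "Telegram": 107 * MIB_PER_GIB,
    "Voice of America": 94 * MIB_PER_GIB,
    "US Government": 44 * MIB_PER_GIB,
    "Goo-gl": 450,
    "Twitch": 135,
}


def grand_total_gib(totals):
    """Sum all per-project totals (in MiB) and return the result in GiB."""
    return sum(totals.values()) / MIB_PER_GIB


total_gib = grand_total_gib(totals_mib)
print(f"Total downloaded: {total_gib:.2f} GiB")  # prints "Total downloaded: 245.57 GiB"
```

The two small projects barely move the needle: almost all of the total comes from Telegram, Voice of America, and US Government.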
@eloquence
Having posts or other indexed/indexable content refer to URL shorteners is dangerous for referrals/archiving/…:
#ArchiveTeam, the people behind e.g. the effort to archive US government websites in a hurry—before they were deleted/changed in an even greater hurry by the current administration, write about #URLShorteners:
"Such services are a ticking timebomb. If they go away, get hacked, or sell out, millions of links will be lost (see Wikipedia: Link Rot)."
https://wiki.archiveteam.org/index.php/URLTeam
The ArchiveTeam Warrior has been running intermittently on my laptop for ten days now.
It downloads stuff and puts it into the Internet Archive.
Everything's fine. It only runs while I use the laptop. I don't notice it. When the laptop goes into standby and wakes up again that doesn't seem to have any adverse effects.
I've downloaded and uploaded gigabytes so far. The top of the leader board for this project is half a petabyte.
Now I'm considering an installation where it could run around the clock. I don't want to increase our household's standby energy consumption too much, so I will see how that goes.
#archiving
#internetArchive
#DataRescue
#dataPreservation
#digitalPreservation
#archiveTeamWarrior
#archiveTeam
Installed and started the ArchiveTeam Warrior. Very smooth experience.
It downloads stuff and puts it into the Internet Archive.
I took the "ArchiveTeam’s Choice" project and it chose public Telegram channels. It's not taking a lot of bandwidth or memory or space or computing, as far as I can tell. It might take too much of my time and focus if I continue staring at the dashboard to try and figure out what all that stuff is.
Moin, @EposVox! I've watched your video - the one in which you've mentioned #ArchiveTeam. Would you be inclined to fill the ~5-10 year long gap of videos on YT, explaining their effort …perhaps you could show how those rescued docs can be accessed, searched, used, and stuff.
Is this some kind of IPFS or CEPH (distributed) filesystem, with redundancy, and "fancy tech shiz" …
How to install #ArchiveTeamWarrior and contribute bandwidth to archiving at-risk web sites
ArchiveTeam Warrior - #Archiveteam
There was a lack of a decent web-based leaderboard, so I wrote one in Python and got up to speed on publishing to PyPI properly. It's a nice minimal example of publishing a single Python file as an installable command.
Anyone else running an #ArchiveTeam instance on the usgov project? Mine was humming all week but now it's not getting any items. Is the archive... done?
I managed to help archive ~35GB of US Government web content with my #ArchiveTeam Warrior instance. At the moment there are no more available to-do items, but I'm keeping my warrior alive if any items re-enter the queue. Glad I was able to donate some CPU and bandwidth to the cause :)
I love that there are so many great efforts out there preserving Geocities, and the internet as a whole. So much wholesome, personal stuff, immortalised.
To the #ArchiveTeam nerds, thank you for being awesome.
I only just now took a good look at the Archive Team Warrior logo 10/10 no notes
Started helping #archiveteam back up all of the US government websites last night. Up to 7 GB today! Simple Docker setup in #Unraid for me, but if you have bandwidth and a computer doing nothing you can help too: https://wiki.archiveteam.org/index.php/Main_Page
Really simple instructions on how to help archive the US Government's websites:
https://wiki.archiveteam.org/index.php/ArchiveTeam_Warrior#Installing_and_running_with_Docker
Although #ArchiveTeam are not affiliated with the #InternetArchive, that is where the archived sites are stored.
Help @internetarchive back up US Government websites & data before it's "disappeared" — your taxes paid for it: *public* data.
Easy: Use a spare laptop or other device you can leave on — there's a virtual machine you run on Windows and Linux (link below): download the image and install VirtualBox, then open the image in VirtualBox (see under "basic usage"):
https://wiki.archiveteam.org/index.php/ArchiveTeam_Warrior
h/t @jbcarroll @ramsey
https://friendica.jb-net.us/display/052e3994-1867-a41a-a152-1c3654928625
https://phpc.social/@ramsey/113943736240301661
https://arstechnica.com/science/2023/07/is-distributed-computing-dying-or-just-fading-into-the-backdrop/
#ArchiveTeam
That #ArchiveTeam warrior is pretty low resource usage. Anybody should be able to run this on modest hardware. Their instructions even explain how you could run it on a laptop or PC in the background, just humming along.
I was curious about where #ArchiveTeam folks are working. Looks like the Leaflet map they have set up to show contributors is shifted to the southeast. Note the probable DFW, Austin, and Houston folks that are shown out in the middle of the rural areas. #mapping #cartography #archiving
#US #USA #archiveteam #archive #usgovernment
YESSSS! ^.^ NLM website right now
If you have some bandwidth to offer and want to help --> https://wiki.archiveteam.org/index.php/ArchiveTeam_Warrior