Do you use anything to archive content for yourself or others? (research, videos, articles, and anything that could be lost to time or censorship)

Otter@lemmy.ca · edit-2 7 days ago

Do you use anything to archive content for yourself or others? (research, videos, articles, and anything that could be lost to time or censorship)

Otter@lemmy.ca · 7 days ago

One option that I’ve heard of in the past

https://archivebox.io/

ArchiveBox is a powerful, self-hosted internet archiving solution to collect, save, and view websites offline.

Admiral Patrick@dubvee.org · edit-2 7 days ago

Going to check that out because…yeah. Just gotta figure out what and where to archive.

tomtomtom@lemmy.world · 6 days ago

I am using archivebox, it is pretty straight-forward to self-host and use.

However, it is very difficult to archive most news sites with it and many other sites as well. Most cookie etc pop ups on a site will render the archived page unusable and often archiving won’t work at all because some bot protection (Cloudflare etc.) will kick-in when archivebox tries to access a site.

If anyone else has more success using it, please let me know if I am doing something wrong…

Daniel Quinn@lemmy.ca · 6 days ago

Monolith has the same problem here. I think the best resolution might be some sort of browser-plugin based solution where you could say “archive this” and have it push the result somewhere.

I wonder if I could combine a dumb plugin with Monolith to do that… A weekend project perhaps.

CrazyLikeGollum@lemmy.world · 7 days ago

That looks useful, I might host that. Does anyone have an RSS feed of at risk data?

M600@lemmy.world · 7 days ago

This seems pretty cool. I might actually host this.

Boomkop3@reddthat.com · 6 days ago

Eyy, I want that!