archivebox alternative with easier to read file and subfolder structure
from lemmy_anonymk@lemmy.world to selfhosted@lemmy.world on 13 Dec 16:46
https://lemmy.world/post/23100728

Hi guys,

I have archivebox running on my server and am very happy with it overall. But one thing bothers me: The archived pages are stored in subfolders whose names do not reflect the content. I would like to have at least a reasonable backup of the archived pages in case the archivebox instance stops working. It would also make it easier to transfer the archived pages to other devices or other people.

My question is therefore whether there is a docker self-hosted web archive solution that offers a similar range of functions to archivebox in terms of the different storage formats, e.g. for videos, but which makes the files easier to search in their folder structure.

Thanks in advance

PS

These are the solutions I have alteady tried:

Which do you recommend for my requirements?

#selfhosted

threaded - newest

pe1uca@lemmy.pe1uca.dev on 13 Dec 20:51 collapse

Maybe you could submit an issue to the repo to include a way to change the format of the saved folders.
(I’m thinking something similar on how immich allows to change some formats)

I’m seeing in my instance the names seem like some sort of timestamp, not sure if the code uses them in a meaningful way, so probably the solution would be to create symlinks with the name of the site or some other format while keeping the timestamp so the rest of the code can still expect it.