The Vault

Your favorite music, your personal photos, your notes, your favorite quotes, and all your important files, including scanned documents: these are pieces of information that you should be able to store indefinitely, for 50 or 100 years, especially if you’ll still be alive by then.

It’s nearly impossible to predict which storage solution will still be around in 10 or 30 years. The safest bet is to store the data in a popular structure that can later be migrated to modern systems if needed.

Requirements for the storage system:

Replicated in multiple locations (no single point of failure, resilient).
Encrypted at rest (secure).
Encrypted in transport (private).
Accessible via a simple, generic protocol (generic).

Since we’re aiming for generic, the three most basic kinds of storage system are probably:

File systems, which organize and store files on a computer’s hard drive or other storage devices.
Block storage systems, which divide data into fixed-size blocks, each addressed individually, and expose them as raw volumes that can span one or more physical devices.
Object storage systems, which store data as discrete objects with unique identifiers and metadata, and can be accessed over a network using APIs.

Everything is a file

Even though block storage and object storage might be good options, to keep things as simple as possible we’re going to assume that everything is a file and use the file system as the basic storage system. It’s old and stable, easy to understand and to scale, and will probably be around for a long time.

Under the PESOS model (Publish Elsewhere, Syndicate to your Own Site), users probably already rely on big-brand services such as Google. A good starting point for building a personal vault is therefore to download all of one’s Google data via the Google Takeout service. The download arrives as a file system, with all of a user’s data organized in folders and files, even if some of those files are JSON files that could equally be stored in an object storage system.

Drop it anywhere

One of the reasons that organization systems fail is that the mindset we are in when we store information is not the mindset we are in when we search for it. One essential rule for this system is therefore that it shouldn’t matter how you organize the file system. This matters even more because organizing one’s entire digital life is a complex task that could take a long time and still not produce the perfect organization system.

The second reason to be able to drop it anywhere is that we use various systems that organize their data in various ways, so no single organizational system will suit everyone.

The only requirement, probably, is that the top-level folder be named after the user, which makes it easy to store the data of multiple users of an organization, company, family, or group.
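To make the "drop it anywhere" rule concrete, here is a minimal sketch of a lookup that ignores folder layout entirely. It assumes the vault is mounted at some local path and that each user's data lives under a folder named after them. The function name `find_files` and its parameters are hypothetical and exist only for illustration.

```python
import os
from pathlib import Path


def find_files(vault_root, username, needle):
    """Return files under <vault_root>/<username> whose name contains `needle`.

    The search walks the whole tree, so it does not matter which
    folder a file was dropped into. Matching is case-insensitive.
    """
    user_root = Path(vault_root) / username
    matches = []
    for dirpath, _dirnames, filenames in os.walk(user_root):
        for name in filenames:
            if needle.lower() in name.lower():
                matches.append(Path(dirpath) / name)
    return sorted(matches)
```

Calling `find_files("/mnt/vault", "alice", "passport")` would return every matching file in Alice's part of the vault, however deeply it is nested. The mount point and username here are made up. A real system would more likely index file contents and metadata as well, but the principle is the same: retrieval should not depend on how the folders happen to be arranged.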

Should be mountable

No matter what OS we are using, the simplest way to organize a file system is to mount it locally as a drive.

The storage solution should therefore expose a mountable interface (such as Samba, NFS, iSCSI, or WebDAV).

Storage system of choice: ZFS

ZFS began as part of the Sun Microsystems Solaris operating system in 2001¹.

It ticks all the boxes, being resilient, secure, and generic. One can even rent ZFS storage² without having to trust the owners, thanks to encryption at rest.

With ZFS, we can combine several drives into a pool of devices that acts as one solid storage unit with a certain degree of data distribution.
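As a rough illustration of pooling, the sketch below builds (but does not run) the two OpenZFS commands one might use to create a mirrored pool and an encrypted dataset inside it. The pool name, dataset name, and device paths are placeholders, and the helper functions are hypothetical. The command syntax assumes an OpenZFS release with native encryption.

```python
import shlex


def zpool_create_mirror(pool, devices):
    """Build a `zpool create` command that mirrors the given devices."""
    if len(devices) < 2:
        raise ValueError("a mirror needs at least two devices")
    return ["zpool", "create", pool, "mirror", *devices]


def zfs_create_encrypted(dataset):
    """Build a `zfs create` command for a passphrase-encrypted dataset."""
    return [
        "zfs", "create",
        "-o", "encryption=on",
        "-o", "keyformat=passphrase",
        dataset,
    ]


# Placeholder names: adjust the pool, dataset, and devices to your system.
commands = [
    zpool_create_mirror("tank", ["/dev/sdb", "/dev/sdc"]),
    zfs_create_encrypted("tank/vault"),
]
for cmd in commands:
    # Print for review instead of executing; running them needs root
    # and will overwrite whatever is on the listed devices.
    print(shlex.join(cmd))
```

A mirror is only one possible layout. ZFS also offers parity-based layouts (raidz) that trade capacity against redundancy differently, and which one fits depends on how many drives are available.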

Online services sync

Data from services that have an API can be imported and synchronized automatically.

Manually importing data

Some services (such as WhatsApp) don’t have an API, so their data must be imported manually. The import procedure might be automated to some extent, but the export is usually done personally and manually.

The big brands have options to manually request a download of all one’s data:

Google Takeout
Facebook: Download your Information⁵
Apple: Get a copy of your data⁶

The import procedure must be idempotent: importing every X months should not generate a completely different copy, but should add to the already imported archive.
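One way to get this property is to identify files by their content rather than by their name or date. The sketch below is a minimal illustration under that assumption, not a finished importer: it copies a downloaded export into an archive folder and skips any file whose SHA-256 digest is already present. The names `file_digest` and `import_export` are hypothetical.

```python
import hashlib
import shutil
from pathlib import Path


def file_digest(path):
    """Return the SHA-256 hex digest of a file, read in 1 MiB chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest()


def import_export(export_dir, archive_dir):
    """Copy new content from export_dir into archive_dir.

    Files whose content already exists anywhere in the archive are
    skipped, so repeating the same import adds nothing.
    Returns the list of archive paths that were added.
    """
    export_dir, archive_dir = Path(export_dir), Path(archive_dir)
    archive_dir.mkdir(parents=True, exist_ok=True)
    known = {file_digest(p) for p in archive_dir.rglob("*") if p.is_file()}
    added = []
    for src in sorted(export_dir.rglob("*")):
        if not src.is_file():
            continue
        digest = file_digest(src)
        if digest in known:
            continue  # identical content is already archived
        dest = archive_dir / src.relative_to(export_dir)
        if dest.exists():
            # Same path but different content: keep both versions by
            # adding a short digest to the new file's name.
            dest = dest.with_name(f"{dest.stem}.{digest[:8]}{dest.suffix}")
        dest.parent.mkdir(parents=True, exist_ok=True)
        shutil.copy2(src, dest)
        known.add(digest)
        added.append(dest)
    return added
```

Running `import_export` twice on the same export returns an empty list the second time. A later export that contains both old and new files adds only the new ones. Hashing the whole archive on every run is slow for large vaults, so a real importer would probably cache the digests, but the cache would not change the behavior.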