
Funny you say this. At my last job I managed a 1.5TB Perforce depot with hundreds of thousands of files and had the problem of “how can we speed up CI?”. We were on AWS, so I synced the repo, created an EBS snapshot and used that to make a volume, with the intention of reusing it (as we could shove build intermediates in there too).
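
Roughly the shape of it, as a boto3 sketch (not our actual code; the IDs, region and IOPS figure are placeholders):

    import boto3

    ec2 = boto3.client("ec2", region_name="us-east-1")

    # Snapshot the volume holding the synced workspace (hypothetical volume ID).
    snap = ec2.create_snapshot(
        VolumeId="vol-0123456789abcdef0",
        Description="p4 workspace + build intermediates",
    )
    ec2.get_waiter("snapshot_completed").wait(SnapshotIds=[snap["SnapshotId"]])

    # Later, stamp out a fresh volume for a new CI agent from that snapshot.
    vol = ec2.create_volume(
        SnapshotId=snap["SnapshotId"],
        AvailabilityZone="us-east-1a",
        VolumeType="io1",
        Iops=3000,
    )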

It was faster to just sync the workspace over the internet than it was to create the volume from the snapshot, and a clean build was quicker from the just-synced workspace than from the snapshotted one, presumably something to do with how EBS volumes work internally.

We just moved our build machines to the same VPC as the server and our download speeds were no longer an issue.




When you create an EBS volume from a snapshot, the content is streamed in from S3 on a pull-through basis. You can enable fast snapshot restore (FSR) on the snapshot, which makes volumes created from it fully initialized up front, but it’s an extra-cost option.
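
For reference, enabling it is a single call against the snapshot (a sketch; the snapshot ID and AZs are placeholders):

    import boto3

    ec2 = boto3.client("ec2", region_name="us-east-1")

    # Opt the snapshot into fast snapshot restore for the AZs the CI agents use.
    # Volumes created from it in those AZs are fully initialized at creation,
    # billed per snapshot, per AZ, per hour.
    ec2.enable_fast_snapshot_restores(
        AvailabilityZones=["us-east-1a", "us-east-1b"],
        SourceSnapshotIds=["snap-0123456789abcdef0"],
    )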


Yeah, this is exactly my point. Despite provisioning (and paying for) io1 SSDs, it doesn’t matter, because you’re still pulling the data through on demand over a network connection to access it.

It was faster to just not do any of this. At my current job we pay $200/mo for a single bare-metal server, and our CI is about 50% quicker than it was, for 20% of the price.


Hmm I don't know that making a new volume from a snap should fundamentally be faster than what a P4 sync could do. You're still paying for a full copy.

You could have possibly had existing volumes with mostly up-to-date workspaces. Then you're just paying for the attach time and the sync delta.
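
Something like the following is the pattern I mean (a rough sketch; the IDs and device path are placeholders, and the mount/p4 steps run on the agent itself):

    import subprocess
    import boto3

    ec2 = boto3.client("ec2", region_name="us-east-1")

    # Attach a pre-warmed workspace volume to the CI agent (hypothetical IDs;
    # device naming varies by instance type).
    ec2.attach_volume(
        VolumeId="vol-0123456789abcdef0",
        InstanceId="i-0123456789abcdef0",
        Device="/dev/sdf",
    )
    ec2.get_waiter("volume_in_use").wait(VolumeIds=["vol-0123456789abcdef0"])

    # Then, on the agent, mount it and only pay for the delta since last build.
    subprocess.run(["mount", "/dev/sdf", "/workspace"], check=True)
    subprocess.run(["p4", "sync"], cwd="/workspace", check=True)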


> I don't know that making a new volume from a snap should fundamentally be faster than what a P4 sync could do. You're still paying for a full copy.

My experience with running a C++ build farm in the cloud is that in theory all of this is true, but in practice it costs an absolute fortune and is painfully slow. At the end of the day it doesn’t matter if you’ve provisioned io1 storage; you’re still pulling it across something that vaguely resembles a SAN, and most of the operations that AWS performs are not as quick as you think they are. It took about 6 minutes to boot a Windows EC2 instance, for example. Our incremental build was actually quicker than that, so we spent more time waiting for the instance to start up and attach to our volume cache than we did actually running CI. The machines were expensive enough that we couldn’t justify keeping them running all day.
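
To make the waiting concrete: before a single compiler ran, something like the following had to finish (a sketch; the AMI and instance type are placeholders, not our actual config):

    import boto3

    ec2 = boto3.client("ec2", region_name="us-east-1")

    # Launch an on-demand Windows agent and block until it reports healthy;
    # this wait alone was routinely longer than an incremental build.
    run = ec2.run_instances(
        ImageId="ami-0123456789abcdef0",
        InstanceType="c5.4xlarge",
        MinCount=1,
        MaxCount=1,
    )
    instance_id = run["Instances"][0]["InstanceId"]
    ec2.get_waiter("instance_status_ok").wait(InstanceIds=[instance_id])
    # Only now can the cached workspace volume be attached and the build start.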

> You could have possibly had existing volumes with mostly up to date workspaces.

This is what we did for incremental builds. The problem was that when you want an extra instance, that volume needs to be created. We also saw roughly a 5x difference in speed (IIRC; this was 2021 when I set this up) between a no-op build on a freshly mounted volume and a no-op build in a workspace we had just built in.


I used to use FUSE and overlayfs for this. I’m not sure it still works well, as I’m not a build engineer and I only did it for myself.

It’s a lot faster in my case (a little over 3TiB for the latest revision only).
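
For anyone curious, the overlay half of that is a single mount (a sketch; the paths are placeholders, and the FUSE layer that faults files in on demand isn't shown):

    import subprocess

    # Read-only synced depot as the lower layer, per-build scratch as the upper;
    # writes land in upperdir, so the shared lower copy stays pristine.
    # workdir must be an empty dir on the same filesystem as upperdir.
    subprocess.run(
        [
            "mount", "-t", "overlay", "overlay",
            "-o", "lowerdir=/srv/p4-mirror,upperdir=/scratch/upper,workdir=/scratch/work",
            "/workspace",
        ],
        check=True,
    )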


There’s a service called p4vfs [0] which does this for p4. The problem we had with it at the time was that our build tool scanned everything (which was slow in and of itself), and that caused p4vfs to pull the files anyway. So it didn’t actually help.

[0] https://help.perforce.com/helix-core/server-apps/p4vfs/curre...


VMware?


What about it?



