Video about it here: https://developer.apple.com/videos/play/wwdc2025/346/ Looks...

OJFord · 2025-06-09T23:27:16 1749511636

The submission is about https://github.com/apple/containerization, not https://github.com/apple/container.

The former is for apps to ship with container sidecars (and cooler news IMO); the latter is 'I am a developer and I want to `docker run ...`'.

(Oh, and container has a submission here: https://news.ycombinator.com/item?id=44229239)

badc0ffee · 2025-06-10T01:17:01 1749518221

The former is the framework enabling Linux containers on lightweight VMs and the latter is a tool using that framework.

WhyNotHugo · 2025-06-10T10:40:02 1749552002

> Looks like each container gets its own lightweight Linux VM.

That sounds pretty heavyweight. A project with 12 containers will run 12 kernels instead of 1?

Curious to see metrics on this approach.

haiku2077 · 2025-06-10T14:18:33 1749565113

This is the approach used by Kata Containers/Firecracker. It's not much heavier than the shared kernel approach, but has significantly better security. An bug in the container runtime doesn't immediately break the separation between containers.

The performance overhead of the VM is minimal, the main tradeoffs is container startup time.

Yeroc · 2025-06-10T19:26:25 1749583585

I wonder why Apple cared so much about the security aspect to take the isolated VM approach versus shared VM approach. Seems unlikely that Apple hardware is going to be used to host containerized applications in production where this would be more of a concern. On the other hand, it's more likely to be used for development purposes where the memory overhead could be a bigger concern.

ghostly_s · 2025-06-10T20:06:27 1749585987

> Seems unlikely that Apple hardware is going to be used to host containerized applications in production

I imagine this is certainly happening already inside Apple datacenters.

haiku2077 · 2025-06-10T20:04:43 1749585883

One of the use cases for this feature is for macOS desktop apps to run Linux sidecars, so this needed to be secure for end user devices.

surajrmal · 2025-06-10T14:53:40 1749567220

Ram overhead can be nontrivial. Each kernel has its own page cache.

haiku2077 · 2025-06-10T15:01:44 1749567704

On a non Linux OS that should be offset by being able to allocate RAM separately to each container instead of the current approach in Docker Desktop where a static slice of your system memory is always allocated to the Docker VM.

fpoling · 2025-06-10T18:41:30 1749580890

This a feature targeting developers or perhaps apps running on end-user machine where page cache sharing between applications or container does not typically get much of RAM saving.

Linux kernel overhead itself while non-trivial is still very manageable in those settings. AWS Nitro stripped down VM kernel is about 40 MB, I suppose for Apple solution it will be similar.

arijun · 2025-06-10T11:19:12 1749554352

Is that not the premise of docker?

rtkwe · 2025-06-10T11:52:25 1749556345

No it's the opposite, the entire premise of Docker over VMs is that you run one instance of all the OS stuff that's shared so it takes less resources than a VM and the portable images are smaller because they don't contain the OS image.

dwaite · 2025-06-10T16:50:28 1749574228

The premise is containerization, not necessarily particular resource usage by the host running the containers.

For hosted services, you want to choose - is it worth running a single kernel with a lot of containers for the cost savings from shared resources, or isolate them by making them different VMs. There are certainly products for containers which lean towards the latter, at least by default.

For development it matters a lot less, as long as the sum resources of containers you are planning to run don't overload the system.

rtkwe · 2025-06-10T23:07:06 1749596826

The VM option is relatively new and the original idea was to provide that isolation without the weight of a VM. Also I'm not sure that docker didn't coin the word containerization, I've alway associated it with specifically the kind of packaging docker provides and don't remember it being mentioned around VMs.

pjmlp · 2025-06-10T19:55:44 1749585344

On Windows containers you can chose if the kernel is shared across containers or not, it in only on Linux containers mode that the kernel gets shared.

WhyNotHugo · 2025-06-10T12:01:36 1749556896

Nope, docker uses the host's kernel, so there are zero additional kernels.

On non-Linux, you obviously need an additional kernel running (the Linux kernel). In this case, there are N additional kernels running.

quietbritishjim · 2025-06-10T12:10:35 1749557435

> On non-Linux, you obviously need an additional kernel running (the Linux kernel).

That seems to be true in practice, but I don't think it's obviously true. As WSL1 shows, it's possible to make an emulation layer for Linux syscalls on top of quite a different operating system.

capitol_ · 2025-06-10T13:30:52 1749562252

I would draw the opposite conclusion from the WSL1 attempt.

It was a strategy that failed in practice and needed to be replaced with a vm based approach.

The Linux kernel have a huge surface area with some subtle behavior in it. There was no economic way to replicate all of that and keep it up to date in a proprietary kernel. Specially as the VM tech is well established and reusable.

paulryanrogers · 2025-06-10T12:28:12 1749558492

WSL1 wasn't really a VM though? IIRC it was implementing syscalls over the Windows kernel.

quietbritishjim · 2025-06-10T12:49:05 1749559745

Indeed, WSL1 isn't a VM. As I said, it's just:

> an emulation layer for Linux syscalls on top of quite a different operating system.

My point was that, in principle, it could be possible to implement Linux containers on another OS without using VMs.

However, as you said (and so did I), in practice no one has. Probably because it's just not worth the effort compared to just using a VM. Especially since all your containers can share a single VM, so you end up only running 2 kernels (rather than e.g. 11 for 10 containers). That's exactly how Docker on WSL2 works.

derekdb · 2025-06-10T16:30:06 1749573006

gVisor has basically re-implemented most of syscall api, but only when the host is also Linux.

ongy · 2025-06-10T12:42:48 1749559368

I think that's the point. You don't have to run the full kernel to run some linux tools.

Though I don't think it ever supported docker. And wasn't really expected to, since the entire namespaces+cgroup stuff is way deeper than just some surface level syscall shims.

asveikau · 2025-06-10T15:48:22 1749570502

And long before WSL, *BSD was doing this with the Linux syscall abi.

lloeki · 2025-06-10T20:21:33 1749586893

> On non-Linux, you obviously need an additional kernel running (the Linux kernel)

Only "obvious" for running Linux processes using Linux container facilities (cgroups)

Windows has its own native facilities allowing Windows processes to be containerised. It just so happens that in addition to that, there's WSL2 at hand to run Linux processes (containerised or not).

There is nothing preventing Apple to implement Darwin-native facilities so that Darwin processes would be containerised. It would actually be very nice to be able to distribute/spin up arbitrary macOS environments with some minimal CLI + CLT base† and run build/test stuff without having to spawn full-blown macOS VMs.

† "base" in the BSD sense.

karel-3d · 2025-06-10T13:03:57 1749560637

eh docker desktop nowadays runs VMs even on Linux

speedgoose · 2025-06-10T13:17:43 1749561463

Docker Desktop is non free proprietary software that isn’t very good anyway.

detaro · 2025-06-10T11:20:38 1749554438

AdamN · 2025-06-10T12:53:56 1749560036

I could imagine one Linux kernel running in a VM (on top of MacOS) and then containers inside that host OS. So 1 base instance (MacOS), 1 hypervisor (Linux L0), 12 containers (using that L0 kernel).

haiku2077 · 2025-06-10T14:19:30 1749565170

That's how Docker Desktop for Mac works. With Apples approach you have 12 VMs with 12 Linux kernels.

paxys · 2025-06-09T23:11:54 1749510714

Also works on macOS 15, but they mentioned that some networking features will be limited.

philips · 2025-06-10T21:58:55 1749592735

Shoutout to Michael Crosby, the person in this video, who was instrumental in getting Open Containers (https://opencontainers.org) to v1.0. He was a steady and calm force through a very rocky process.

discohead · 2025-06-10T22:13:16 1749593596

"A new report from Protocol today details that Apple has gone on a cloud computing hiring spree over the last few months... Michael Crosby, one of a handful of ex-Docker engineers to join Apple this year. Michael is who we can thank for containers as they exist today. He was the powerhouse engineer behind all of it, said a former colleague who asked to remain anonymous."

https://9to5mac.com/2020/05/11/apple-cloud-computing/

musicale · 2025-06-12T04:49:26 1749703766

We can thank the linux kernel developers for implementing namespaces and overlayfs.

And we can thank predecessor systems like BSD jails, Solaris zones, as well as Virtuozzo/openVZ and lxc as previous container systems on linux.

Docker's main improvements over lxc, as I understand it, were adding a layered, immutable image format (vs. repurposing existing VM image formats) and a "free" public image repository.

But the userspace implementation isn't exactly rocket science, which is why we periodically see HN posts of tiny systems that can run docker images.

solarexplorer · 2025-06-10T12:14:13 1749557653

I would assume that "lightweight" in this case means that they share a single Linux kernel. Or that there is an emulation layer that maps the Linux Kernel API to macOS. In any case, I don't think that they are running a Linux kernel per container.

ylk · 2025-06-10T13:09:30 1749560970

You don’t have to assume, the docs in the repo tell you that it does run a Linux kernel in each VM. It’s one container per VM.

solarexplorer · 2025-06-10T16:35:09 1749573309

Good call, thanks for clarifying!

commandersaki · 2025-06-10T21:31:24 1749591084

"Lightweight" in the sense that the VM contains one static executable that runs the container, and not a full fledged Ubuntu VM (e.g. Colima).

selkin · 2025-06-10T08:38:26 1749544706

It seems to work on macOS 15 as well, with some limitations[0].

[0] https://github.com/apple/container/blob/main/docs/technical-...

zmmmmm · 2025-06-10T01:14:27 1749518067

interesting choice - doesn't that then mean that container to container integration is going to be harder and a lot of overhead per-container? I would have thought a shared VM made more sense. I wonder what attracted them to this.

pxc · 2025-06-10T03:04:35 1749524675

It seems great from a security perspective, and a little bit nice from a networking perspective.

avidphantasm · 2025-06-10T12:43:32 1749559412

The "one IP per container" approach (instead of shared IPs) is similar to how kubernetes pods work.

mickdarling · 2025-06-10T12:07:05 1749557225

I can see the decision to do it this way being related to their private secure cloud infrastructure for AI tools.

JoBrad · 2025-06-10T12:27:00 1749558420

I like the security aspect. Maybe DNS works, and you can use that for communication between containers?

honkycat · 2025-06-10T20:29:05 1749587345

> Looks like each container gets its own lightweight Linux VM.

We're through the looking glass here, people

musicale · 2025-06-12T04:43:22 1749703402

"Containers" now apparently means "boot a docker image as an ephemeral VM."

Which isn't such a bad idea really.

zoobab · 2025-06-10T06:28:10 1749536890

"Looks like each container gets its own lightweight Linux VM."

Not a container "as such" then.

How hard is it to emulate linux system calls?

teruakohatu · 2025-06-10T06:34:19 1749537259

> How hard is it to emulate linux system calls?

It’s doable but a lot more effort. Microsoft did it with WSL1 and abandoned it with WSL2.

tsimionescu · 2025-06-10T07:43:53 1749541433

Note that they didn't "do it" for WSL1, they started doing it, realized it is far too much work to cover eveything, and abandoned the approach in favor of VMs. It's not like WSL1 was a fully functioning Linux emulator on top of Windows, it was still very far from it, even though it could do many common tasks.

benwad · 2025-06-10T08:16:39 1749543399

I've always wondered why only Linux can do 'true' containers without VMs. Is there a good blog post or something I can read about the various technical hurdles?

NexRebular · 2025-06-10T08:56:40 1749545800

> I've always wondered why only Linux can do 'true' containers without VMs.

Solaris/illumos has been able to do actual "containers" since 2004[0] and FreeBSD has had jails even before that[1].

[0] https://www.usenix.org/legacy/event/lisa04/tech/full_papers/... [1] https://papers.freebsd.org/2000/phk-jails.files/sane2000-jai...

syhol · 2025-06-10T11:37:59 1749555479

Many OS's have their own (sometimes multiple) container technologies, but the ecosystem and zeitgeist revolves around OCI Linux containers.

So it's more cultural than technical. I believe you can run OCI Windows containers on Windows with no VM, although I haven't tried this myself.

bayindirh · 2025-06-10T08:50:21 1749545421

BSD can do BSD containers with Jails for more than a decade now?

Due to innate features of a container, it can be of the same OS of the host running on the system, since they have no kernel. Otherwise you need to go the VM route.

dwaite · 2025-06-10T16:58:49 1749574729

In this context (OCI containers) that seems very inaccurate. For instance, ocijail is a two year old project still considered experimental.

soupbowl · 2025-06-10T17:47:03 1749577623

FreeBSD has beta podman (OCI) support right now, using freebsd base images not Linux. It is missing some features but coming along.

notpushkin · 2025-06-10T08:20:23 1749543623

Windows can do “true” containers, too. These containers won’t run Linux images, though.

dijit · 2025-06-10T11:00:16 1749553216

Can it? As far as I understood windows containers required Hyper-V and the images themselves seem to contain an NT kernel.

Not that it helps them run on any other Windows OS other than the version they were built on, it seems.

noisem4ker · 2025-06-10T11:12:29 1749553949

Source?

The following piece of documentation disagrees:

https://learn.microsoft.com/en-us/virtualization/windowscont...

> Containers build on top of the host operating system's kernel (...), and contain only apps and some lightweight operating system APIs and services that run in user mode

> You can increase the security by using Hyper-V isolation mode to isolate each container in a lightweight VM

pjmlp · 2025-06-10T20:00:12 1749585612

Yes, it is based on Windows Jobs API.

Additionally you can decide if the images contain the kernel, or not.

There is nothing in OS containers that specifies the golden rule how the kernel sharing takes place.

Remember containers predate Linux.

tsimionescu · 2025-06-10T09:48:27 1749548907

I'm not sure about MacOS, but otherwise all major OSs today can run containers natively. However, the interest in non-Linux containers is generally very very low. You can absolutely run Kubernetes as native Windows binaries [0] in native Windows containers, but why would you?

Note that containers, by definition, rely on the host OS kernel. So a Windows container can only run Windows binaries that interact with Windows syscalls. You can't run Linux binaries in a Windows container anymore than you can run them on Windows directly. You can run Word in a Windows container, but not GCC.

[0] https://learn.microsoft.com/en-us/virtualization/windowscont...

kcoddington · 2025-06-10T11:36:56 1749555416

I wouldn't think there are many use cases for Windows, but I imagine supporting legacy .NET Framework apps would be a major one.

tsimionescu · 2025-06-10T14:04:35 1749564275

Is there any limitation in running older.NET Framework on current Windows? Back when I was using it, you could have multiple versions installed at the same time, I think.

pjmlp · 2025-06-10T19:58:24 1749585504

You can, but there are companies that also want to deploy different kinds of Windows software into Kubernetes clusters and so.

Some examples would be Sitecore XP/XM, SharePoint, Dynamics deployments.

ownagefool · 2025-06-10T08:35:19 1749544519

Containers are essentially just a wrapper tool for a linux kernel feature called cgroups, with some added things such as layered fs and the distribution method.

You can also use just use cgroups with systemd.

Now, you could implement something fairly similar in each OS, but you wouldn't be able to use the vast majority of contained software, because it's ultimately linux software.

xrisk · 2025-06-10T08:50:04 1749545404

cgroups is for controlling resource allocation (CPU, RAM, etc). What you mean is probably namespaces.

ownagefool · 2025-06-10T10:32:03 1749551523

It's technically both I guess, but fair correction.

dwaite · 2025-06-10T16:55:33 1749574533

Every OS can theoretically do 'true' containers without VMs - for containers which match the host platform.

You can have Windows containers running on Windows, for instance.

Containers themselves are a packaging format, and do rather little to solve the problem of e.g. running Linux-compiled executables on macOS.

anthk · 2025-06-10T11:32:50 1749555170

Containers don't virtualize, just separate environments.

NexRebular · 2025-06-10T08:19:28 1749543568

> How hard is it to emulate linux system calls?

FreeBSD has linuxulator and illumos comes with lx-zones that allow running some native linux binaries inside a "container". No idea why Apple didn't go for similar option.

citrin_ru · 2025-06-10T08:58:43 1749545923

FreeBSD Linux emulation is being developed for 20 (may be even 30) years. While Apple can throw some $$$ to get it implemented in a couple years using virtualisation requires much less development time (so it’s cheaper).

rcleveng · 2025-06-10T14:17:57 1749565077

Apple's already got the Virtualization framework and hypervisor already (https://developer.apple.com/documentation/virtualization), so adding the rest of the container ecosystem seems like a natural next step.

It puts them on par with Windows that has container support with a free option, plus I imagine it's a good way to pressure test swift as a language to make sure it really can be the systems programming language they are betting that it can and will be.

OrbStack has a great UX and experience, so I imagine this will eat into Docker Desktop on Mac more than OrbStack.

masklinn · 2025-06-10T11:20:06 1749554406

Because that‘s a huge investment for something they have no reason or desire to productivize.

surajrmal · 2025-06-10T14:58:38 1749567518

syscalls are just a fraction of the surface area. There are many files in many different vfs you need to implement, things like selinux and ebpf, iouring, etc. It's also a constantly shifting target. The VM API is much simpler, relatively stable, and already implemented.

Emulating Linux only makes sense on devices with constrained resources.

throwaway1482 · 2025-06-10T16:28:57 1749572937

> How hard is it to emulate linux system calls?

Just replace the XNU kernel with Linux already.