Automatic1111 and AI Aggregators

Mar 11, 2023

If you've played around with Stable Diffusion and the surrounding technology (ControlNet, outpainting, inpainting, CLIP, LoRa), you might have used this Stable Diffusion Web UI by GitHub user Automatic1111. New research papers and libraries are near instantly implemented to use in this UI. It's made to be run locally, although some run it in a Hugging Face space (an aggregator in an aggregator). Some interesting observations:

Why not run Stable Diffusion directly?

In the early weeks of the model being released, this is what most people did. A series of forks (like TheLastBen/fast-stable-diffusion) added different features – adding macOS GPU acceleration or different memory optimizations to run it on end-user hardware. There were many forks that simply copied patches from each other (see this long thread on M1 support). It was a race of who could integrate the patches the quickest.

Why did Automatic1111's web UI win?

Web UI. There were two other popular local Stable Diffusion UIs. The first, DiffusionBee, is an electron application. The UI itself was clunky but better than using a local notebook and invoking the Python program yourself. Electron applications take a lot of memory, so anecdotally, it felt slower than Web-UI-based methods (although it's all just Chrome?). It was a lot slower to add new features as well.

Fast Updates. cmdr2/stable-diffusion-ui was another Web-UI tool.  You run the web server locally, and it serves the model. Automatic1111's Web UI uses Gradio (see Cheap UIs). This doesn't make for the most visually appealing display (the UI is filled with sliders, radio buttons and looks like a control panel), but it is (1) consistent and (2) quick to implement.

Both maintainers have been working nonstop to incorporate new features and techniques into their UIs. You can see from the contribution graphs that both projects see a healthy number of commits and contributors.

Maybe the UI framework is the differentiator? cmdr2 decided not to use gradio early on. Maybe it's just compounding network effects? Or maybe it's social media coverage (YouTubers and other tutorial writers seemingly choose Automatic1111's UI more often).

.