This page collects notes and links on AI image generation technologies, including Stable Diffusion, DALL-E, Midjourney, and other prominent generative image models.
Resources
Field | Category | Date | Link | Notes |
---|---|---|---|---|
3D Rendering | NeRF Tools | 2022 | nerfstudio | A tool for manipulating Neural Radiance Fields (NeRF) and rendering the scenes out as video |
General | Tools | 2024 | upscayl | An AI-based image upscaler |
Generative AI | 3D Asset Models | 2025 | Hunyuan3D-2 | A 3D textured asset generation model |
Generative AI | Frame Prediction | 2025 | FramePack | A next-frame (next-frame-section) prediction neural network structure that generates videos progressively |
Generative AI | Image Generation Models | 2025 | Lumina-Image-2.0 | Another image generation model |
Generative AI | Video Generation Models | 2024 | HunyuanVideo | A pretty impressive open source video generation model |
Generative AI | Web Interface | 2025 | gpt-image-1-playground | A simple web-based interface for generating images using GPT-IMAGE-1 |
Image Generation | Apple Optimization | 2022 | CoreML Stable Diffusion | Apple’s optimizations for CoreML |
Image Generation | Diffusion Acceleration | 2024 | NitroFusion | A high-fidelity, fast (single-step) SDXL diffusion model |
Image Generation | Diffusion Acceleration | 2022 | Fast Stable Diffusion | Another tactic to accelerate inference |
Image Generation | Vector Graphics | 2025 | NeuralSVG | An Implicit Representation for Text-to-Vector Generation |
Image Generation | Web Interface | 2022 | Stable Diffusion WebUI | Nearly always the best, bleeding edge WebUI for SD |
Multi-modal Models | Samples | 2024 | ml-mgie | Instruction-based image self-editing |
Stable Diffusion | Apps | 2023 | swift-coreml-diffusers | Hugging Face’s own app, using Swift and CoreML for Apple Silicon |
Stable Diffusion | Apps | 2022 | Draw Things | Pre-packaged app for iOS, downloads and allows re-use of .ckpt files |
Stable Diffusion | Apps | 2022 | DiffusionBee | Pre-packaged app for macOS (M1 and Intel) |
Stable Diffusion | CGI | 2023 | Blender-ControlNet | A Blender plugin to generate ControlNet inputs for posing figures |
Stable Diffusion | CGI | 2022 | dream-textures | A Blender plugin for texturing models based on a text description |
Stable Diffusion | Implementations | 2023 | OnnxStream | Stable Diffusion XL 1.0 Base on a Raspberry Pi Zero 2 (or in 298MB of RAM) |
Stable Diffusion | Libraries | 2024 | sd4j | A Java library for Stable Diffusion that uses ONNX |
Stable Diffusion | Models | 2024 | SDXL-Lightning | An SDXL flavor that works on only a few steps |
Stable Diffusion | Models | 2023 | Upscale Model Database | Too wide a choice, perhaps |
Stable Diffusion | Reference | 2024 | comflowy | A set of reference workflows and documentation for ComfyUI |
Stable Diffusion | Reference | 2024 | flux | Minimal inference examples for FLUX.1 models |
Stable Diffusion | Tools | 2024 | comflowyspace | A ComfyUI desktop wrapper |
Stable Diffusion | Tools | 2023 | ComfyUI-AnimateDiff-Evolved | An AnimateDiff integration for ComfyUI |
Stable Diffusion | Tools | 2023 | ComfyUI | Pretty impressive node-based UI |
Stable Diffusion | Tools | 2023 | InvokeAI | A polished UI |
Stable Diffusion | Tools | 2023 | stable-diffusion.cpp | Stable Diffusion inference on the CPU, in pure C++ |
Stable Diffusion | Tools | 2023 | ComfyUI-Manager | A component manager for ComfyUI |
Stable Diffusion | Tools | 2023 | Opendream | A layer-oriented, non-destructive editor |
Stable Diffusion | Tools | 2022 | imaginAIry | Works well on Apple Silicon; a pure CLI for all SD models. Does not reuse .ckpt files, however, so it requires a separate disk cache. |