May 1^st 2025 · 2 min read · #ai #dall-e #generative #image #midjourney #stablediffusion

AI Image Generation

This page is a collection of notes and links related to AI image generation technologies, including Stable Diffusion, DALL-E, Midjourney, and other generative image models that have become prominent in the modern AI space.

Resources

Field	Category	Date	Link	Notes
3D Rendering	NeRF Tools	2022	nerfstudio	A tool for manipulating Neural Radiance Fields (NeRF) and rendering the scenes out as video
General	Tools	2024	upscayl	an AI-based image upscaler
Generative AI	3D Asset Models	2025	Hunyuan3D-2	a 3D textured asset generation model
	Diffusion Models		miniDiffusion	a minimalistic implementation of diffusion models in Python
	Frame Prediction		FramePack	a next-frame (next-frame-section) prediction neural network structure that generates videos progressively.
	Image Generation Models		Lumina-Image-2.0	Another image generation model
	Video Generation Models	2024	HunyuanVideo	A pretty impressive open source video generation model
	Web Interface	2025	gpt-image-1-playground	a simple web-based interface for generating images using GPT-IMAGE-1
Image Generation	Apple Optimization	2022	CoreML Stable Diffusion	Apple’s optimizations for CoreML
	Diffusion Acceleration	2024	NitroFusion	a high-fidelity, fast (single-step) SDXL diffusion model
	Diffusion Acceleration	2022	Fast Stable Diffusion	Another tactic to accelerate inference
	Vector Graphics	2025	NeuralSVG	An Implicit Representation for Text-to-Vector Generation
	Web Interface	2022	Stable Diffusion WebUI	Nearly always the best, bleeding edge WebUI for SD
Multi-modal Models	Samples	2024	ml-mgie	instruction-based image self-editing
Stable Diffusion	Apps	2023	swift-coreml-diffusers	Hugging face’s own app, using Swift and CoreML for Apple Silicon
		2022	Draw Things	Pre-packaged app for iOS, downloads and allows re-use of .ckpt files.
		2022	DiffusionBee	Pre-packaged app for macOS (M1 and Intel)
	CGI	2023	Blender-ControlNet	A Blender plugin to generate ControlNet inputs for posing figures
	CGI	2022	dream-textures	A Blender plugin for texturing models based on a text description.
	Implementations	2023	OnnxStream	Stable Diffusion XL 1.0 Base on a Raspberry Pi Zero 2 (or in 298MB of RAM)
	Libraries	2024	sd4j	a Java library for Stable Diffusion that uses ONNX
	Models	2024	SDXL-Lightning	an SDXL flavor that works on only a few steps
	Models	2023	Upscale Model Database	Too wide a choice, perhaps
	Reference	2024	comflowy	a set of reference workflows and documentation for ComfyUI
	Reference		flux	minimal inference examples for FLUX.1 models
	Tools		comflowyspace	a ComfyUI desktop wrapper
		2023	ComfyUI-AnimateDiff-Evolved	An AnimateDiff integration for ComfyUI
			ComfyUI	pretty impressive node-based UI
			InvokeAI	A polished UI
			stable-diffusion.cpp	stable diffusion inference on the CPU, in pure C++
			ComfyUI-Manager	A component manager for ComfyUI
			Opendream	A layer-oriented, non-destructive editor
		2022	imaginAIry	Works well on Apple Silicon, pure CLI interface to all SD models. Does not reuse `.ckpt` files, however, so requires separate disk cache.
Upscaling	Video Enhancement	2025	video2x	a tool for upscaling videos using RealESRGAN and other models

← The Kingroon KP3S Pro (V1), Two Years Later On The Apple U.S. App Guidelines Update →

This page is referenced in:

The Great AI Breakdown • May 1^st 2025
2024 in Review • Dec 23^rd 2024
Artificial Intelligence • Jan 22^nd 2004