Restyle while a game is running: ComfyUI --lowvram profile (coexist with untracked GPU VRAM) #16
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Context
#15 landed the GPU-lease integration: restyle now goes through the mGPUmanager broker, evicts the idle AI services (mvoice/whisper/ollama), and runs FLUX with the GPU lock held across the whole cycle. Verified live.
But when an untracked GPU app (e.g. Baldur's Gate 3, ~3 GB) is running, FLUX (~13 GB) still does not fit even after evicting every managed consumer (13 + 3 + 1 reserved > 16 GB). The broker correctly returns a clean
insufficient_vraminstead of OOMing, but no image is produced. So restyle-while-gaming is still blocked.Goal
Let restyle coexist with a running game by shrinking FLUX's VRAM footprint, trading speed for fit.
Approach (from #15 design doc section 6)
ComfyUI picks its VRAM mode at process start, so this is a launch-flag / second-profile decision, not per-request. Options to evaluate:
--lowvramComfyUI instance on mRock (offloads weights to system RAM; small VRAM footprint, slower) + a secondimagen.yamlbackend pointing at it with a lowervram_resident_mib.flux-schnell-local; oninsufficient_vramfrom the broker, automatically retry against the low-vram profile. (The client already distinguishesinsufficient_vram—ErrBrokerInsufficientVRAM— so a retry hook is feasible.)vram_resident_mibso the lease can actually grant alongside the game.Out of scope
Refs
docs/design-broker-gpu-lease.mdsection 6