Model weights: https://huggingface.co/Comfy-Org/Lens
PR: https://github.com/Comfy-Org/ComfyUI/pull/14077
If you’re in a hurry and don’t want to wait for the merge, fetch and check out the pull request branch yourself:
git fetch origin pull/14077/head:pr-14077
git checkout pr-14077
Supported Resolutions (Width × Height):
Base resolution = 1024

| Aspect Ratio | Resolution (width × height) |
| --- | --- |
| 1:2 | 736 × 1472 |
| 9:16 | 768 × 1376 |
| 2:3 | 832 × 1248 |
| 3:4 | 864 × 1152 |
| 1:1 | 1024 × 1024 |
| 4:3 | 1152 × 864 |
| 3:2 | 1248 × 832 |
| 16:9 | 1376 × 768 |
| 2:1 | 1472 × 736 |
Base resolution = 1440 (default)

| Aspect Ratio | Resolution (width × height) |
| --- | --- |
| 1:2 | 1040 × 2080 |
| 9:16 | 1088 × 1936 |
| 2:3 | 1168 × 1760 |
| 3:4 | 1216 × 1616 |
| 1:1 | 1440 × 1440 |
| 4:3 | 1616 × 1216 |
| 3:2 | 1760 × 1168 |
| 16:9 | 1936 × 1088 |
| 2:1 | 2080 × 1040 |
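
The tables only list the supported sizes and I don't know the formula behind them, so if you want to pick a size from a script, a plain lookup is probably the safest bet. Here's a rough Python sketch. `SUPPORTED`, `resolution_for`, and `closest_supported` are names I made up for illustration; they are not part of ComfyUI or the model.

```python
import math

# (width, height) pairs copied from the tables above, keyed by base resolution.
SUPPORTED = {
    1024: {
        "1:2": (736, 1472), "9:16": (768, 1376), "2:3": (832, 1248),
        "3:4": (864, 1152), "1:1": (1024, 1024), "4:3": (1152, 864),
        "3:2": (1248, 832), "16:9": (1376, 768), "2:1": (1472, 736),
    },
    1440: {
        "1:2": (1040, 2080), "9:16": (1088, 1936), "2:3": (1168, 1760),
        "3:4": (1216, 1616), "1:1": (1440, 1440), "4:3": (1616, 1216),
        "3:2": (1760, 1168), "16:9": (1936, 1088), "2:1": (2080, 1040),
    },
}


def resolution_for(aspect_ratio, base=1440):
    """Return the listed (width, height) for a label such as '4:3'."""
    try:
        return SUPPORTED[base][aspect_ratio]
    except KeyError:
        raise ValueError(f"no listed size for base={base}, ratio={aspect_ratio!r}") from None


def closest_supported(width, height, base=1440):
    """Return (label, (width, height)) of the listed size nearest in aspect ratio.

    Ratios are compared in log space so that portrait and landscape
    mismatches are treated symmetrically.
    """
    target = math.log(width / height)
    table = SUPPORTED[base]
    label = min(table, key=lambda k: abs(math.log(table[k][0] / table[k][1]) - target))
    return label, table[label]
```

For example, `resolution_for("4:3")` gives `(1616, 1216)`, and `closest_supported(1920, 1080)` lands on the 16:9 entry, `(1936, 1088)`. Every listed width and height is a multiple of 16, which you can check with `all(v % 16 == 0 for t in SUPPORTED.values() for wh in t.values() for v in wh)`.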
It works pretty well with JSON prompts. I tested it with some shitty ones I had lying around.
Example prompt:
{ "language": "en", "main_subject": { "description": "An anthropomorphic European badger with distinct black and white facial stripes, wearing a faded navy blue oversized hoodie and baggy corduroy pants. It is slumped deeply into a worn-out beanbag chair, holding a Super Nintendo (SNES) controller with intense focus. Its badger feet poke out from the pant cuffs.", "count": 1, "position": "center frame, low angle sitting" }, "secondary_elements": [ { "description": "A glowing CRT television displaying a pixelated 16-bit game (e.g., Street Fighter II).", "relation_to_main": "in front of the badger, providing light" }, { "description": "Empty soda cans, snack wrappers, and game cartridges scattered on a shag carpet.", "relation_to_main": "surrounding the beanbag" } ], "environment": { "description": "A cluttered, finished basement with wood-paneled walls. Band posters (Nirvana, Pearl Jam) are taped to the walls. The room is dimly lit by the TV and a single floor lamp.", "background_style": "cluttered domestic interior" }, "composition": "candid snapshot, slightly messy framing", "style": { "medium": "photograph", "artist_or_reference": "1990s amateur film photography, snapshot aesthetic", "aesthetic_qualities": [ "grainy", "lo-fi", "flash-lit", "nostalgic", "grunge" ] }, "photographic_details": { "lighting": "direct on-camera flash mixed with CRT glow, creating harsh shadows", "camera_shot": "medium shot", "lens_and_film": "35mm film point-and-shoot, high ISO grain, poor color rendition" }, "text_elements": [ { "text": "'93", "language": "en", "placement": "bottom right corner, burnt into the film", "style": "orange digital date stamp font" } ], "aspect_ratio": "4:3", "negative_prompt": "high definition, modern technology, flatscreen TV, clean room, bright studio lighting, CGI fur" }
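
If you'd rather generate prompts like this than hand-edit one long line, something along these lines should work. It's a minimal Python sketch: the key names are copied from the example above, but I don't know whether the model treats any of them specially, and `build_prompt` is a hypothetical helper, not anything from ComfyUI.

```python
import json


def build_prompt(subject, environment, aspect_ratio="4:3", negative_prompt=""):
    """Build a single-line JSON prompt string shaped like the example above.

    Only a subset of the example's keys is filled in; whether the model
    needs any particular key is an assumption, not something documented here.
    """
    prompt = {
        "language": "en",
        "main_subject": {"description": subject, "count": 1},
        "environment": {"description": environment},
        "style": {"medium": "photograph"},
        "aspect_ratio": aspect_ratio,
        "negative_prompt": negative_prompt,
    }
    # ensure_ascii=False keeps non-ASCII characters readable instead of \u escapes.
    return json.dumps(prompt, ensure_ascii=False)


text = build_prompt(
    subject="An anthropomorphic European badger playing a SNES",
    environment="A cluttered basement with wood-paneled walls",
    negative_prompt="modern technology, flatscreen TV",
)
print(text)  # paste this string into the text prompt field
```

Going through `json.dumps` means quotes inside descriptions get escaped for you, so the result always parses back with `json.loads`.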