Categories: Image

Depth Anything 3: Recovering the Visual Space from Any Views (code and model available). Lots of examples on the project page.

Project page: https://depth-anything-3.github.io/
Paper: https://arxiv.org/pdf/2511.10647
Demo: https://huggingface.co/spaces/depth-anything/depth-anything-3
Github: https://github.com/ByteDance-Seed/depth-anything-3

Depth Anything 3 is a single transformer model trained exclusively for joint any-view depth and pose estimation via a specially chosen ray representation. It reconstructs the visual space, producing consistent depth and ray maps that can be fused into accurate point clouds, yielding high-fidelity 3D Gaussians and geometry. It significantly outperforms VGGT in multi-view geometry and pose accuracy; with monocular inputs, it also surpasses Depth Anything 2 while matching its detail and robustness.
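To make the "depth + ray maps fused into point clouds" idea concrete, here is a minimal sketch of that fusion step in plain NumPy. This is not the official Depth Anything 3 API; the function name, array shapes, and the assumption that rays are given as per-pixel origins and directions in a shared world frame are illustrative only.

```python
# Minimal sketch (not the official DA3 API): fusing a predicted depth map
# with a predicted per-pixel ray map into a world-space point cloud.
import numpy as np

def fuse_depth_and_rays(depth, ray_origins, ray_dirs):
    """Lift a depth map into 3D points along per-pixel rays.

    depth:       (H, W)    predicted depth per pixel
    ray_origins: (H, W, 3) per-pixel ray origin (camera center, world frame)
    ray_dirs:    (H, W, 3) per-pixel ray direction in the same world frame
    returns:     (H*W, 3)  world-space point cloud
    """
    # Normalize directions defensively, then march each ray by its depth.
    dirs = ray_dirs / np.linalg.norm(ray_dirs, axis=-1, keepdims=True)
    points = ray_origins + depth[..., None] * dirs
    return points.reshape(-1, 3)

# Toy usage: predictions from multiple views concatenate into one
# consistent cloud because depth and rays share a single world frame.
H, W = 4, 4
depth = np.ones((H, W))
origins = np.zeros((H, W, 3))
dirs = np.dstack([np.zeros((H, W, 2)), np.ones((H, W, 1))])  # all rays +z
cloud = fuse_depth_and_rays(depth, origins, dirs)
print(cloud.shape)  # (16, 3)
```

Because every view's depth and ray maps live in one consistent frame, per-view clouds can simply be stacked, which is what makes the downstream 3D Gaussian and geometry reconstruction straightforward.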

submitted by /u/AgeNo5351

Published by AI Generated Robotic Content
Tags: ai images
