ID4gotten t1_iv77qkz wrote on November 5, 2022 at 9:00 PM

I can't wait to visit the beigeverse

SpatialComputing OP t1_iv6jn6t wrote on November 5, 2022 at 6:12 PM

>In order for learning systems to be able to understand and create 3D spaces, progress in generative models for 3D is sorely needed. The quote "The creation continues incessantly through the media of humans." is often attributed to Antoni Gaudí, who we pay homage to with our method’s name. We are interested in generative models that can capture the distribution of 3D scenes and then render views from scenes sampled from the learned distribution. Extensions of such generative models to conditional inference problems could have tremendous impact in a wide range of tasks in machine learning and computer vision. For example, one could sample plausible scene completions that are consistent with an image observation, or a text description (see Fig. 1 for 3D scenes sampled from GAUDI). In addition, such models would be of great practical use in model-based reinforcement learning and planning [12], SLAM [39], or 3D content creation.
>
>We introduce GAUDI, a generative model capable of capturing the distribution of complex and realistic 3D scenes that can be rendered immersively from a moving camera. We tackle this challenging problem with a scalable yet powerful approach, where we first optimize a latent representation that disentangles radiance fields and camera poses. This latent representation is then used to learn a generative model that enables both unconditional and conditional generation of 3D scenes. Our model generalizes previous works that focus on single objects by removing the assumption that the camera pose distribution can be shared across samples. We show that GAUDI obtains state-of-the-art performance in the unconditional generative setting across multiple datasets and allows for conditional generation of 3D scenes given conditioning variables like sparse image observations or text that describes the scene.

https://github.com/apple/ml-gaudi

RobbinDeBank t1_iv74pcy wrote on November 5, 2022 at 8:39 PM

Seems like we will see an Apple VR headset coming soon

Mefaso t1_iv9eshj wrote on November 6, 2022 at 8:46 AM

Apple publishing research?

lucellent t1_iva1sf7 wrote on November 6, 2022 at 1:22 PM

They've been doing it for some time now, but there's not a lot of it. I remember seeing their paper about text2image generation, something similar to DALL-E 2.

bakochba t1_iv8jfyq wrote on November 6, 2022 at 2:52 AM

I wonder how much memory and time it takes to train models like this

nemesit t1_ivbgb8e wrote on November 6, 2022 at 7:08 PM

Sounds great for games with destructible environments, where building interiors could just be AI-generated when needed

[R] APPLE research: GAUDI — a neural architect for immersive 3D scene generation
