I'm not familiar at all with the topic (nor have I read this particular paper), but I remember that the original 3DGS paper took pride in the fact that this was not "AI" or "deep learning". There's still a gradient descent process to get the Gaussian splats from the data, but as I understood it, there is no "train on a large dataset, then run inference": building the GS from your data is the "training phase", and rendering it is the equivalent of inference.
Maybe I understood it all wrong though, or maybe new variants of Gaussian splatting use a deep learning network in addition to what was done in the original work, so I'll be happy to be corrected/clarified by someone with actual knowledge here.
The output of Gaussian splat "training" is a set of 3D Gaussians, which can be rendered very quickly. No ML involved at all (only per-scene optimisation)!
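To make that concrete, here's a toy sketch (hypothetical, nothing like the real renderer) of what that per-scene optimisation looks like: fitting a handful of isotropic 2D Gaussians to a single target image with plain gradient descent in PyTorch. The real 3DGS code optimises anisotropic 3D Gaussians against posed photos with a differentiable tile rasteriser, plus densification and pruning, but the key point is the same — the "model" you end up with is just the list of Gaussian parameters, not learned network weights.

    import torch

    H = W = 64
    N = 50  # number of Gaussians in the toy scene

    ys, xs = torch.meshgrid(torch.arange(H, dtype=torch.float32),
                            torch.arange(W, dtype=torch.float32), indexing="ij")

    # synthetic target image: a bright disc on a dark background
    target = (((xs - W / 2) ** 2 + (ys - H / 2) ** 2) < (W / 4) ** 2).float()

    # per-Gaussian parameters: centre, log-scale, opacity logit, intensity
    mu     = (torch.rand(N, 2) * W).requires_grad_(True)
    log_s  = torch.full((N,), 1.5, requires_grad=True)
    alpha  = torch.zeros(N, requires_grad=True)
    colour = torch.rand(N).requires_grad_(True)
    params = [mu, log_s, alpha, colour]

    opt = torch.optim.Adam(params, lr=0.5)

    def render():
        # naive additive splatting (real 3DGS does depth-sorted alpha compositing)
        d2 = (xs[None] - mu[:, 0, None, None]) ** 2 + (ys[None] - mu[:, 1, None, None]) ** 2
        g = torch.exp(-d2 / (2 * torch.exp(log_s)[:, None, None] ** 2))
        w = torch.sigmoid(alpha)[:, None, None] * colour[:, None, None]
        return (w * g).sum(0).clamp(0, 1)

    for step in range(300):
        opt.zero_grad()
        loss = ((render() - target) ** 2).mean()
        loss.backward()
        opt.step()

    # the "trained model" is nothing but this list of per-Gaussian parameters
    print("final L2 loss:", loss.item())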
They usually require running COLMAP first (to estimate the relative camera poses between the different images), but NVIDIA's InstantSplat doesn't (it does, however, use an ML model instead!)
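For reference, the usual flow with the reference gaussian-splatting repo looks roughly like the sketch below. Script names and flags are from my memory of its README, so treat them as assumptions and check the repo before running.

    import subprocess
    from pathlib import Path

    # Rough sketch of the typical pipeline with graphdeco-inria/gaussian-splatting.
    # Paths and flags are assumptions based on the repo's README.

    scene = Path("my_scene")        # expects your photos in my_scene/input/
    out   = Path("my_scene_model")  # where the optimised Gaussians will land

    # 1) camera poses: convert.py wraps COLMAP to estimate camera intrinsics
    #    and extrinsics and to undistort the images
    subprocess.run(["python", "convert.py", "-s", str(scene)], check=True)

    # 2) per-scene optimisation ("training"): produces point_cloud.ply files
    #    containing the 3D Gaussians -- no network weights anywhere
    subprocess.run(["python", "train.py", "-s", str(scene), "-m", str(out)], check=True)

    # 3) render novel views from the optimised Gaussians
    subprocess.run(["python", "render.py", "-m", str(out)], check=True)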
We’ve been using pretty similar technology for decades in areas like RenderMan radiance caches before RIS.
None of the Gaussian splat repos I have looked at mention how to use the pre-trained models to "simply" take MY images as input and output a GS. They all talk about evaluation, but the command-line interface requires the eval datasets as input.
Is training/fine-tuning on my data the only way to get the output?