3D-UIR: 3D Gaussian for Underwater 3D Scene Reconstruction via Physics-Based Appearance-Medium Decoupling

1Nankai University 2Institute of Software, Chinese Academy of Sciences
3NKIARI, Shenzhen Futian 4DJI Co., Ltd
#Corresponding author
[Teaser figure]
Optical scattering and absorption in underwater environments pose unique challenges for novel view synthesis. The standard volume rendering equation does not model participating media with suspended particles, so volumetric water is incorrectly reconstructed as floating artifacts in the 3D representation (left). In addition, light-source directionality and viewing angle cause attenuation variations that make scene appearance inconsistent across viewpoints (right). 3DGS lacks a proper model of scattering media, so water-column effects are baked onto scene surfaces and depth estimates collapse. Existing scattering-aware NVS methods, e.g., SeaThru-NeRF, fail to account for dynamic photometric variations and thus introduce floating water artifacts (highlighted in yellow boxes). In contrast, our approach explicitly models the participating medium to render photorealistic novel views with accurate scene representation, yielding more consistent rendering across novel viewpoints and effective elimination of underwater artifacts.
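For reference, the compositing rule the caption alludes to is standard 3DGS alpha blending (notation standard in the 3DGS literature, not specific to this page):

C = \sum_{i=1}^{N} c_i \, \alpha_i \prod_{j=1}^{i-1} (1 - \alpha_j)

where c_i and \alpha_i are the color and opacity of the i-th depth-sorted Gaussian along the ray. Because every contribution is tied to a surface-like primitive, light scattered throughout the water column can only be explained by spurious floating Gaussians, which is exactly the artifact described above.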

Abstract

Novel view synthesis for underwater scene reconstruction presents unique challenges due to complex light-media interactions. While 3D Gaussian Splatting (3DGS) offers real-time rendering, it struggles in inhomogeneous underwater environments, where scattering media introduce artifacts and inconsistent appearance. In this study, we propose a physics-based framework that disentangles object appearance from water-medium effects through specialized Gaussian modeling. Our approach introduces appearance embeddings to enhance scene consistency, together with explicit medium representations for backscatter and attenuation. In addition, we propose a distance-guided optimization that improves geometric fidelity. By integrating these physics-inspired components through an underwater imaging model, our method achieves both high-quality novel view synthesis and physically accurate scene restoration. Experiments demonstrate significant improvements in rendering quality and restoration accuracy over existing methods. Our code will be made available upon acceptance.

Pipeline

[Pipeline figure]

Overview of 3D-UIR:

  • Underwater Appearance Modeling Branch (Yellow): incorporates appearance features and embeddings that are crucial for view consistency, ensuring accurate scene representation across perspectives.
  • Scatter Medium Modeling Branch (Red): handles backscatter and attenuation separately, the two key factors that shape object appearance in underwater environments.
  • Depth-guided Regularization Optimization (Blue): uses pseudo-depth maps to guide distance optimization, improving parameter estimation and overall depth accuracy in the reconstruction (a minimal sketch of such a regularizer follows this list).
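As a concrete illustration of the depth-guided branch, here is a minimal sketch of a pseudo-depth-guided regularizer; the function name and the scale/shift-invariant formulation are our assumptions for illustration, not the paper's exact loss:

```python
import torch

def depth_guidance_loss(rendered_depth: torch.Tensor,
                        pseudo_depth: torch.Tensor) -> torch.Tensor:
    """Align rendered depth with a monocular pseudo-depth map.

    Pseudo-depth from a monocular estimator is only meaningful up to an
    affine transform, so both maps are normalized (median shift,
    mean-absolute-deviation scale) before comparison.
    """
    def normalize(d: torch.Tensor) -> torch.Tensor:
        shift = d.median()
        scale = (d - shift).abs().mean().clamp(min=1e-6)
        return (d - shift) / scale

    # L1 distance between the normalized depth maps.
    return (normalize(rendered_depth) - normalize(pseudo_depth)).abs().mean()
```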

Physics-Based Integration: All components (UAM, SMD, and DRO) are integrated into a unified framework through a physics-based underwater image formation model during differentiable rasterization, allowing smooth, gradient-based optimization; a sketch of this formation model follows below. Our method effectively disentangles object appearance from water-medium effects using specialized Gaussian modeling.
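To make the integration concrete, below is a minimal sketch of a physics-based compositor following the widely used revised underwater image formation model (attenuated direct signal plus backscatter); tensor names and shapes are illustrative assumptions, not the authors' actual implementation:

```python
import torch

def compose_underwater(J: torch.Tensor,          # restored object radiance, (H, W, 3)
                       distance: torch.Tensor,   # camera-to-scene distance, (H, W, 1)
                       sigma_atn: torch.Tensor,  # per-channel attenuation coefficients, (3,)
                       sigma_bs: torch.Tensor,   # per-channel backscatter coefficients, (3,)
                       B_inf: torch.Tensor       # veiling-light (water) color, (3,)
                       ) -> torch.Tensor:
    """I = J * exp(-sigma_atn * d) + B_inf * (1 - exp(-sigma_bs * d)).

    The first term attenuates the object signal with distance; the second
    adds the water column's own scattered color, which saturates to B_inf
    as distance grows.
    """
    direct = J * torch.exp(-sigma_atn * distance)
    backscatter = B_inf * (1.0 - torch.exp(-sigma_bs * distance))
    return direct + backscatter
```

Under such a model, the composited output corresponds to the rendered underwater image, while the object term J alone corresponds to the restored, water-free appearance.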

Video Comparison

How to read the comparisons:
• Left: render video from our method (Ours) on the Japan scene.
• Right: RGB video from 3DGS on the same scene.
• RGB: the reconstructed image, presenting the reconstructed scene from raw sensor data.
• Clear: the restoration image, showing the optimized scene representation after descattering restoration.
Comparative Video Analysis: These results validate the effectiveness of our method in achieving high-quality RGB reconstruction and precise depth estimation. Compared to existing baselines, our approach demonstrates improved structural consistency and photometric accuracy, particularly in distant regions, where alternative methods tend to produce artifacts or exhibit depth collapse. These findings underscore the robustness of our framework against underwater visual degradation. Furthermore, our descattering restoration significantly improves visual clarity, enabling more faithful scene recovery under scattering-dominant conditions.

Quantitative Results Comparison

We evaluate our method on three datasets: SeaThru-NeRF, Underwater in the Wild (U-IW), and our simulated dataset (U-S). SeaThru-NeRF contains four forward-facing real underwater scenes with diverse aquatic and imaging conditions. U-IW consists of frames sampled from in-the-wild underwater videos collected from the Internet. Both datasets feature unbounded camera-to-scene distances. To further evaluate our method, we simulated underwater effects on four scenes from the Mip-NeRF 360 dataset.
Method           | SeaThru-NeRF            | U-IW                    | U-S                     | Speed
                 | PSNR↑  SSIM↑  LPIPS↓    | PSNR↑  SSIM↑  LPIPS↓    | PSNR↑  SSIM↑  LPIPS↓    | FPS↑    Training↓
SeaThru-NeRF     | 27.394 0.860  0.215     | 18.942 0.644  0.383     | 24.436 0.805  0.293     | 0.55    2 h 39 m
3DGS             | 26.188 0.859  0.238     | 27.361 0.894  0.158     | 29.274 0.881  0.233     | 149.36  17 m
Splatfacto-Wild  | 25.750 0.832  0.229     | 25.159 0.847  0.209     | 25.786 0.853  0.260     | 42.98   21 m
SeaSplat         | 27.385 0.866  0.194     | 27.023 0.889  0.159     | 28.566 0.861  0.253     | 42.69   1 h 25 m
RecGS            | 25.829 0.857  0.233     | 22.186 0.838  0.180     | 24.620 0.825  0.259     | 146.62  38 m
Water-Splatting  | 27.573 0.865  0.198     | 25.673 0.882  0.167     | 29.973 0.878  0.235     | 35.80   29 m
Ours             | 28.116 0.876  0.202     | 28.198 0.902  0.150     | 31.227 0.891  0.187     | 48.72   48 m
Quantitative evaluation of existing methods: metrics are averaged across all scenes in each dataset. Arrows indicate whether higher (↑) or lower (↓) values are better.
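For readers reproducing the table, here is a minimal sketch of how these three standard metrics are typically computed per image (our illustration, not the authors' evaluation script):

```python
import numpy as np
import torch
import lpips
from skimage.metrics import peak_signal_noise_ratio, structural_similarity

lpips_fn = lpips.LPIPS(net='alex')  # perceptual distance; lower is better

def eval_pair(pred: np.ndarray, gt: np.ndarray):
    """pred, gt: (H, W, 3) float arrays in [0, 1]."""
    psnr = peak_signal_noise_ratio(gt, pred, data_range=1.0)
    ssim = structural_similarity(gt, pred, channel_axis=-1, data_range=1.0)
    # LPIPS expects (N, 3, H, W) tensors scaled to [-1, 1].
    to_tensor = lambda x: torch.from_numpy(x).permute(2, 0, 1)[None].float() * 2 - 1
    lp = lpips_fn(to_tensor(pred), to_tensor(gt)).item()
    return psnr, ssim, lp
```

Per-dataset numbers are then averaged over all test images and scenes.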

More details (including validation experiments) can be found in our paper.

Related Links

There are several key works in the field of underwater scene reconstruction and novel view synthesis that are closely related to our approach.

3DGS: 3D Gaussian Splatting for Real-Time Radiance Field Rendering provides real-time rendering capabilities, though it lacks the modeling of scattering media needed for underwater scenes.

SeaThru-NeRF: Neural Radiance Fields in Scattering Media focuses on overcoming underwater artifacts caused by scattering media, offering a first step towards scattering-aware novel view synthesis in underwater environments.

DeepSeeColor: Realtime Adaptive Color Correction for Autonomous Underwater Vehicles via Deep Learning Methods introduces deep learning models that restore underwater images, which is closely related to our removal of water-medium effects.

Contact

Feel free to contact us at jieyuyuan.cn[AT]gmail.com!