Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Reality to Simulation: A Scene Understanding Approach to 3D Log Pile Scene Reconstruction
Umeå University, Faculty of Science and Technology, Department of Physics. (Digital Physics)
2025 (English)Independent thesis Advanced level (professional degree), 20 credits / 30 HE creditsStudent thesisAlternative title
Verklighet till Simulering: Scenförståelse-baserad 3D-rekonstruktion av Timmerhögar (Swedish)
Abstract [en]

This thesis presents a pipeline for physically accurate reconstruction of log pile scenes from RGB-D data, connecting real-world perception and physics-based simulation. The proposed method integrates SAM-6D, a zero-shot 6D pose estimation framework, with AGX Dynamics, a high-fidelity physics engine. Starting from RGB-D images and a CAD model reference of a log, SAM-6D identifies and performs 6D pose estimation for each individual log. The resulting poses and segmentation masks are used to infer the terrain beneath occluded regions through interpolation, generating an initial terrain guess. Direct simulation of the predicted scene, however, does not often result in stable configurations. To address this, a heightfield optimization process is introduced.The terrain under each log is perturbed locally, and candidate configurations are evaluated in simulation using a loss function that penalizes deviation from the predicted poses, accumulated linear and angular velocity after spawn, and terrain distortion. The system is evaluated on synthetic log pile scenes under varying conditions in three different tests: AGX generated log pile scenes, repeated optimization on a poorly performing configuration, and added environmental complexities using Blender. Results show that the optimized simulations achieve median position errors of 18 mm, which is 7% error relative to the chosen log diameter, and angular deviations below 1° after letting the logs settle for 78 AGX-generated scenes, with a resulting error decrease of 59%. The heightfield optimization also demonstrates consistency across 57 repeated runs on a log pile configuration withpoor initial stability, resulting in a 94% improvement. The pipeline successfully segmented all three logs in 5 out of 10 Blender generated scenes. For these, the optimized simulations achieved a median position error of 23% relative to the log diameter and angular deviations of 1.6°.

Place, publisher, year, edition, pages
2025. , p. 26
Keywords [en]
Computer Vision, Machine Vision, SAM, SAM-6D, Segmentation, Pose Estimation, Scene Understanding, 3D Reconstruction, AGX Dynamics
National Category
Computer and Information Sciences
Identifiers
URN: urn:nbn:se:umu:diva-240745OAI: oai:DiVA.org:umu-240745DiVA, id: diva2:1973415
Subject / course
Examensarbete i teknisk fysik
Educational program
Master of Science Programme in Engineering Physics
Presentation
2025-06-12, Nat.D.410, UNIVERSITETSTORGET 4, 901 87, Umeå, 10:00 (Swedish)
Supervisors
Examiners
Available from: 2025-06-23 Created: 2025-06-19 Last updated: 2025-06-23Bibliographically approved

Open Access in DiVA

fulltext(18094 kB)794 downloads
File information
File name FULLTEXT01.pdfFile size 18094 kBChecksum SHA-512
4f97db2d340c501b12cf59497863868340a96a0f14c47d5befda53e5093e88177c5eba23edbc18e33bca4f1f407d3c28447b98b3e247dc509532e475d11fd5e7
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Lundberg, Philiph
By organisation
Department of Physics
Computer and Information Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 794 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 341 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf