Automating Synthetic Dataset Generation for Image-based 3D Detection: A Literature Review

A comprehensive review of synthetic dataset generation approaches for 3D object detection, covering both traditional 3D modeling and neural image synthesis methods, along with techniques to address the Sim-to-Real domain gap.

Automating Synthetic Dataset Generation for Image-based 3D Detection: A Literature Review

Reliable 3D detection is fundamental to autonomous systems such as mobile robots, self-driving cars, and unmanned aerial vehicles (UAVs). To achieve this capability, researchers have developed and trained supervised networks, which require large amounts of diverse and precisely annotated data. Due to the complex, expensive, and time-consuming capturing and annotation process, synthetic dataset generation approaches have gained popularity over the last decade.

With increasing computational resources and advances in simulation technologies, a variety of dataset generators have emerged. These methods rely on either traditional 3D modeling or neural image synthesis to generate data for specific scenarios or general-purpose 3D detection tasks. Their primary goal is to produce high-quality, annotated 3D datasets in an automated and scalable manner.

In this review, we evaluate the extent to which state-of-the-art approaches fulfill this goal by introducing a categorization scheme and conducting a comprehensive analysis of both 3D modeling and neural synthesis methods. Our analysis includes techniques used to address the Sim-to-Real domain gap. Furthermore, we assess each method’s level of automation, prerequisites, and practical adoption. This review aims to guide the reader in selecting automated dataset generation workflows for specific detection problems. By considering dataset quality, prerequisites, and application scenarios, we offer practical insights into identifying suitable methods for diverse downstream tasks.

Fulltext Access

https://doi.org/10.1007/s10462-025-11431-3

Citation (BibTeX)

@article{schulz2026automating,
  author    = {Paul Schulz and Thorsten Hempel and Magnus Jung and Ayoub Al-Hamadi},
  title     = {Automating synthetic dataset generation for image-based 3D detection: a literature review},
  journal   = {Artificial Intelligence Review},
  volume    = {59},
  number    = {1},
  pages     = {8},
  year      = {2026},
  doi       = {10.1007/s10462-025-11431-3},
  url       = {https://doi.org/10.1007/s10462-025-11431-3}
}