このエントリーをはてなブックマークに追加


ID 69374
FullText URL
fulltext.pdf 18.5 MB
suppl.pdf 5.21 MB
Author
Pan, Shijun Shenzhen Institute for Advanced Study, UESTC, University of Electronic Science and Technology of China
Fan, Zhun Shenzhen Institute for Advanced Study, UESTC, University of Electronic Science and Technology of China
Yoshida, Keisuke Graduate School of Environmental and Life Science, Okayama University
Qin, Shujia Shenzhen Academy of Robotics
Kojima, Takashi TOKEN C.E.E. Consultants Co., Ltd.
Nishiyama, Satoshi Graduate School of Environmental and Life Science, Okayama University
Abstract
In recent years, large multimodal models (LMMs), such as ChatGPT 4o and DeepSeek R1—artificial intelligence systems capable of multimodal (e.g., image and text) human–computer interaction—have gained traction in industrial and civil engineering applications. Concurrently, insufficient real-world drone-view data (specifically close-distance, high-resolution imagery) for civil engineering scenarios has heightened the importance of artificially generated content (AIGC) or synthetic data as supplementary inputs. AIGC is typically produced via text-to-image generative models (e.g., Stable Diffusion, DALL-E) guided by user-defined prompts. This study leverages LMMs to interpret key parameters for drone-based image generation (e.g., color, texture, scene composition, photographic style) and applies prompt engineering to systematize these parameters. The resulting LMM-generated prompts were used to synthesize training data for a You Only Look Once version 8 segmentation model (YOLOv8-seg). To address the need for detailed crack-distribution mapping in low-altitude drone-based monitoring, the trained YOLOv8-seg model was evaluated on close-distance crack benchmark datasets. The experimental results confirm that LMM-prompted AIGC is a viable supplement for low-altitude drone crack monitoring, achieving >80% classification accuracy (images with/without cracks) at a confidence threshold of 0.5.
Keywords
artificial intelligence
large multimodal model
unmanned aerial vehicle
crack
Published Date
2025-09-21
Publication Title
Drones
Volume
volume9
Issue
issue9
Publisher
MDPI AG
Start Page
660
ISSN
2504-446X
Content Type
Journal Article
language
English
OAI-PMH Set
岡山大学
Copyright Holders
© 2025 by the authors.
File Version
publisher
DOI
Related Url
isVersionOf https://doi.org/10.3390/drones9090660
License
https://creativecommons.org/licenses/by/4.0/
Citation
Pan, S.; Fan, Z.; Yoshida, K.; Qin, S.; Kojima, T.; Nishiyama, S. Application of LMM-Derived Prompt-Based AIGC in Low-Altitude Drone-Based Concrete Crack Monitoring. Drones 2025, 9, 660. https://doi.org/10.3390/drones9090660