フルテキストURL
fulltext.pdf 18.5 MB
suppl.pdf 5.21 MB
著者
Pan, Shijun Shenzhen Institute for Advanced Study, UESTC, University of Electronic Science and Technology of China
Fan, Zhun Shenzhen Institute for Advanced Study, UESTC, University of Electronic Science and Technology of China
Yoshida, Keisuke Graduate School of Environmental and Life Science, Okayama University
Qin, Shujia Shenzhen Academy of Robotics
Kojima, Takashi TOKEN C.E.E. Consultants Co., Ltd.
Nishiyama, Satoshi Graduate School of Environmental and Life Science, Okayama University
抄録
In recent years, large multimodal models (LMMs), such as ChatGPT 4o and DeepSeek R1—artificial intelligence systems capable of multimodal (e.g., image and text) human–computer interaction—have gained traction in industrial and civil engineering applications. Concurrently, insufficient real-world drone-view data (specifically close-distance, high-resolution imagery) for civil engineering scenarios has heightened the importance of artificially generated content (AIGC) or synthetic data as supplementary inputs. AIGC is typically produced via text-to-image generative models (e.g., Stable Diffusion, DALL-E) guided by user-defined prompts. This study leverages LMMs to interpret key parameters for drone-based image generation (e.g., color, texture, scene composition, photographic style) and applies prompt engineering to systematize these parameters. The resulting LMM-generated prompts were used to synthesize training data for a You Only Look Once version 8 segmentation model (YOLOv8-seg). To address the need for detailed crack-distribution mapping in low-altitude drone-based monitoring, the trained YOLOv8-seg model was evaluated on close-distance crack benchmark datasets. The experimental results confirm that LMM-prompted AIGC is a viable supplement for low-altitude drone crack monitoring, achieving >80% classification accuracy (images with/without cracks) at a confidence threshold of 0.5.
キーワード
artificial intelligence
large multimodal model
unmanned aerial vehicle
crack
発行日
2025-09-21
出版物タイトル
Drones
9巻
9号
出版者
MDPI AG
開始ページ
660
ISSN
2504-446X
資料タイプ
学術雑誌論文
言語
英語
OAI-PMH Set
岡山大学
論文のバージョン
publisher
DOI
ライセンス
https://creativecommons.org/licenses/by/4.0/
Citation
Pan, S.; Fan, Z.; Yoshida, K.; Qin, S.; Kojima, T.; Nishiyama, S. Application of LMM-Derived Prompt-Based AIGC in Low-Altitude Drone-Based Concrete Crack Monitoring. Drones 2025, 9, 660. https://doi.org/10.3390/drones9090660