フルテキストURL | |
著者 |
Pan, Shijun
Shenzhen Institute for Advanced Study, UESTC, University of Electronic Science and Technology of China
Fan, Zhun
Shenzhen Institute for Advanced Study, UESTC, University of Electronic Science and Technology of China
Yoshida, Keisuke
Graduate School of Environmental and Life Science, Okayama University
Qin, Shujia
Shenzhen Academy of Robotics
Kojima, Takashi
TOKEN C.E.E. Consultants Co., Ltd.
Nishiyama, Satoshi
Graduate School of Environmental and Life Science, Okayama University
|
抄録 | In recent years, large multimodal models (LMMs), such as ChatGPT 4o and DeepSeek R1—artificial intelligence systems capable of multimodal (e.g., image and text) human–computer interaction—have gained traction in industrial and civil engineering applications. Concurrently, insufficient real-world drone-view data (specifically close-distance, high-resolution imagery) for civil engineering scenarios has heightened the importance of artificially generated content (AIGC) or synthetic data as supplementary inputs. AIGC is typically produced via text-to-image generative models (e.g., Stable Diffusion, DALL-E) guided by user-defined prompts. This study leverages LMMs to interpret key parameters for drone-based image generation (e.g., color, texture, scene composition, photographic style) and applies prompt engineering to systematize these parameters. The resulting LMM-generated prompts were used to synthesize training data for a You Only Look Once version 8 segmentation model (YOLOv8-seg). To address the need for detailed crack-distribution mapping in low-altitude drone-based monitoring, the trained YOLOv8-seg model was evaluated on close-distance crack benchmark datasets. The experimental results confirm that LMM-prompted AIGC is a viable supplement for low-altitude drone crack monitoring, achieving >80% classification accuracy (images with/without cracks) at a confidence threshold of 0.5.
|
キーワード | artificial intelligence
large multimodal model
unmanned aerial vehicle
crack
|
発行日 | 2025-09-21
|
出版物タイトル |
Drones
|
巻 | 9巻
|
号 | 9号
|
出版者 | MDPI AG
|
開始ページ | 660
|
ISSN | 2504-446X
|
資料タイプ |
学術雑誌論文
|
言語 |
英語
|
OAI-PMH Set |
岡山大学
|
著作権者 | © 2025 by the authors.
|
論文のバージョン | publisher
|
DOI | |
関連URL | isVersionOf https://doi.org/10.3390/drones9090660
|
ライセンス | https://creativecommons.org/licenses/by/4.0/
|
Citation | Pan, S.; Fan, Z.; Yoshida, K.; Qin, S.; Kojima, T.; Nishiyama, S. Application of LMM-Derived Prompt-Based AIGC in Low-Altitude Drone-Based Concrete Crack Monitoring. Drones 2025, 9, 660. https://doi.org/10.3390/drones9090660
|