Investigating the Performance of a Vision Transformer Model for Anomaly Detection in Laser Metal Deposition Imaging
2024 (Engelska)Självständigt arbete på avancerad nivå (masterexamen), 20 hp
Studentuppsats (Examensarbete)
Abstract [en]
Laser metal deposition (LMD) is recognized as a critical technique in Additive Manufacturing (AM) that allows the production and repair of components in a high-quality, efficient, and cost-effective manner. However, defects may still arise in the deposited components. While conventional architectures like Convolutional Neural Networks (CNNs) have shown satisfactory results in detecting these defects using images captured during the process, transformer-based models remain relatively underexplored in this context.
This study focused on designing a transformer-based architecture that could achieve high accuracy in identifying anomalies through melt pool images obtained during the wirefed LMD process. Upon its development, it was used to crossreference the predictions of an existing powerful CNN approach to ensure the reliability of its outcomes.
Initially, the algorithm was trained using a custom Vision Transformer-decoder architecture with no labels involved, resulting in an accuracy of 92.66%. By utilizing the captured information from the classification token, its ability to identify anomalies was significantly improved, achieving 99.78% in a 900-image dataset.
However, when evaluated on 6,497 unseen frames from the process with ground truth predictions generated by the CNN model, ViT’s accuracy decreased to 97.83%, a result attributed to the specific training method and the variability in the test set. Despite this reduction, the results were considered satisfactory, given the relatively new application of transformers on images, which has not been extensively explored in the field of anomaly detection.
Overall, this research offers a comprehensive explanation of the proposed model architecture and outlines the necessary modifications required to achieve near-perfect performance on a transformer-based architecture, paving the way for future enhancements in anomaly detection.
Ort, förlag, år, upplaga, sidor
2024. , s. 47
Nyckelord [en]
Additive Manufacturing, Anomaly Detection, Artificial Intelligence, Computer Vision, Deep Learning, Laser Metal Deposition, Machine Learning, Transformer, Vision Transformer
Nationell ämneskategori
Robotik och automation Bearbetnings-, yt- och fogningsteknik
Identifikatorer
URN: urn:nbn:se:hv:diva-22133Lokalt ID: EXA620OAI: oai:DiVA.org:hv-22133DiVA, id: diva2:1886506
Ämne / kurs
Teknik
Utbildningsprogram
Master i AI och automation
Handledare
Examinatorer
2024-08-232024-08-012025-09-30Bibliografiskt granskad