Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Investigating the Performance of a Vision Transformer Model for Anomaly Detection in Laser Metal Deposition Imaging
University West, Department of Engineering Science.
2024 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 HE creditsStudent thesis
Abstract [en]

Laser metal deposition (LMD) is recognized as a critical technique in Additive Manufacturing (AM) that allows the production and repair of components in a high-quality, efficient, and cost-effective manner. However, defects may still arise in the deposited components. While conventional architectures like Convolutional Neural Networks (CNNs) have shown satisfactory results in detecting these defects using images captured during the process, transformer-based models remain relatively underexplored in this context.

This study focused on designing a transformer-based architecture that could achieve high accuracy in identifying anomalies through melt pool images obtained during the wirefed LMD process. Upon its development, it was used to crossreference the predictions of an existing powerful CNN approach to ensure the reliability of its outcomes.

Initially, the algorithm was trained using a custom Vision Transformer-decoder architecture with no labels involved, resulting in an accuracy of 92.66%. By utilizing the captured information from the classification token, its ability to identify anomalies was significantly improved, achieving 99.78% in a 900-image dataset.

However, when evaluated on 6,497 unseen frames from the process with ground truth predictions generated by the CNN model, ViT’s accuracy decreased to 97.83%, a result attributed to the specific training method and the variability in the test set. Despite this reduction, the results were considered satisfactory, given the relatively new application of transformers on images, which has not been extensively explored in the field of anomaly detection.

Overall, this research offers a comprehensive explanation of the proposed model architecture and outlines the necessary modifications required to achieve near-perfect performance on a transformer-based architecture, paving the way for future enhancements in anomaly detection.

Place, publisher, year, edition, pages
2024. , p. 47
Keywords [en]
Additive Manufacturing, Anomaly Detection, Artificial Intelligence, Computer Vision, Deep Learning, Laser Metal Deposition, Machine Learning, Transformer, Vision Transformer
National Category
Robotics and automation Manufacturing, Surface and Joining Technology
Identifiers
URN: urn:nbn:se:hv:diva-22133Local ID: EXA620OAI: oai:DiVA.org:hv-22133DiVA, id: diva2:1886506
Subject / course
Technology
Educational program
Master in AI and automation
Supervisors
Examiners
Available from: 2024-08-23 Created: 2024-08-01 Last updated: 2025-09-30Bibliographically approved

Open Access in DiVA

fulltext(1456 kB)632 downloads
File information
File name FULLTEXT01.pdfFile size 1456 kBChecksum SHA-512
e96dc6ee8ce7f83b14438138b59012db0397fdf1670b75e191af08843379cfe9ea40bb1c1d39020bb2f8af52c8cee9edfb5115c063d0feb32e77248e4a35f3b5
Type fulltextMimetype application/pdf

By organisation
Department of Engineering Science
Robotics and automationManufacturing, Surface and Joining Technology

Search outside of DiVA

GoogleGoogle Scholar
Total: 632 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 2438 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf