Skip to content

Parallel Fusion of CNN and Transformer structure for efficient industrial component image super-resolution under lightweight deployment

Notifications You must be signed in to change notification settings

Luckycat518/FPFCT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

37 Commits
 
 
 
 
 
 
 
 

Repository files navigation

FPFCT:For Industrial Image Super Resolution📌

GitHub Computer Vision in Action License

Table of Contents

Introduction

For the smart manufacturing, lots of studies focus on high-quilty surface detection under diverse industrial environments. However, for surface detection, besides the performance of advanced detectors, the image quality/resolution is also the key factor affecting the detection performance. Considering the urgency of the need for high-quality images when performing surface detection in complex and changeable industrial environments, it is necessary to propose a lightweight image processing method for effectively improving the input image quality for surface detection with only a small increase in time and computational cost. This method can reduce the dependence of the detection system on the environment and image collection equipment, leading to significant economical benefits on the industrial detection system design and deployment. Thus, we innovatively proposed a lightweight SR structure by parallelly fusing CNN feature extraction module and Transformer feature extraction module (FPFCT). Different from the serial structure that uses CNN and transformer to extract features successively, the parallel fusion structure can effectively avoid the irreparable information loss caused by the inheritable sampling imperfection of two different methods. Moreover, an industrial image SR dataset called WCI110, consisting of 110 typical welding component images with 2040×1524 pixels, is established for the vertification of the proposed method.

1. Our dataset for Industrial Image Super Resolution

1)Welding Component Images: WCI110

WCI110 can be used for image super resolution tasks.

The WCI110 contains 110 pictures with a total of 110 HR welding component surface images under 2040×1524 pixels. For this dataset, there are three typical scenarios, including global view with component structure background, and detail views focusing on the unground weld and ground weld.

2. Comparison with SOTA methods

1)In public datasets

The PSNR and SSIM of FPFCT are higher than other advanced SR methods and comparable to that of the SOTA SwinIR-light model almost in all the five public standard datasets and the self-established industrial dataset in all three scales. However, the model parameters are reduced by more than 37.5%, and the FLOPs are reduced by more than 25.6% compared to SwinIR-light in all three scales.

👆 BACK to Table of Contents -->

2)In WCI110

For three typical target, including workpiece scratches, surface rust and weld corrugation characteristics, the super-resolution repair ability of FPFCT is comparable to that of SwinIR-light, and it is ahead of other advanced methods. Moreover, it can be noticed that compared with the slimming target in SR image obtained by SwinIR-light, the target of FPFCT is closer to ground truth (GT) in terms of size.

👆 BACK to Table of Contents -->

Notification

The first version of the dataset containing the original images appearing in the paper has been released.

The WCI110 is collected from a commercial company. Therefore, this dataset is limited to academic use!!!Prohibited for any commercial use!!! You can only use this dataset for research purpose. Before the entire dataset is fully released, if you need this dataset to do some research, please contact us. After review and permission by the commercial company, we will provide the dataset to you.

If you have any questions or idea, please let me know

(email: [email protected])

👆 BACK to Table of Contents -->

Citation

This paper is still under review......

👆 BACK to Table of Contents -->

About

Parallel Fusion of CNN and Transformer structure for efficient industrial component image super-resolution under lightweight deployment

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages