Color Image Demosaicking using a 3-stage Convolutional Neural Network Structure
Abstract
Color demosaicking (CDM) is a critical first step for the acquisition of high-quality RGB images with single chip cameras. Conventional CDM approaches are mostly based on interpolation schemes and hand-crafted image priors, which result in unpleasant visual artifacts in some cases. Motivated by the special characteristics of inter-channel correlations (higher correlations for R/G and G/B channels than that for R/B), in this paper, a 3-stage convolutional neural network (CNN) structure for CDM is proposed. In the first stage, the G channel is reconstructed independently. Then, by using the reconstructed G channel as guidance, the R and B channels are recovered in the second stage. Finally, high-quality RGB color images are reconstructed in the third stage. The objective and visual quality evaluation results show that the proposed structure achieves noticeable quality improvements in comparison to the state-of-the-art approaches.
Proposed Structure
Inspired by inter-channel correlation, in this paper, we propose a 3-stage CNN structure for color demosaicking.
Evaluation
You can download the original images and zoom in for better visual quality, the difference for area marked with the green box can be easily observed. (The 2-stage results are based on our reimplementation since the source code is not available yet, all other results are generated from the original source codes downloaded from the authors’ webpages.)
kodim19.png from Kodak dataset
kodim19.png Original Image
kodim19.png AHD (38.29dB)
kodim19.png DLMMSE (41.05dB)
kodim19.png DLMMSE (41.83dB)
kodim19.png LDI-NAT (37.78dB)
kodim19.png RI (39.16dB)
kodim19.png MLRI (39.92dB)
kodim19.png ARI (40.50dB)
kodim19.png RI-modified (39.46dB)
kodim19.png ARI-modified (40.57dB)
kodim19.png 2-stage (41.74dB)
kodim19.png Proposed (42.82dB)
kodim01.png from Kodak dataset
kodim01.png Original Image
kodim01.png AHD (35.12dB)
kodim01.png DLMMSE (38.52dB)
kodim01.png DLMMSE (39.62dB)
kodim01.png LDI-NAT (35.29dB)
kodim01.png RI (35.57dB)
kodim01.png MLRI (36.80dB)
kodim01.png ARI (38.84dB)
kodim01.png RI-modified (36.33dB)
kodim01.png ARI-modified (38.84dB)
kodim01.png 2-stage (40.62dB)
kodim01.png Proposed (41.93dB)
03168.bmp from WED dataset
03168.bmp Original Image
03168.bmp AHD (29.01dB)
03168.bmp DLMMSE (31.09dB)
03168.bmp DLMMSE (31.47dB)
03168.bmp LDI-NAT (29.86dB)
03168.bmp RI (29.49dB)
03168.bmp MLRI (30.16dB)
03168.bmp ARI (29.64dB)
03168.bmp RI-modified (29.69dB)
03168.bmp ARI-modified (29.98dB)
03168.bmp 2-stage (33.57dB)
03168.bmp Proposed (33.90dB)
For more objective evaluation results, please refer to our paper.
Source Code
https://github.com/amnesiack/ICIP2018CDM