555
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

90 lines
4.9 KiB

  1. \section{Background}
  2. \label{sec:bk_related}
  3. This section describes background of Super-resolution technique, and the sensors
  4. we are using in this work.
  5. %\subsection{Background}
  6. \subsection{Super-Resolution Convolutional Neural Network}
  7. C. Dong et al.~\cite{ChaoDong16} proposed a deep learning method for single image
  8. super-resolution (SR). Their method learns how to directly map the low-resolution
  9. image to high-resolution image. They show that the traditional sparse coding
  10. based SR methods can be reformulated into a deep convolutional neural network.
  11. SRCNN consists three operations.
  12. The first layer of SRCNN model extracts the patches from low-resolution image.
  13. The second layer maps the patches from low-resolution to high-resolution. The
  14. third layer will reconstruct the high-resolution image by the high-resolution
  15. patches.
  16. %% \begin{figure}[htb]
  17. %% \begin{center}
  18. %% \includegraphics[width=1\linewidth]{figures/SRCNN_model.pdf}
  19. %% \caption{Illustration of super-resolution convolutional neural network (SRCNN) model~\cite{ChaoDong16}.}
  20. %% \label{fig:SRCNN_model}
  21. %% \end{center}
  22. %% \end{figure}
  23. \subsection{Thermal cameras}
  24. In this work, we use two different resolution thermal cameras to play the role
  25. of low-resolution and high-resolution cameras. For low-resolution camera, we use
  26. Grid-EYE thermal camera. Grid-EYE is a thermal camera that can output
  27. $8 \times 8$ pixels thermal data with $2.5^\circ C$ accuracy and $0.25^\circ C$
  28. resolution at $10$ fps. For the high-resolution one, we use Lepton 3. The
  29. specification of them are shown in Table~\ref{table:specificationofdevices}.
  30. \begin{table}[tbp]
  31. \centering
  32. \footnotesize
  33. \begin{tabular}{|p{3cm}<{\centering}|p{2cm}<{\centering}|p{2.5cm}<{\centering}|}
  34. \hline
  35. Specification & Grid-EYE & Lepton 3\\
  36. \hline
  37. Resolution & 64 pixels (8x8) & 120x160\\
  38. \hline
  39. FOV-horizontal & $60^{\circ}$ & $49.5^{\circ}$\\
  40. \hline
  41. FOV-vertical & $60^{\circ}$ & $61.8^{\circ}$\\
  42. \hline
  43. Frame rate & 10Hz & 8.7Hz\\
  44. \hline
  45. Detect temperature range & $-20^{\circ}$C to $80^{\circ}$C & $0^{\circ}$C to $120^{\circ}$C\\
  46. \hline
  47. Output format & Absolute temperature (Celsius) & 14-bits value which is relative to camera temperature\\
  48. \hline
  49. Temperature accuracy & $\pm2.5^{\circ}$C & -\\
  50. \hline
  51. Temperature resolution & $0.25^{\circ}$C & -\\
  52. \hline
  53. Bits per pixel & 12 & 14\\
  54. \hline
  55. Data Rate & 7.68 kbps & 2.34 mbps\\
  56. \hline
  57. \end{tabular}
  58. \caption{Specification of Grid-EYE~\cite{grideye_datasheet} and Lepton 3~\cite{lepton_datasheet}.}
  59. \label{table:specificationofdevices}
  60. \end{table}
  61. %\subsection{Related Works}
  62. %X. Chen et al.~\cite{7805509} proposed to use the visible camera as a guidance to super resolution the IR images. They proposed the IR-RBG multi-sensor imaging system. The approach bases on the fact that RGB channels have different correlation with IR images. To avoid wrong texture transfer, the method used cross correlation to find the proper matching between the region of IR image and RGB image. Next, the method applied sub-pixel estimation and guided filtering to remove the noise and discontinuities in IR image. At the last step of the iteration, they used truncated quadric model as the cost function to terminate the algorithm. After the iteration, the method fine tuned the output IR image with outlier detection to remove the black points which only appear at the edge of IR image.
  63. % \begin{figure}[ht]
  64. % \begin{center}
  65. % \includegraphics[width=0.7\linewidth]{figures/color_guided_SR.pdf}
  66. % \caption{Flow chart of IR-Color multi-sensor imaging system.}
  67. % \label{fig:color_guided_SR}
  68. % \end{center}
  69. % \end{figure}
  70. %F. Almsari et al.~\cite{8713356} applied Generative Adversarial Network (GAN) to the super-resoltuion problem called {\it TIGAN}. TIGAN consists of two networks, generator and discriminator, which learn from each other with zero-sum games and try to find an equilibrium state. The generator was trained to transfer low-resolution thermal image into super-resolution thermal image domain. The generated image should be similar to its ground truth high-resolution image domain. While the discriminator was trained to discriminate between generated image and ground truth high-resolution image. Compared to SRCNN model, it preserves the high-frequency details and gave shaper textures.
  71. % \begin{figure}[hb]
  72. % \begin{center}
  73. % \includegraphics[width=0.7\linewidth]{figures/GAN_SR.pdf}
  74. % \caption{Network architecture of TIGAN.}
  75. % \label{fig:GAN_SR}
  76. % \end{center}
  77. % \end{figure}
  78. % % background
  79. %The assumption on the resolution of thermal images in above works is at least $120 \times 160$. Compared to the $8 \times 8$ thermal images collected by Grid-EYE sensors, more thermal features are available. In addition, these works use RGB images as reference to train the model. In this work, RGB camera will not be used so as to protect privacy.