A Mathematical Explanation of UNet

Xue-cheng Tai, Hao Liu*, Raymond H. Chan, Lingfeng Li

*Corresponding author for this work

Research output: Contribution to journal; Journal article; peer-review

Abstract

The UNet architecture has transformed image segmentation. Its versatility and accuracy have driven widespread adoption and significantly advanced fields that rely on machine learning for image problems. In this work, we give a clear and concise mathematical explanation of UNet and explain the meaning and function of each of its components. We show that UNet solves a control problem. We decompose the control variables using multigrid methods and then solve the problem with operator-splitting techniques; the resulting algorithm has a structure that exactly recovers the UNet architecture. Our result shows that UNet is a one-step operator-splitting algorithm for the control problem.
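
For readers unfamiliar with operator splitting, the following is a minimal generic sketch of the idea, not the control problem or the algorithm from the paper. It assumes a toy scalar equation du/dt = -u + 1, split into a linear part and a constant source part, and advances each part in sequence within every time step (Lie splitting). All function names are hypothetical and chosen only for illustration.

```python
import math

def linear_flow(u, dt, a=-1.0):
    # Exact solution of the sub-problem du/dt = a*u over one step dt.
    return u * math.exp(a * dt)

def source_flow(u, dt, b=1.0):
    # Exact solution of the sub-problem du/dt = b over one step dt.
    return u + b * dt

def lie_splitting_step(u, dt, flows):
    # One splitting step: apply each sub-flow in turn, feeding the
    # output of one sub-problem into the next.
    for flow in flows:
        u = flow(u, dt)
    return u

def integrate(u0, t_end, n_steps):
    # Repeat the splitting step n_steps times to reach t_end.
    dt = t_end / n_steps
    u = u0
    for _ in range(n_steps):
        u = lie_splitting_step(u, dt, [linear_flow, source_flow])
    return u

if __name__ == "__main__":
    exact = 1.0 - math.exp(-1.0)  # exact solution of the toy equation at t = 1, u(0) = 0
    for n in (10, 100, 1000):
        print(n, abs(integrate(0.0, 1.0, n) - exact))
```

In this toy setting the splitting error shrinks roughly in proportion to the step size, which is the expected first-order behavior of Lie splitting. Loosely speaking, a single splitting step that chains simple sub-steps is the kind of structure the abstract relates to a network architecture; the paper itself should be consulted for the actual sub-problems.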
Original language: English
Number of pages: 16
Journal: Mathematical Foundations of Computing
Publication status: E-pub ahead of print - Oct 2024

User-Defined Keywords

  • UNet
  • Deep neural network
  • Image segmentation
  • Operator splitting
