Vision Transformers have demonstrated strong performance across diverse visual tasks; however, their high computational cost due to long token sequences remains a critical challenge. Sequence ...
In recent years, the pyramid-based encoder-decoder network architecture has become a popular solution to the problem of large deformation image registration due to its excellent multi-scale ...