Scene Text Image Super Resolution Via Semantic Distillation and Text Perceptual ...
Progressive Pseudo Labeling for Multi Dataset Detection Over Unified Label Space
Adaptive Knowledge Distillation With Attention Based Multi Modal Fusion for Robu...
Implicit and Explicit Language Guidance for Diffusion Based Visual Perception
Explain Vision Focus: Blending Human Saliency Into Synthetic Face Images
RHLS: A Robust Hybrid Level Set Model Using Global Local Signed Energy Based Pres...
A Mobile Image Driven PM2.5 Estimation Framework Using Deep Learning Techniques
Token Masking Transformer for Weakly Supervised Object Localization
Focus Entirety and Perceive Environment for Arbitrary Shaped Text Detection
FER Former Multimodal Transformer for Facial Expression Recognition
PMMTalk: Speech Driven 3D Facial Animation From Complementary Pseudo Multi Modal ...
VB-KGN: Variational Bayesian Kernel Generation Networks for Motion Image Deblurring
Training Free Image Style Alignment for Domain Shift on Handheld Ultrasound Devices
Deep Learning Landscape Evaluation System Integrating Poetic Emotion and Visual ...
SafeRespirator: Comprehensive Database for N95 Filtering Facepiece Respirator Lea...
An Efficient Detector for Automatic Tomato Classification Systems