I am first-year Ph.D. student in Department of Artificial Intelligence at Kyungpook National University, advised by Prof. Dong-Gyu Lee.
Prior to KNU, I received a B.S. degree in Department of Computer Engineering at Kumoh National Institute of Technology, advised by Prof. Hyeonju Yoon. I worked on diverse topics in computer vision.
Advised by Professor Dong-Gyu Lee.
Mar 2024 - Present | Daegu, Korea
Advised by Professor Dong-Gyu Lee.
Mar 2024 - Present | Daegu, Korea
Advised by Professor Hyeonju Yoon.
Mar 2020 - Feb 2024 | Gumi, Korea
My research interest is human-centric AI, especially in the field of computer vision (CV).
In Preparation
Under Review
This paper proposes a context-aware adaptive mask selection framework for referring video object segmentation (RVOS). The method integrates CLIP-based keyframe selection, mask consistency scoring, text-guided object number determination, and temporal mask interpolation to ensure both temporal stability and accurate multi-object segmentation.
This paper proposes a multi-scale discrete wavelet transform framework designed to enhance the detail quality of super-resolution (SR) methods based on latent diffusion model. Experimental results on benchmark datasets demonstrate that the proposed framework outperforms previous methods, effectively improving the detail quality of super-resolved images.
This paper proposes a multi-perspective prompt-based framework for zero-shot image classification using LVLMs, enhancing robustness by integrating object-aware, intent-aware, and reason-aware prompts, and achieving superior performance through ensemble prediction on the VizWiz dataset.
This paper proposes a diffusion model-based image super-resolution framework using multi-scale high-frequency error maps, enabling accurate restoration of image details and outperforming existing methods on benchmark datasets.
This paper proposes a robust OCR algorithm using the end-to-end Transformer-based TrOCR model, achieving a Character Error Rate (CER) of approximately 0.08 and significantly outperforming Tesseract OCR and EasyOCR.
This paper proposes a system for detecting vehicle damage areas through semantic segmentation using the UNet++ model, achieving pixel-level accuracy with an average IoU of 0.94 and comparing its performance with the conventional U-Net model.
This paper proposes a deep learning-based technique for detecting stains in clothing images using a UNet++ model with an EfficientNet encoder pre-trained on ImageNet, achieving an average IoU of approximately 0.9589.
This paper analyzes various words used in CQI reports and constructs a domain dictionary to provide the foundation for future automatic analysis of such reports.
This paper proposes a vehicle model recognition and license plate identification system using YOLOv5, enabling both tasks simultaneously and reducing computational cost compared to previous methods.
Technology Transfer
Jul 2025 - Dec 2025 | Daegu, Korea
Undergraduate Tutor
Mar 2025 - Jul 2025 | Daegu, Korea
Education Completion
Jul 2022 - Sep 2022 | Online
Web Publishing Team
Jul 2022 - Sep 2022 | Gumi, Korea
Facilities Management Department - Data Processing
Jul 2021 - Sep 2021 | Gumi, Korea
Freshmen Mentor
Mar 2021 - Feb 2022 | Gumi, Korea
|
Source code credit to Dr. Jon Barron |