2nd Integrating Image Processing with Large-Scale Vision/Language Models for Advanced Visual Understanding Workshop

(IEEE ICIP 2025 ALASKA)

Introduction

This workshop aims to bridge the gap between conventional image processing techniques and the latest advancements in large-scale models (LLM and LVLM). In recent years, the integration of large-scale models into image processing tasks has shown significant promise in improving visual object understanding and image classification.

This workshop will provide a platform for researchers and practitioners to explore the synergies between conventional image processing methods and cutting-edge large language model and large vision language models, fostering innovation and collaboration in the field.

Our objectives are as follows:

Explore the foundations of image processing techniques with large-scale models.
Investigate the current landscape of large-scale language/vision models and their capabilities.
Discuss challenges and opportunities in integrating large-scale models with image processing to enhance visual understanding.
Showcase practical examples and case studies where the combined approach has yielded superior results.

This workshop is designed for researchers, academics, and industry professionals working in the fields of image processing, computer vision, multimedia processing and natural language processing. Participants should have a basic understanding of image processing concepts and an interest in exploring innovative approaches for visual understanding.

The workshop will consist of paper presentations by leading experts in image processing and large-scale language/vision models. Participants will have the opportunity to engage in discussions, exchange insights, and collaborate on potential research projects.

Call for Papers

We warmly invite submissions of high-quality research papers, not exceeding 4 pages (excluding references), that focus on the following workshop topics on Large-scale Vision/Language Models. You can download the pdf version of Call for Papers.

Workshop Topics (include, but not limited to):

Cross-Modal Fusion
Object Detection and Recognition with Large-scale models
Image Classification and Annotation
Multimodal Sensor Fusion
Semantic Segmentation with Large-scale Models
Cross-Domain Visual Understanding
Visual Question Answering (VQA) Systems
Text-Image Linking and Alignment

cfp-icip25_v1.0.pdf

Paper Submission

Authors are invited to submit papers. Submission instructions, templates and the “no show” policy are detailed in https://2025.ieeeicip.org/. Please note, as per ICIP regulations, at least one author from each accepted paper is required to complete an in-person registration for the conference. All accepted papers will be published in IEEE Xplore.

Authors can submit their papers through our submission link: https://cmsworkshops.com/ICIP2025/papers/submission.asp?Type=WS&ID=1

Important Dates

Paper Submission Deadline: May 28, 2025.
Paper Acceptance Notification: July 2, 2025
Final Paper Submission Deadline: July 9, 2025
Author Registration Deadline: July 16, 2025

Organizers

Yong Man Ro (KAIST, South Korea), e-mail address: ymro@kaist.ac.kr

Prof. Yong Man Ro earned his Ph.D. degree from the Department of Electrical Engineering at KAIST. He conducted research at various institutions including Columbia University, the University of California, Irvine, and, Berkeley. Additionally, he served as a visiting professor at the University of Toronto. Currently, he holds the position of full professor at the School of Electrical Engineering and ICT endowed chair professor at KAIST. Furthermore, he is the director of the Center for Applied Research in Artificial Intelligence, the Image Video System Lab, and the Integrated Vision and Language Lab at KAIST. Prof. Ro has received notable recognition, including the Young Investigator Finalist Award from ISMRM in 1992 and the Scientist of the Year Award (Korea) in 2003. He has contributed to the academic community, has served as an associate editor for IEEE Signal Processing Letters and currently serving IEEE Transactions on Circuits and Systems for Video Technology. He is also the IVMSP committee member in the IEEE Signal Processing Society. Moreover, he has played key roles in organizing numerous international conferences, including serving as the organizing chair/program chair of MMM 2020/PCM 2015 and IWDW 2004. He has also curated several special sessions, such as "Explainable Deep Neural Networks for Image/Video Processing" at ICIP 2021 and 2022, "Digital Photo Album Technology" at AIRS 2005, "Social Media" at DSP 2009, and "Human 3D Perception and 3D Video Assessments" at DSP 2011. Prof. Ro's recent research interests span various AI areas, including deep learning in computer vision and image processing, multimodal learning, integrating vision, speech, and language for AI, explainable and robust AI. His scholarly output includes over 520 peer-reviewed papers published in international journals and conferences.

2. Wen-Huang Cheng (National Taiwan University), e-mail address: wenhuang@csie.ntu.edu.tw

Prof. Wen-Huang Cheng is a University Distinguished Chair Professor in the Department of Computer Science and Information Engineering at National Taiwan University. His current research interests include multimedia, computer vision, and machine learning. He has actively participated in international events and played significant leadership roles in prestigious journals, conferences, and professional organizations. These roles include serving as Editor-in-Chief for IEEE CTSoc News on Consumer Technology, Senior Editor for IEEE Consumer Electronics Magazine (CEM), Associate Editor for IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) and IEEE Transactions on Multimedia (TMM), General Chair for ACM MMAsia (2023), IEEE ICME (2022), and ACM ICMR (2021), Chair for IEEE CASS Multimedia Systems and Applications (MSA) technical committee, and governing board member for IAPR. He has received numerous research and service awards, including the Best Paper Award at the 2021 IEEE ICME and the Outstanding Associate Editor Award of IEEE TMM (2021 and 2020, twice). He is an IEEE Fellow, IET Fellow, and ACM Distinguished Member.

3. Hak Gu Kim (Chung-Ang University, South Korea), e-mail address: hakgukim@cau.ac.kr

Prof. Hak Gu Kim received the Ph.D. degree from the Department of Electrical Engineering at Korea Advanced Institute of Science and Technology (KAIST), Daejeon, South Korea, in 2019. From 2019 to 2021, he was a postdoctoral researcher at École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland. He is currently an assistant professor at the Graduate School of Advanced Imaging Science, Multimedia & Films (GSAIM) in Chung-Ang University, South Korea. He served as the tutorial chair for the 2023 IEEE International Conference on Electronics, Information, and Communication (ICEIC). His research interests include deep learning and machine learning in 2D/3D/VR image and video processing and computer vision, human visual perception, multi-modal learning, and vulnerability of deep neural network for convergence of AI and reality.

Workshop Committees

Zhu Li (University of Missouri, United States), e-mail address: lizhu@umkc.edu

Wesley De Neve (Ghent University, Belgium), e-mail address: Wesley.DeNeve@ghent.ac.kr

Cong-Thang Truong (The University of Aizu, Japan), e-mail address: thang@u-aizu.ac.jp

Minsu Kim (Meta, United Kingdom), e-mail address: minsu@meta.com

Hyung Il Kim (Chonnam National University, South Korea), e-mail address: hikim@jnu.ac.kr

Seong Tae Kim (Kyung Hee University, South Korea), e-mail address: st.kim@khu.ac.kr

Jung Uk Kim (Kyung Hee University, South Korea), e-mail address: ju.kim@khu.ac.kr

Sangmin Lee (Sungkyunkwan University, South Korea), e-mail address: sangmin.lee@skku.edu

Youngjoon Yu (KAIST, South Korea), e-mail address: greatday@kaist.ac.kr

Chen Liu (City University of Hong Kong, Hong Kong), e-mail address: chen.liu@cityu.edu.hk

Chuan-Yu Chang, (National Yunlin University of Science and Technology), e-mail address: chuanyu@yuntech.edu.tw

Hong-Han Shuai (National Yang Ming Chiao Tung University), e-mail address: hhshuai@nycu.edu.tw

Ching-Chun Huang (National Yang Ming Chiao Tung University), e-mail address: chingchun@nycu.edu.tw

Yunghui Li (Hon Hai Research Institute), e-mail address: yunghui.li@foxconn.com

Hongxia Xie (Jilin University), e-mail address: hongxiaxie@jlu.edu.cn

Shintami Chusnul Hidayati (Institut Teknologi Sepuluh Nopember), e-mail address: shintami@its.ac.id

Supported by

This workshop is supported by Center for Applied Research in Artificial Intelligence (CARAI) and CAU Graduate School of Metaverse Convergence.

Previous Workshops

2024

Page updated

Google Sites

Report abuse