Heeseong Shin

I am curently a Ph.D student at the CVLAB at KAIST AI, advised by Seungryong Kim. My main research interest lies in solving practical tasks, such as image segmentation and 3D reconstruction, in an efficient manner by leveraging foundation models. I am also interested in diffusion models and Vision-Language Models (VLMs).
I had the opportunity to intern at Naver AI Lab, where I worked with Taekyung Kim. I have also collaborated with Google Research and Microsoft Research Asia, and I am continuing to work closely with Anurag Arnab at Google Deepmind.
Outside of research, I am a huge fan of the Scuderia in Formula 1 and I also love to travel. If you are interested in my work, please feel free to reach out to me!
publications
(*) denotes equal contribution- ICCVS4M: Boosting Semi-Supervised Instance Segmentation with SAMIn IEEE/CVF International Conference on Computer Vision, 2025
- ICMLPF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting for Novel View SynthesisIn International Conference on Machine Learning, 2025
- NeurIPSTowards Open-Vocabulary Semantic Segmentation Without Semantic LabelsIn Advances in Neural Information Processing Systems, 2024
- ICMLWLarge Language Models are Frame-Level Directors for Zero-Shot Text-to-Video GenerationIn First Workshop on Controllable Video Generation at ICML, 2024
- CVPR
Highlight CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic SegmentationIn IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024 - CVPR
Highlight Unifying Correspondence Pose and NeRF for Generalized Pose-Free Novel View SynthesisIn IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024