I am a first-year PhD student at the National University of Singapore 🇸🇬, majoring in Biomedical Engineering. Prior to this, I obtained a Bachelor of Engineering in Computer Science from Nanyang Technological University. My current research interests include AI in Healthcare, Surgical Video Analysis, and Large Multimodal Models.
- Conference: NeurIPS 2024 Workshop
- Description: Efficient Segment Anything 2 (SAM2) with a frame-pruning mechanism for real-time surgical video segmentation
- Paper
- Conference: CVPR 2024
- Description: Low-level visual instruction tuning for multi-modality LLMs
- Paper
- Conference: ICLR 2024 (spotlight)
- Description: A benchmark for multi-modality LLMs on low-level vision and visual quality assessment.
- Paper
- Conference: ACMMM 2023 (oral)
- Description: A 16-dimensional VQA dataset and method for more explainable VQA.
- Paper
- Conference: ICCV 2023
- Description: A state-of-the-art NR-VQA method that predicts disentangled aesthetic and technical quality.
- Paper
- Email: zhangerlicarl@gmail.com or erli.zhang@nus.edu.sg
- Twitter: @zhang_erli