Official repository for EarthMarker.
Authors: Wei Zhang*, Miaoxin Cai*, Tong Zhang, Jun Li, Yin Zhuang, and Xuerui Mao
- *These authors contributed equally to this work.
- The dataset, model, code, and demo are coming soon! 🚀
- [2024.07.19]: The paper for EarthMarker has been released on arXiv. 🔥🔥
EarthMarker, the first visual prompting MLLM for remote sensing, is proposed. EarthMarker can interpret RS imagery in multi-turn conversations at multiple granularities, including the image, region, and point levels, catering to the fine-grained interpretation needs of RS imagery. EarthMarker is capable of various RS visual tasks, including scene classification, referring object classification, captioning, and relationship analysis, which supports informed decision-making in real-world applications.
@article{zhang2024earthmarker,
title={EarthMarker: A Visual Prompting Multi-modal Large Language Model for Remote Sensing},
author={Zhang, Wei and Cai, Miaoxin and Zhang, Tong and Li, Jun and Zhuang, Yin and Mao, Xuerui},
journal={arXiv preprint arXiv:2407.13596},
year={2024}
}
This project benefits from LLaMA. Thanks for their wonderful work.