We have finished the AliProduct competition and uploaded our solution to GitHub; the report will come soon.
Our solution achieves an average recall of 0.76 on the validation set without pre-trained models, requires no extra dirty-data pre-processing or multi-stage training, and runs at 0.16 s per image per GPU. We train the model on 8×A100 40G GPUs with the AliProduct dataset, which includes 4 million image-context pairs.
conda create -n air python=3.9
conda activate air
pip install -r requirements.txt
You can change some hyperparameters in train_retrieval.py before running; a sketch of the kind of settings involved is given after the command below.
bash run.sh
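As a hypothetical sketch of the kind of hyperparameters exposed in train_retrieval.py (the real option names and defaults in this repo may differ):

```python
import argparse

# Hypothetical sketch only: the actual option names and defaults in
# train_retrieval.py may differ.
parser = argparse.ArgumentParser(description="retrieval training settings")
parser.add_argument("--image_size", type=int, default=384)      # input resolution
parser.add_argument("--batch_size", type=int, default=64)       # per-GPU batch size
parser.add_argument("--epochs", type=int, default=10)
parser.add_argument("--init_lr", type=float, default=1e-5)      # peak learning rate
parser.add_argument("--weight_decay", type=float, default=0.05)
args = parser.parse_args()
```

If you change the number of GPUs, the per-GPU batch size and learning rate are the usual settings to revisit.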
After finishing the training steps, you can use itm_predict.py or itc_predict.py to predict the results. To test the performance, run:
bash test.sh
test.sh computes the itm_score or itc_score for the top_k image-context pairs. The start and end arguments select the range of image indices handled by one process, so evaluation can be accelerated by splitting the work across multiple GPUs. Scoring 10 image-context pairs takes about 3.6 s on an A100 80G GPU.
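As a rough sketch of what this two-stage scoring does (not the actual code in itm_predict.py or itc_predict.py; the function names and arguments below are assumptions), the cheap contrastive (ITC) similarity shortlists the top_k contexts for each image, and the heavier image-text matching (ITM) head reranks only that shortlist:

```python
import torch

def rescore_topk(itc_sim, itm_score_fn, top_k=10, start=0, end=None):
    """Illustrative only. itc_sim: [num_images, num_contexts] similarity
    matrix from the contrastive (ITC) head; itm_score_fn(i, j) returns the
    image-text matching (ITM) score for image i and context j. start/end
    restrict which image indices this process handles, so several GPUs can
    split the work."""
    end = itc_sim.size(0) if end is None else end
    results = {}
    for i in range(start, end):
        # Cheap stage: keep only the top_k contexts by contrastive similarity.
        topk_scores, topk_idx = itc_sim[i].topk(top_k)
        # Expensive stage: rerank those candidates with the ITM head.
        itm_scores = torch.tensor([itm_score_fn(i, j.item()) for j in topk_idx])
        order = itm_scores.argsort(descending=True)
        results[i] = topk_idx[order].tolist()
    return results
```

Running several such processes with different start/end ranges is how the work is spread over multiple GPUs. For reference, here are a few raw context strings from the dataset, showing the kind of noisy captions the model is trained on: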
- context: "M & D Simple Modern Light Luxury Comfort Good Quality Living Room with a Double Motor Lounge Chair Sofa TE04"
- context: "er tong hua xing che fang ce fan niu niu che 1-3 sui bao bao wan ju che yin le ke zuo ke qi si lun lium che"
- context: "feiyangg/LP Paragraph Style Electric Guitar Tiger Veneer Factory Direct Color Can Be Customized"