This repository contains the demo code, model, and methods used in my CVPR 2019 paper (link). There are some differences between the model used in the paper and the one in this repository: the paper's model is implemented in TensorFlow, while the model here is implemented in PyTorch (0.4.1). If you are interested in this work, please run demo.ipynb in Jupyter Notebook to see the model and methods.
The model is trained on the SALICON dataset.
Some saliency prediction examples on the OSIE dataset
I annotated all the data myself using LabelMe.
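For anyone reproducing the mask generation, the following is a minimal sketch (not the repository's actual conversion script) that rasterizes standard LabelMe JSON polygons into an ID mask. The `labelme_json_to_mask` helper and the label-to-ID mapping passed to it are hypothetical; only the LabelMe JSON layout ("shapes", "imageHeight", "imageWidth") is standard.

```python
# Hypothetical sketch: rasterize LabelMe polygon annotations into an ID mask.
# Assumes the standard LabelMe JSON layout; the label_to_id mapping you pass in
# is illustrative, not necessarily the one used to build OSIE-SR.
import json
import numpy as np
from PIL import Image, ImageDraw

def labelme_json_to_mask(json_path, label_to_id):
    with open(json_path) as f:
        ann = json.load(f)
    mask = Image.new("L", (ann["imageWidth"], ann["imageHeight"]), 0)
    draw = ImageDraw.Draw(mask)
    for shape in ann["shapes"]:
        if shape["label"] in label_to_id:
            polygon = [tuple(p) for p in shape["points"]]
            draw.polygon(polygon, fill=label_to_id[shape["label"]])
    return np.array(mask)
```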
synthetic_data
OSIE-SR
The SegmentationClass folder contains the semantic-level masks for each salient region in the image, and the SegmentationObject folder contains the instance-level masks for each salient region in the image.
1: person head, 10: person part, 16: animal head, 22: animal part, 27: object, 37: text, 46: symbol, 51: vehicle, 57: food, 63: plant, 68: drink, 73: other
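As an illustration, here is a hedged sketch of how such a mask can be inspected, assuming the masks are single-channel (or palette) PNGs whose pixel values are the category IDs listed above. The file name and the background ID of 0 are assumptions, not confirmed details of OSIE-SR.

```python
# Minimal sketch for inspecting a semantic mask, assuming pixel values are the
# category IDs listed above. The file name is a placeholder, and mapping 0 to
# "background" is an assumption.
import numpy as np
from PIL import Image

ID_TO_CATEGORY = {
    0: "background", 1: "person head", 10: "person part", 16: "animal head",
    22: "animal part", 27: "object", 37: "text", 46: "symbol",
    51: "vehicle", 57: "food", 63: "plant", 68: "drink", 73: "other",
}

mask = np.array(Image.open("SegmentationClass/1001.png"))  # hypothetical file
for value in np.unique(mask):
    print(int(value), ID_TO_CATEGORY.get(int(value), "unknown"))
```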
Here we report the learned representations in different backbones (ResNet-18) after fine-tuning.
Comparison of the activation maps in VGG-16 and ResNet-18 after fine-tuning: the top row shows the image and the activation maps from VGG-16 after fine-tuning; the bottom row shows the ground-truth saliency map and the activation maps from ResNet-18 after fine-tuning.
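For reference, activation maps of this kind can be pulled from a backbone with a standard PyTorch forward hook. The sketch below uses torchvision's stock ResNet-18 with ImageNet weights and an assumed input size; it is not the paper's exact fine-tuned model or visualization code.

```python
# Sketch of extracting a 2-D activation map from ResNet-18 via a forward hook.
# Uses torchvision's pretrained model as a stand-in for the fine-tuned backbone.
import torch
import torchvision.models as models
import torchvision.transforms as T
from PIL import Image

model = models.resnet18(pretrained=True).eval()
activations = {}

def hook(module, inp, out):
    activations["layer4"] = out.detach()

model.layer4.register_forward_hook(hook)

preprocess = T.Compose([
    T.Resize((224, 224)),
    T.ToTensor(),
    T.Normalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225]),
])
img = preprocess(Image.open("example.jpg").convert("RGB")).unsqueeze(0)
with torch.no_grad():
    model(img)

# Average over channels to get a single 2-D activation map for visualization.
act_map = activations["layer4"].mean(dim=1)[0]
```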
You first need to set your own image path and binary fixation map path in the code (a minimal data-loading sketch follows the training command below).
python sal_train_pt.py
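The actual data loading lives in sal_train_pt.py; the sketch below only illustrates the kind of image/fixation-map pairing those paths refer to. The `SaliencyDataset` class, directory layout, matching file names, and resolution are assumptions for illustration.

```python
# Illustrative sketch of pairing stimuli with binary fixation maps; replace the
# directories with your own paths. Assumes image and fixation-map files share
# names across the two folders.
import os
from torch.utils.data import Dataset
from PIL import Image
import torchvision.transforms as T

class SaliencyDataset(Dataset):
    def __init__(self, image_dir, fixation_dir, size=(240, 320)):
        self.image_dir = image_dir
        self.fixation_dir = fixation_dir
        self.names = sorted(os.listdir(image_dir))
        self.img_tf = T.Compose([T.Resize(size), T.ToTensor()])
        self.map_tf = T.Compose([T.Resize(size), T.ToTensor()])

    def __len__(self):
        return len(self.names)

    def __getitem__(self, idx):
        name = self.names[idx]
        img = Image.open(os.path.join(self.image_dir, name)).convert("RGB")
        fix = Image.open(os.path.join(self.fixation_dir, name)).convert("L")
        return self.img_tf(img), self.map_tf(fix)
```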
Download the ImageNet pre-trained weights here.
python sal_train_tf.py
@inproceedings{he2019understanding,
title={Understanding and Visualizing Deep Visual Saliency Models},
author={He, Sen and Tavakoli, Hamed R and Borji, Ali and Mi, Yang and Pugeault, Nicolas},
booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
pages={10206--10215},
year={2019}
}