Skip to content

A tool is implemented to merge caffemodels according to a config file

Notifications You must be signed in to change notification settings

joshua19881228/mergeCaffeModel

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

mergeCaffeModel

A tool is implemented to merge caffemodels according to a config file. The source code can be found in this repo

Dependency

Any version of Caffe that has been successfully compiled.

Usage

Compile Source Code

Copy the source code to the corresponding folders in Caffe and compile. An executable file named as mergeCaffeModel should be generated in the "build/tools" folder.

Create Configuration File

A configuration file should be prepared. The file content should be like

[path to dst prototxt] [path to dst caffemodel]
[path to src prototxt] [path to src caffemodel] [auto_flag (0 or 1)] [num_pairs(int)] [src_layer_name_(1):dst_layer_name_(1)] ... [src_layer_name_(num_pairs):dst_layer_name_(num_pairs)]
[path to src prototxt] [path to src caffemodel] [auto_flag (0 or 1)] [num_pairs(int)] [src_layer_name_(1):dst_layer_name_(1)] ... [src_layer_name_(num_pairs):dst_layer_name_(num_pairs)]
...

The first line gives the path to wanted caffemodel's prototxt and the path to save the wanted caffemodel.

From the second line, source information are given. The first two strings gives the path to the source caffemodel's prototxt and the path to source caffemodel. Then a variable (0 or 1) is used to indicate whether to automatically copy weights when the src and dst layer names are same. (1 true, others false). The forth parameter tells how many pairs of src and dst layer would be processed. From then on, pairs are give with formant of "src_layer_name:dst_layer_name"

An example can be found below

dst.prototxt dst.caffemodel
src0.prototxt src0.caffemodel 1 0
src1.prototxt src1.caffemodel 0 1 layer_0:layer_dst_0

It means that all the layers in dst.caffemodel that share the same name with those in src0.caffemodel will be filled by the weights in src0.caffemodel as long as they have the exact same number of parameters. And the layer_dst_0 in dst.caffemodel will be filled by the weigths of layer_0 in src1.caffemodel.

Merge Caffe Model in Commond Line

After the configuration file is created, you can run a commond to merge the caffemodel.

build/tools/mergeCaffeModel path_to_config_file

Use the Module in Your Own Code

An example code is as follows

#include "mergeModel.h"

int main()
{
    string config_file_path = "test_config.txt";
    MergeModelClass test_class(config_file_path);
    test_class.mergeModel();

    return 0;
}

Of course you can write your own main function, which can take inputs from command line.

NOTE only float type is supported.

20170420

A class is defined to merge caffemodels.

Now the project can copy src weights to dst weights automatically if the two layers have same name. Further test will be carried out to confirm weights can be copied from src caffemodel using pairs.

20170423

Copying weights using pairs has been confirmed.

20170505

Update code to deal with newer virsion of Caffe. Add a complete example for merging models.

About

A tool is implemented to merge caffemodels according to a config file

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages