Skip to content

Latest commit

 

History

History
122 lines (88 loc) · 4.63 KB

README.md

File metadata and controls

122 lines (88 loc) · 4.63 KB

Landmark localization/detection

Train Predict Landmarks by dlib

This is how to train and use dlib shape predictior for landmark detection/localization.
I wanted to train it on my own dataset which is not face and I dont have more than one subject in each image.
Also I was not interested to use any detector before landmark predictior.
So, I tried to modify following codes to reach my aims.

Paper:

This code is used for the following research. If you found it usefull, please cite the following document:

https://www.nature.com/articles/s41598-020-58103-6

@article{eslami2020automatic, title={Automatic vocal tract landmark localization from midsagittal MRI data}, author={Eslami, Mohammad and Neuschaefer-Rube, Christiane and Serrurier, Antoine}, journal={Scientific Reports}, volume={10}, number={1}, pages={1--13}, year={2020}, publisher={Nature Publishing Group} }

Following repositories are also used for the mentioned paper:

https://github.com/mohaEs/Train-Predict-Landmarks-by-SFD

https://github.com/mohaEs/Train-Predict-Landmarks-by-DAN

https://github.com/mohaEs/Train-Predict-Landmarks-by-MCAM

https://github.com/mohaEs/Train-Predict-Landmarks-by-Autoencoder

https://github.com/mohaEs/Train-Predict-Landmarks-by-flat-net

setups:

Take care!
create and exploit new environment for your setup!
because dlib is not compatibale with many new version of other papckages and it will downgrade them seriously.

My python version was 3.5.2
install dlib and cv2 .
my versions were:
dlib 19.4.0
opencv-contrib-python 4.1.0.25
opencv-python 4.1.0.25

data preparation:

My dataset contains images with size 256x256 and landmarks were localized on it.
first I need to create the xml file for training should be created based on the dlib template.
You can modify code which is on MATLAB for generating your own xml file:
it is supposed images are in a folder (path_train_png) and the landmarks are saved in another file (path_train_lm) as csv files with same file names.

DirPngs=dir(fullfile(path_train_png,'*.png'));

fileID = fopen([path_train_png '4Dlib_training_images_with_landmarks.xml'],'w');   
fprintf(fileID, "<?xml version='1.0' encoding='ISO-8859-1'?> \n");
fprintf(fileID, "<?xml-stylesheet type='text/xsl' ?> \n");
fprintf(fileID, "<dataset>\n");
fprintf(fileID, "<name>Training landmarks</name>\n");
fprintf(fileID, "<images>\n");

for SampleCounter=1:length(DirPngs)
  ImageFileName= DirPngs(SampleCounter).name;
  LMs=csvread([path_train_lm ImageFileName(1:end-4) '.csv']);

  fprintf(fileID, "\t <image file='%s'> \n", ImageFileName);
  fprintf(fileID, "\t \t <box top='1' left='1' width='255' height='255'> \n", ImageFileName);
  for lm_counter=1:length(Ind_landmarks)
     fprintf(fileID, "\t \t \t <part name='%d' x='%d' y='%d'/> \n",lm_counter,LMs(lm_counter,1),LMs(lm_counter,2));
  end

  fprintf(fileID, "\t \t </box> \n");
  fprintf(fileID, "\t </image> \n");
%       type training_images_with_landmarks.xml
end
fprintf(fileID, "</images>\n");
fprintf(fileID, "</dataset>\n");
fclose(fileID);

The created xml file would be placed at the path_trainin_png folder.
Note that, the bounding box for all images are fixed around image which is one of my goal.
Also note that, in dlib x and y are assumed such as follow:
Alt text

Trainig predicting:

For training: suppose or set the path of input_folder (folder contains images and xml file) and output_folder (folder for saving trained model)

python Pr_Train.py input_folder output_folder
e.g.
python Pr_Train.py ./NewFaces256 ./Trained_Model

For prediction: suppose or set input_folder(folder contains test images (xml is not required)), trained checkpoint_folder (folder contains trained model), output_folder (folder for saving results) and Num_landmarks integer shows the number of landmarks which you trained (for this face example it is 68),

python Pr_test.py input_folder checkpoint_folder output_folder Num_landmarks
e.g.
python Pr_test.py ./NewFaces256 ./Trained_Model ./Results 68

result image file and corresponding csv file would be created.

Alt text Alt text
and notice that, as expected, because we do not use any detector it can not find faces in larger images: Alt text

Note:

Notice that, on your own dataset, the dlib predictor may not preserve the landmark orderings. in my case:
Alt text