preparing_data

Data preparation to use with our deep learning face recognition system

renamer.py - module to rename names of images in our dataset folder (format: Name_Surname_0001.png, Name_Surname_0002.png, etc.)

With any questions regarding usage, please send me a message to [email protected]

----- ADDITIONAL NOTES -----

Collect dataset => ready ------------------------------ refine, delete advertisement pages
Crop faces from images with multiple people in one folder => ready ------------------------------ crop.py => python crop.py --data-dir /home/ti/path_to_uncropped_data --target-dir /home/ti/path_to_destination_directory
Refine folder names and images => ready ------------------------------ rename_folders_v2.py (e.g. python rename_folders_v2.py --data-dir /home/ti/Downloads/crop_and_find/faces501) (this module renames folder names of each person: it merges all the letters, removing sepcial characters (.,-, etc) and splits the remaining string into two parts in the middle with underline '_' ------------------------------ renamer_lfw.py (rename image names according to LFW dataset format)
Select faces that correspond only to the anchored person => ready/revise ----------------------- recognize_and_store.py --known_dir /path/to/dir/with/anchor/people --unknown-dir /path/to/dir/with/images/divided/into/folders --target-dir /path/to/dir/where/to/save/the/resulting/folders (this module looks at the image in the known folder and compares the images with this images in this person's folder)
Divide into Train (80), Valid (10), Test (10) => ready
Align faces with - align_dataset_mtcnn_v1.py => ready
create .lst file with - insightface_pairs_gen.py --use write_lst function => ready
create .rec, .idx files using - face2rec2.py => ready
create pairs.txt file using gen_pairs_lfw.py => if you don't have loop in generate() function you have to write it with range(10), because you have to create 10-folds cross validation .bin
create .bin file using dataset2bin.py by setting => ready ------------------------------ usage: python dataset2bin.py --data-dir /your/dataset/directory --image-size 112,112 --output /output/directory/to/dave/bin
Yahoooooo, STAAAAARRRRTTTT TRRRRRRAAAAAIIIIINIIIING!!!

EXAMPLE FOLDERS: https://www.dropbox.com/s/i10tma4zqygdxdm/data.zip?dl=0

EXTRA: Parse folder with face_recognition => count number of occurences of each descriptor of the image => If one descriptor appears more than 10 times => write it to another folder

EXTRA1: Before cropping faces from collective images:

take known image
parse folder with images of this person
take one image
run dlib's get_frontal_face_detector on it
compare each face with known image
save the most similar one/ones

875798590 / preparing_data Goto Github PK

preparing_data's Introduction

preparing_data

preparing_data's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent