aizootech / facemaskdetection Goto Github PK

View Code? Open in Web Editor NEW

2.1K 57.0 618.0 57.77 MB

开源人脸口罩检测模型和数据 Detect faces and determine whether people are wearing mask.

Home Page: https://demo.aizoo.com/face-mask-detection.html

License: MIT License

Python 35.66% Jupyter Notebook 61.07% CMake 0.12% C++ 3.03% Shell 0.12%

detection pytorch caffe

facemaskdetection's Introduction

FaceMaskDetection

中文版 | English version

We open source all the popular deep learning frameworks' model and inference code to do face mask detection.

** Detect faces and determine whether they are wearing mask. **

** First of all, we hope the people in the world defeat COVID-2019 as soon as possible. Stay strong, all the countries in the world.**

We make face mask detection models with five mainstream deep learning frameworks （PyTorch、TensorFlow、Keras、MXNet和caffe） open sourced, and the corresponding inference codes.
We published 7971 images to train the models. The dataset is composed of WIDER Face and MAFA, we verified some wrong annotations. You can download here from Google drive, if you can not visit Google, you can download it from BaiduDisk, click here to know how to download.

We deployed a web page, you can click the link to experience it. The page is writen by Tensorflow.js.

AIZOO face mask detection

Model structure

We used the structure of SSD. However, in order to make it run quickly in the browser, the backbone network is lite. The total model only has 1.01M parametes.

Input size of the model is 260x260, the backbone network only has 8 conv layers. The total model has only 24 layers with the location and classification layers counted.

SSD anchor configurtion is show bellow:

multibox layers	feature map size	anchor size	aspect ratio）
First	33x33	0.04,0.056	1,0.62,0.42
Second		17x17	0.08,0.11
Third	9x9	0.16,0.22	1,0.62,0.42
Forth	5x5	0.32,0.45	1,0.62,0.42
Fifth	3x3	0.64,0.72	1,0.62,0.42

How to run

opencv

python opencv_dnn_infer.py  --img-path /path/to/your/img

paddle

on image：

python paddle_infer.py  --img-path /path/to/your/img

pytorch

on image：

python pytorch_infer.py  --img-path /path/to/your/img

on video：

python pytorch_infer.py --img-mode 0 --video-path /path/to/video  
# If you want to run with camera video, set  video_path to be 0
python pytorch_infer.py --img-mode 0 --video-path 0

TensorFlow/Keras/MXNet/Caffe

The other four frameworks running method is similar to pytorch, just replace pytorchwith tensorflow, keras,caffe，mxnet, if you want to use tensorflow, just run:

python tensorflow_infer.py  --img-path /path/to/your/img

Attention，for caffe's inference ，we use permute layer，so that we should use caffe-ssd，you can use opencv-dnn to do the inference.

Appendix

Feedback

If you use wechat, you can follow us.

** If you use Wechat, scan the code to add me on wechat **

Model structure

We merge the BN to Conv layers in order to accelerate the inference speed.

Testset PR curve

Our demo page

aizoo.com face mask detection demo

facemaskdetection's People

Contributors

Stargazers

Watchers

Forkers

gllm126 seasky100 sdzzlover highland2019 stormreaver fangbaolei lueyinny zhangming8 wxm2018 gc5218112 naqute bh0001 qaz734913414 annpeter mlbo runauto phoenixtreeember timverion xiamenwcy frankiegu boragocode dlutor romitavia ysnower xubaozhao lizhiyuanbest aliushn dbb4560 sfxrk baifanysu jsmilemsj ronden dearsunshine baiyuang davidding77 ccppasm99 zhouxiunai zerofo kk52099 hell-to-heaven xinlangchen usercgp zhangbo-tj allensmile allenmao liuwenhaha jianweilin tiantian-han xuguosheng110 liuxiaoliu321 hxhh zingway wishgale grabber berlinsilkroad shuangzixing89 developjoy myhome1998 czsgit zzdxlee tchigher ssh-shashi robinluog wenlong0913 yt-python felixzhang7 njustmxn zhangshengvvip guo-pu gsyn77 aicvhub quitino finerc asdlei99 jia0511 yuanqiujie hsp0285 jon-drugstore tomsirliu redblow simple1110 zoe-koala liulouis0512 sunmingyang1987 cwq63 yuhonghong7035 mangoluo etemical tony109060581 xxxmatcha zhangcj13 dzwpusa gonnavis wuxuan186 qsliao charlesnord ethan-zhu-code ufo2011 mengfanmin123 kioco

facemaskdetection's Issues

Tensorflow.js部署问题

One of the images in your dataset 'test_00000306.xml' has a false category

Thanks for your excellent work! I am using the dataset you provide to train a detection network. While turning the annotations into .json format, I found out that the image 'test_00000306.xml' in the val set has a category named 'face_nask'. I'm sure it's a typo, hope you can correct it. :)

特殊情况无法识别

看看这两块钱的口罩行不行？？

How to train the model

Hi, I'm just a novice and I would like to know if you can share the train code (for any model or pytorch) to setup a complete pipeline.

One more question is what are the differences between face_mask_detection and model360 ?

根据提供的模型可以导出onnx模型么？

尝试导出，发现并未成功

i open face_mask_detection.tflite via Netron, why the dimension of this are [1, 5972, 4] and [1,5972, 2] ?

I want to use face_mask_detection.tflite port into android.
and there is a question, why label just 2 type [NoMask, Mask], but the dimension of this file are [1, 5972, 4] and [1,5972, 2] ?

使用opencv的readNetFromCaffe，之后怎么做就不会了

输入大小固定，修改成520*520报错

输入大小被固定
2.泛化性较差

Train model on keras

How you have trained model on keras?
how to train model on new data on keras to detect accurately or upload keras training file.

什么

请问怎么使用这些文件

要用pytorch框架的话直接运行pytorch_infer文件就可以了吗？

请问MAFA数据集修改标记信息是怎样修改的，将人脸框外扩一定比例么

How to train on new dataset

Hi, First of all thanks for doing wonderful job.

I am new to this field but want to create for my society with custom images. Please help in training the model and use that for detection

训练时有基于欧拉角的增广吗?

整体bbox的性能很棒, 大角度戴口罩的侧脸也没有出现box变小的情况.
请问下是否做了基于欧拉角的增广? 还是说有什么特殊方法增强了这种情景的表现?
我训出来出现戴口罩大角度的box检测不准
(或者也可能是我自己的模型太轻了? 只有20w的参数量

How to train with caffe？

can you proviod the train code?

没用深色皮肤的做训练集吧

一些深色皮肤的人可以靠种族天赋轻松欺骗模型

detection network contains face alignment

Hello, may I ask if your detection network contains face alignment? Can I do face recognition based on the detection results of your network model

请问pytorch和tensorflow是什么版本啊

mxnet C++ version

Hi,

Is there mxnet C++ version of the code ?

Best

thx

How to run mxnet python on GPU?

Hi ,

How to run mxnet python on GPU ?

The code running on the cpu

C/C++调用caffe示例

非常感谢作者开源的模型和数据，很良心了。请问一下有C/C++调用caffe版模型的示例代码吗？

Model for edgetpu.

Hello and good job.
Could be possible have the model compiled for the Google Coral edgetpu?

Question of adjusting input size

Hi, I adjust the input size of model to 132*132. The size of feature_map_sizes and anchor_sizes are as follows：
feature_map_sizes = [[17, 17], [9, 9], [5, 5], [3, 3], [1, 1]]
anchor_sizes = [[0.08, 0.11], [0.16, 0.22], [0.32, 0.45], [0.64, 0.72], [1,1]]
anchor_ratios = [[1, 0.62, 0.42]] * 5
In order to improve the speed of the algorithm, have you ever adjusted the input size of model?

请问在keras或者tf2.0上如何将conv和bn进行融合？

请问在keras或者tf2.0上如何将conv和bn进行融合？
是否有代码教程？可以分享一下吗？
非常感谢~

请问tensorflow需要哪些依赖？ ts是1.0版本还是2.0呢

dataset download

please upload the dataset otherthan baidu. As baidu is blocking non-Chinese users

Cant run pytorch_infer.py

Hey after I cloned the repository and ran python -W pytorch_infer.py --img-path (path to my local image ) I get the following error (-W is to ignore warnings about SourceChangeWarning )

` img = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)
cv2.error: OpenCV(4.2.0) C:\projects\opencv-python\opencv\modules\imgproc\src\color.cpp:182: error: (-215:Assertion failed) !_src.empty() in function 'cv::cvtColor'```

请问keras版本？

请问您的训练代码所使用的是tf.keras（tf2）还是keras？

请问下这个问题怎么解决感谢

OpenCV Error: Assertion failed (scn == 3 || scn == 4) in cv::ipp_cvtColor, file ......\modules\imgproc\src\color.cpp, line 7341
Traceback (most recent call last):
File "tenforflow_infer.py", line 142, in
img = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)
cv2.error: ......\modules\imgproc\src\color.cpp:7341: error: (-215) scn == 3 || scn == 4 in function cv::ipp_cvtColor

如何把检测放到网站上

作者是怎么在网站上放这个程序的，有没有什么大概思路介绍一下

博主有训练好的.h5模型不

ValueError: operands could not be broadcast together with shapes

在Ubuntu系统下面测试pytorch_infer调用摄像头会遇到下面的报错：
FaceMaskDetection/utils/anchor_decode.py", line 21, in decode_bbox
predict_center_x = raw_outputs_rescale[:, :, 0:1] * anchors_w + anchor_centers_x
ValueError: operands could not be broadcast together with shapes (1,5456,1) (1,11000,1)

训练的人

我想训练。我如何获取数据集。感谢分享。

急急急，我使用flutter插件“ tflite”加载模型需要两个参数，一个为模型文件，另一个为标签文本文件，标签文本文件该怎么写呢？

tflite https://github.com/shaqian/flutter_tflite

return Tflite.loadModel(
      model: "assets/model_unquant.tflite",
      labels: "assets/labels.txt",
    );

也就是labels这个key对应的value为一个标签文本，那么这个文本该如何编写呢？

预训练权重

能否提供预训练权重？

Precision and Recall

There is a PR curve provided in the readme. Could you also please provide your best precision and recall? Thx

给infer.py增加保存截图、保存结果视频功能出错

windows pycharm连得anaconda的环境
改过的infer.py

-- coding:utf-8 --

import cv2
import time
import os
import argparse

import numpy as np
from PIL import Image
from keras.models import model_from_json
from anchor_generator import generate_anchors
from anchor_decode import decode_bbox
from nms import single_class_non_max_suppression
from playsound import playsound

model = model_from_json(open('models/face_mask_detection.json').read())
model.load_weights('models/face_mask_detection.hdf5')

anchor configuration

feature_map_sizes = [[33, 33], [17, 17], [9, 9], [5, 5], [3, 3]]
anchor_sizes = [[0.04, 0.056], [0.08, 0.11], [0.16, 0.22], [0.32, 0.45], [0.64, 0.72]]
anchor_ratios = [[1, 0.62, 0.42]] * 5

generate anchors

anchors = generate_anchors(feature_map_sizes, anchor_sizes, anchor_ratios)

for inference , the batch size is 1, the model output shape is [1, N, 4],

so we expand dim for anchors to [1, anchor_num, 4]

anchors_exp = np.expand_dims(anchors, axis=0)

id2class = {0: 'MASK', 1: 'NOMASK'}

def inference(image,
conf_thresh=0.5,
iou_thresh=0.4,
target_shape=(160, 160),
draw_result=True,
show_result=True
):
'''
Main function of detection inference
:param image: 3D numpy array of image
:param conf_thresh: the min threshold of classification probabity.
:param iou_thresh: the IOU threshold of NMS
:param target_shape: the model input size.
:param draw_result: whether to daw bounding box to the image.
:param show_result: whether to display the image.
:return:
'''
# image = np.copy(image)
output_info = []
height, width, _ = image.shape
image_resized = cv2.resize(image, target_shape)
image_np = image_resized / 255.0 # 归一化到0~1
image_exp = np.expand_dims(image_np, axis=0)

result = model.predict(image_exp)

y_bboxes_output = result[0]
y_cls_output = result[1]

# remove the batch dimension, for batch is always 1 for inference.
y_bboxes = decode_bbox(anchors_exp, y_bboxes_output)[0]
y_cls = y_cls_output[0]
# To speed up, do single class NMS, not multiple classes NMS.
bbox_max_scores = np.max(y_cls, axis=1)
bbox_max_score_classes = np.argmax(y_cls, axis=1)

# keep_idx is the alive bounding box after nms.
keep_idxs = single_class_non_max_suppression(y_bboxes,
                                                bbox_max_scores,
                                                conf_thresh=conf_thresh,
                                                iou_thresh=iou_thresh,
                                                )

for idx in keep_idxs:
    conf = float(bbox_max_scores[idx])
    class_id = bbox_max_score_classes[idx]
    bbox = y_bboxes[idx]
    # clip the coordinate, avoid the value exceed the image boundary.
    xmin = max(0, int(bbox[0] * width))
    ymin = max(0, int(bbox[1] * height))
    xmax = min(int(bbox[2] * width), width)
    ymax = min(int(bbox[3] * height), height)

    if draw_result:
        if class_id == 0:
            color = (0, 255, 0)
        else:
            color = (255, 0 , 0)

        cv2.rectangle(image, (xmin, ymin), (xmax, ymax), color, 2)
        cv2.putText(image, "%s: %.2f" % (id2class[class_id], conf), (xmin + 2, ymin-2),
                    cv2.FONT_HERSHEY_SIMPLEX, 0.8, color)
    output_info.append([class_id, conf, xmin, ymin, xmax, ymax])

if show_result:
    Image.fromarray(image).show()
return output_info

def run_on_video(video_path, output_video_name, conf_thresh):
cap = cv2.VideoCapture(video_path)
height = cap.get(cv2.CAP_PROP_FRAME_HEIGHT)
width = cap.get(cv2.CAP_PROP_FRAME_WIDTH)
fps = cap.get(cv2.CAP_PROP_FPS)
fourcc = cv2.VideoWriter_fourcc(*'XVID')
writer = cv2.VideoWriter(output_video_name, fourcc, int(fps), (int(width), int(height)))
total_frames = cap.get(cv2.CAP_PROP_FRAME_COUNT)
cv2.imwrite("/saimg" + '.img', cap.read())
if not cap.isOpened():
raise ValueError("Video open failed.")
return
status = True
idx = 0
while status:
start_stamp = time.time()
status, img_raw = cap.read()
cv2.imwrite("/saimg"+'.png', img_raw)
img_raw = cv2.cvtColor(img_raw, cv2.COLOR_BGR2RGB)
read_frame_stamp = time.time()
if (status):
inference(img_raw,
conf_thresh,
iou_thresh=0.5,
target_shape=(260, 260),
draw_result=True,
show_result=False)
cv2.imshow('image', img_raw[:,:,::-1])
cv2.waitKey(1)
inference_stamp = time.time()
writer.write(img_raw)
write_frame_stamp = time.time()
idx += 1
print("%d of %d" % (idx, total_frames))
print("read_frame:%f, infer time:%f, write time:%f" % (read_frame_stamp - start_stamp,
inference_stamp - read_frame_stamp,
write_frame_stamp - inference_stamp))
writer.release()

if name == "main":
parser = argparse.ArgumentParser(description="Face Mask Detection")
parser.add_argument('--img-mode', type=int, default=1, help='set 1 to run on image, 0 to run on video.')
parser.add_argument('--img-path', type=str, help='path to your image.')
parser.add_argument('--video-path', type=str, default='0', help='path to your video, 0 means to use camera.')
# parser.add_argument('--hdf5', type=str, help='keras hdf5 file')
args = parser.parse_args()
if args.img_mode:
imgPath = args.img_path
img = cv2.imread(imgPath)
img = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)
inference(img, show_result=True, target_shape=(260, 260))
else:
video_path = args.video_path
if args.video_path == '0':
video_path = 0
run_on_video(video_path, '', conf_thresh=0.5)
错误日志：
(base) C:\Users\xiaomi\PycharmProjects\untitled8\FaceMaskDetection-master>python infer.py --img-mode 0 --video-path 0
Using TensorFlow backend.
WARNING:tensorflow:From C:\Users\xiaomi\Anaconda3\lib\site-packages\keras\backend\tensorflow_backend.py:517: The name tf.p
laceholder is deprecated. Please use tf.compat.v1.placeholder instead.

WARNING:tensorflow:From C:\Users\xiaomi\Anaconda3\lib\site-packages\keras\backend\tensorflow_backend.py:4185: The name tf.
truncated_normal is deprecated. Please use tf.random.truncated_normal instead.

WARNING:tensorflow:From C:\Users\xiaomi\Anaconda3\lib\site-packages\keras\backend\tensorflow_backend.py:245: The name tf.g
et_default_graph is deprecated. Please use tf.compat.v1.get_default_graph instead.

WARNING:tensorflow:From C:\Users\xiaomi\Anaconda3\lib\site-packages\keras\backend\tensorflow_backend.py:174: The name tf.g
et_default_session is deprecated. Please use tf.compat.v1.get_default_session instead.

WARNING:tensorflow:From C:\Users\xiaomi\Anaconda3\lib\site-packages\keras\backend\tensorflow_backend.py:181: The name tf.C
onfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead.

WARNING:tensorflow:From C:\Users\xiaomi\Anaconda3\lib\site-packages\keras\backend\tensorflow_backend.py:186: The name tf.S
ession is deprecated. Please use tf.compat.v1.Session instead.

2020-03-03 16:44:54.419321: I tensorflow/core/platform/cpu_feature_guard.cc:145] This TensorFlow binary is optimized with
Intel(R) MKL-DNN to use the following CPU instructions in performance critical operations: AVX AVX2
To enable them in non-MKL-DNN operations, rebuild TensorFlow with the appropriate compiler flags.
2020-03-03 16:44:54.434687: I tensorflow/core/common_runtime/process_util.cc:115] Creating new thread pool with default in
ter op setting: 8. Tune using inter_op_parallelism_threads for best performance.
WARNING:tensorflow:From C:\Users\xiaomi\Anaconda3\lib\site-packages\keras\backend\tensorflow_backend.py:190: The name tf.g
lobal_variables is deprecated. Please use tf.compat.v1.global_variables instead.

WARNING:tensorflow:From C:\Users\xiaomi\Anaconda3\lib\site-packages\keras\backend\tensorflow_backend.py:199: The name tf.i
s_variable_initialized is deprecated. Please use tf.compat.v1.is_variable_initialized instead.

WARNING:tensorflow:From C:\Users\xiaomi\Anaconda3\lib\site-packages\keras\backend\tensorflow_backend.py:206: The name tf.v
ariables_initializer is deprecated. Please use tf.compat.v1.variables_initializer instead.

WARNING:tensorflow:From C:\Users\xiaomi\Anaconda3\lib\site-packages\keras\backend\tensorflow_backend.py:1834: The name tf.
nn.fused_batch_norm is deprecated. Please use tf.compat.v1.nn.fused_batch_norm instead.

WARNING:tensorflow:From C:\Users\xiaomi\Anaconda3\lib\site-packages\keras\backend\tensorflow_backend.py:133: The name tf.p
laceholder_with_default is deprecated. Please use tf.compat.v1.placeholder_with_default instead.

WARNING:tensorflow:From C:\Users\xiaomi\Anaconda3\lib\site-packages\keras\backend\tensorflow_backend.py:3976: The name tf.
nn.max_pool is deprecated. Please use tf.nn.max_pool2d instead.

WARNING:tensorflow:From C:\Users\xiaomi\Anaconda3\lib\site-packages\keras\backend\tensorflow_backend.py:4138: The name tf.
random_uniform is deprecated. Please use tf.random.uniform instead.

[ERROR:0] global C:\projects\opencv-python\opencv\modules\videoio\src\cap.cpp (415) cv::VideoWriter::open VIDEOIO(CV_IMAGE
S): raised OpenCV exception:

OpenCV(4.2.0) C:\projects\opencv-python\opencv\modules\videoio\src\cap_images.cpp:207: error: (-215:Assertion failed) !fil
ename.empty() in function 'cv::icvExtractPattern'

Traceback (most recent call last):
File "infer.py", line 159, in
run_on_video(video_path, '', conf_thresh=0.5)
File "infer.py", line 111, in run_on_video
cv2.imwrite("/saimg" + '.img', cap.read())
TypeError: Expected Ptrcv::UMat for argument 'img'
[ WARN:1] global C:\projects\opencv-python\opencv\modules\videoio\src\cap_msmf.cpp (674) SourceReaderCB::~SourceReaderCB t
erminating async callback

opencv版本：3.4.1

Changing map_location to CPU

Sorry if the question will result "out of topic".
I'm having the following error after having changed the model:
model = load_pytorch_model('models/new_model.pth');

Error:
RuntimeError: Attempting to deserialize object on a CUDA device but torch.cuda.is_available() is False. If you are running on a CPU-only machine, please use torch.load with map_location=torch.device('cpu') to map your storages to the CPU.

I'm using the pytorch_infer.py file.
How can I specify the map_location=torch.device('cpu') mapping?

I've tried with:
model = load_pytorch_model('models/new_model.pth', map_location=torch.device('cpu') );

but it doesn't work.
Sorry but I'm dealing with python only since a couple of days.

Best regards,
TK

Android App

Hi, how can we port it to android?

pre-trained model with tensorflow and GPU

I used your code in desktop with CPU and all it is ok.
Now I would test on Jetson Nano with GPU but I have problems.
I am using your pre-trained model for tensorflow.
Can you help me with the problem?
I have to setting something or the problem is the Jetson Nano?

Thanks

如何用GPU加速检测

想用GPU加速检测，现在的速度是0.15s一帧

tensorflow.js open source

Hello,

First off, great implementation. Thank you for sharing this.

I am wondering if you are planning to provide the tensorflow.js implementation. I converted the model from keras to tensorflow.js, but It seems that the post-processing part should be done in order to print the bboxes and detections of the masks.

Any idea or tip of how should be done?

Thank you in advance

<annotation>
	<folder>/home/daniel/Data/Dataset/FaceMaskNew/train_mafa</folder>
	<filename>test_00001347.xml</filename>
	<object>
		<name>face_mask</name>
		<difficult>0</difficult>
		<bndbox>
			<xmin>131</xmin>
			<ymin>71</ymin>
			<xmax>244</xmax>
			<ymax>219</ymax>
		</bndbox>
	</object>
</annotation>

这是test_00001347.jpg的xml文件

aizootech / facemaskdetection Goto Github PK

facemaskdetection's Introduction

FaceMaskDetection

We open source all the popular deep learning frameworks' model and inference code to do face mask detection.

Model structure

How to run

opencv

paddle

pytorch

TensorFlow/Keras/MXNet/Caffe

Appendix

Feedback

Model structure

Testset PR curve

Our demo page

facemaskdetection's People

Contributors

Stargazers

Watchers

Forkers

facemaskdetection's Issues

-- coding:utf-8 --

anchor configuration

generate anchors

for inference , the batch size is 1, the model output shape is [1, N, 4],

so we expand dim for anchors to [1, anchor_num, 4]

Recommend Projects

Recommend Topics

Recommend Org