
tf-simplehumanpose's Introduction

Simple Baselines for Human Pose Estimation and Tracking

Introduction

This repo is a TensorFlow implementation of Simple Baselines for Human Pose Estimation and Tracking (ECCV 2018) from MSRA, for 2D multi-person pose estimation from a single RGB image.

What this repo provides:

  • TensorFlow code for training and testing a 2D multi-person pose estimation network
  • Converters from MPII and PoseTrack annotations to MS COCO format (tool)
  • Pre-trained models and human detection results (see Results)
  • Visualization of estimated poses (output/vis)

Dependencies

This code is tested on Ubuntu 16.04 with CUDA 9.0 and cuDNN 7.1, using two NVIDIA GTX 1080 Ti GPUs.

Python 3.6.5 with Anaconda 3 is used for development.

Directory

Root

${POSE_ROOT} is organized as below.

${POSE_ROOT}
|-- data
|-- lib
|-- main
|-- tool
`-- output
  • data contains data loading code and soft links to the images and annotations directories.
  • lib contains kernel code for the 2D multi-person pose estimation system.
  • main contains high-level code for training and testing the network.
  • tool contains dataset converters. MS COCO is used as the reference format, and mpii2coco and posetrack2coco converting code is provided.
  • output contains logs, trained models, visualized outputs, and test results.

Data

You need to follow the directory structure of data as below.

${POSE_ROOT}
`-- data
    |-- MPII
    |   |-- dets
    |   |   `-- human_detection.json
    |   |-- annotations
    |   |   |-- train.json
    |   |   `-- test.json
    |   `-- images
    |       |-- 000001163.jpg
    |       `-- 000003072.jpg
    |-- PoseTrack
    |   |-- dets
    |   |   `-- human_detection.json
    |   |-- annotations
    |   |   |-- train2018.json
    |   |   |-- val2018.json
    |   |   `-- test2018.json
    |   |-- original_annotations
    |   |   |-- train/
    |   |   |-- val/
    |   |   `-- test/
    |   `-- images
    |       |-- train/
    |       |-- val/
    |       `-- test/
    |-- COCO
    |   |-- dets
    |   |   `-- human_detection.json
    |   |-- annotations
    |   |   |-- person_keypoints_train2017.json
    |   |   |-- person_keypoints_val2017.json
    |   |   `-- image_info_test-dev2017.json
    |   `-- images
    |       |-- train2017/
    |       |-- val2017/
    |       `-- test2017/
    `-- imagenet_weights
        |-- resnet_v1_50.ckpt
        |-- resnet_v1_101.ckpt
        `-- resnet_v1_152.ckpt
  • In tool, run python mpii2coco.py to convert the MPII annotation files to MS COCO format (MPII/annotations).
  • In tool, run python posetrack2coco.py to convert the PoseTrack annotation files to MS COCO format (PoseTrack/annotations).
  • In the training stage, the GT human bboxes are used. In the testing stage, human_detection.json is used; it has to be prepared before testing and must follow the MS COCO detection format (a minimal sketch follows this list).
  • Download the ImageNet pre-trained ResNet models from tf-slim and place them in data/imagenet_weights.
  • Except for the annotations of MPII and PoseTrack, all other directories are the original versions of the downloaded ones.
  • If you want to add your own dataset, you have to convert it to MS COCO format.
  • You can change the default directory structure of data by modifying dataset.py in each dataset folder.
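For reference, the sketch below shows a minimal COCO-style human_detection.json. The field names are the standard MS COCO result fields; the exact keys this repo reads should be checked against dataset.py of each dataset folder, and all values are illustrative.

# Minimal sketch of a COCO-format human detection result file;
# all values are illustrative.
import json

detections = [
    {
        "image_id": 1163,                     # id of the image in the annotation file
        "category_id": 1,                     # category 1 is "person" in MS COCO
        "bbox": [250.0, 100.0, 80.0, 220.0],  # [x, y, width, height]
        "score": 0.98,                        # detector confidence
    },
    # one entry per detected person
]

with open("human_detection.json", "w") as f:
    json.dump(detections, f)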

Output

You need to follow the directory structure of the output folder as below.

${POSE_ROOT}
`-- output
    |-- log
    |-- model_dump
    |-- result
    `-- vis
  • Creating the output folder as a soft link rather than a regular folder is recommended, since the outputs can take a lot of storage.
  • The log folder contains the training log file.
  • The model_dump folder contains the saved checkpoints for each epoch.
  • The result folder contains the final estimation files generated in the testing stage.
  • The vis folder contains visualized results.
  • You can change the default directory structure of output by modifying main/config.py.

Running TF-SimpleHumanPose

Start

  • Run pip install -r requirement.txt to install the required modules.
  • Run cd ${POSE_ROOT}/lib and make to build the NMS modules.
  • In main/config.py, you can change the model settings, including the dataset to use, the network backbone, the input size, and so on (see the sketch after this list).
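For example, a typical edit to main/config.py might look like the sketch below. The attribute names are illustrative and should be checked against the actual Config class; only the dataset, backbone, input size, and learning-rate schedule values quoted elsewhere in this README are taken from the document.

# Hypothetical settings in main/config.py; attribute names are
# illustrative and should be verified against the actual Config class.
dataset = 'COCO'           # 'COCO', 'PoseTrack', or 'MPII'
backbone = 'resnet50'      # network backbone
input_shape = (256, 192)   # input height and width
lr_dec_epoch = [90, 120]   # epochs at which the learning rate decays
end_epoch = 140            # total number of training epochs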

Train

In the main folder, run

python train.py --gpu 0-1

to train the network on GPUs 0 and 1.

If you want to continue an experiment from a saved checkpoint, run

python train.py --gpu 0-1 --continue

--gpu 0,1 can be used instead of --gpu 0-1.

Test

Place the trained model at output/model_dump/$DATASET/ and the human detection result (human_detection.json) at data/$DATASET/dets/.

In the main folder, run

python test.py --gpu 0-1 --test_epoch 140

to test the network on GPUs 0 and 1 with the model trained for 140 epochs. --gpu 0,1 can be used instead of --gpu 0-1.

Results

Here I report the performance of the model from this repo and from the original paper. I also provide pre-trained models and human detection results.

As this repo outputs result files compatible with MS COCO and PoseTrack, you can directly use cocoapi or poseval to evaluate results on the MS COCO or PoseTrack datasets (a minimal sketch follows below). To evaluate on the MPII dataset, you have to convert the produced mat file to the MPII mat format following this.
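As a minimal sketch, evaluating a COCO-format keypoint result file with the official cocoapi (pycocotools) looks like the following; the result-file path is illustrative.

# Evaluate a COCO-format keypoint result file with pycocotools.
from pycocotools.coco import COCO
from pycocotools.cocoeval import COCOeval

coco_gt = COCO('data/COCO/annotations/person_keypoints_val2017.json')
coco_dt = coco_gt.loadRes('keypoints_val2017_results.json')   # illustrative path

coco_eval = COCOeval(coco_gt, coco_dt, iouType='keypoints')
coco_eval.evaluate()
coco_eval.accumulate()
coco_eval.summarize()   # prints the AP/AR table format used below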

Results on MSCOCO 2017 dataset

For all methods, the same human detection results are used (the download link is provided below). For comparison, I used the pre-trained model from the original repo to report its performance. The table below shows APs on the COCO val2017 set.

Methods                            AP    AP.5  AP.75  AP(M)  AP(L)  AR    AR.5  AR.75  AR(M)  AR(L)  Download
256x192_resnet50 (this repo)       70.4  88.6  77.8   67.0   76.9   76.2  93.0  83.0   71.9   82.4   model / pose
256x192_resnet50 (original repo)   70.3  88.8  77.8   67.0   76.7   76.1  93.0  82.9   71.8   82.3   -
  • Human detection result on val2017 (55.3 AP on human class) and test-dev2017 (57.2 AP on human class) [bbox]
  • Other human detection results on val2017 [Detectron_MODEL_ZOO]

Results on PoseTrack 2018 dataset

The model pre-trained on the COCO dataset is used for training on the PoseTrack dataset, following the paper. After training on COCO, I set lr, lr_dec_epoch, and end_epoch in config.py to 5e-5, [150, 155], and 160, respectively (as in the snippet below), then ran python train.py --gpu $GPUS --continue. The table below shows APs on the validation set.
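In other words, the fine-tuning stage amounts to a change like the snippet below in config.py; the attribute names are taken from the description above and should be verified against the actual Config class.

# Fine-tuning settings for PoseTrack, applied after training on COCO.
lr = 5e-5
lr_dec_epoch = [150, 155]
end_epoch = 160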

Methods                                Head  Shoulder  Elbow  Wrist  Hip   Knee  Ankle  Total  Download
256x192_resnet50 (bbox from detector)  74.4  76.9      72.2   65.2   69.2  70.0  62.9   70.4   model / pose
256x192_resnet50 (bbox from GT)        87.9  86.7      80.2   72.5   77.0  77.8  74.6   80.1   model / pose
  • Human detection result on validation set [bbox]

Troubleshooting

  1. Add graph.finalize when your machine takes more memory as training goes on (see the sketch after this list). [issue]

  2. For those who suffer from FileNotFoundError: [Errno 2] No such file or directory: 'tmp_result_0.pkl' in the testing stage, please prepare the human detection result properly. The pkl files are generated and deleted automatically in the testing stage, so you don't have to prepare them. Most of the time, this error comes from an improper human detection file.
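The sketch below illustrates the graph.finalize workaround from item 1, assuming a TF1-style training loop like the one in this repo; the toy graph here is illustrative.

# Freeze the graph before the training loop so that any op accidentally
# created inside the loop raises an error instead of silently leaking memory.
import tensorflow as tf

x = tf.Variable(0.0)               # toy graph; the repo builds its own network
train_op = tf.assign_add(x, 1.0)

sess = tf.Session()
sess.run(tf.global_variables_initializer())
sess.graph.finalize()

for itr in range(100):
    sess.run(train_op)             # runs existing ops without growing the graph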

Acknowledgements

This repo is largely modified from the TensorFlow repo of CPN and the PyTorch repo of Simple Baselines.

Reference

[1] Xiao, Bin, Haiping Wu, and Yichen Wei. "Simple Baselines for Human Pose Estimation and Tracking". ECCV 2018.

tf-simplehumanpose's People

Contributors

mks0601, tucan9389


tf-simplehumanpose's Issues

Multi GPU

Hello, I have two questions.
I am trying to restore the pre-trained model you uploaded and test it on COCO val2017, but I get an error at MultiGPUFunc.work() in test.py. Can snapshot_140.ckpt only be restored in a multi-GPU (2-GPU) environment?
Additionally, when I run test.py, load_pkl raises an error saying the file tmp_result_0.pkl does not exist. Is this related to the above, or is there a file I haven't added?

Thank you.

PoseTrack18 results?

Hi, your code is very good, and recently I have been working on PoseTrack (all 3 tasks) too.
I used MSRA's PyTorch code (Simple Baselines) to fine-tune on the PoseTrack 2018 dataset, with all parameters set according to the paper.
Using GT boxes on the PoseTrack 2018 dataset for human pose estimation, I get results that are worse than what you reported.
My eval results:

Loading data
('# gt frames :', 3902)
('# pred frames:', 3902)
Evaluation of per-frame multi-person pose estimation
('saving results to', './out/total_AP_metrics.json')
Average Precision (AP) metric:
& Head & Shou & Elb & Wri & Hip & Knee & Ankl & Total\
& 49.4 & 89.2 & 83.9 & 76.1 & 81.0 & 80.3 & 74.6 & 74.6 \

Can you tell me how to fine-tune the model? I want to get the same results as you!
Thank you!
BTW, do you plan to implement the tracking code from this paper (Simple Baselines for Human Pose Estimation and Tracking)? I am working on that part...

tf.stop_gradient

Hi,

Could someone explain why tf.stop_gradient is applied to gt_heatmap? I mean this line from model.py:
gt_heatmap = tf.stop_gradient(self.render_gaussian_heatmap(target_coord, cfg.output_shape, cfg.sigma))

I don't see any dependency on tf.Variables in the render_gaussian_heatmap() graph.

mpii to coco format

I downloaded the 12 GB MPII dataset and ran mpii2coco.py, but got this error:

FileNotFoundError: [Errno 2] No such file or directory: '../images/015601864.jpg'

I checked the images, and there is no 015601864.jpg. How can I solve this?

__cudaRegisterFatBinaryEnd

I have not changed anything in the code and am using the configurations described in the project. Training finished, but I am getting an error. Does anyone have any idea? The traceback is:
Traceback (most recent call last):
File "test.py", line 24, in
from nms.nms import oks_nms
File "/datadrive/common/Ishrak/Integral_Pose_estimation/TF-SimpleHumanPose/main/../lib/nms/nms.py", line 14, in
from .gpu_nms import gpu_nms
ImportError: /datadrive/common/Ishrak/Integral_Pose_estimation/TF-SimpleHumanPose/main/../lib/nms/gpu_nms.cpython-36m-x86_64-linux-gnu.so: undefined symbol: __cudaRegisterFatBinaryEnd

Best
Ishrak
Technos Data Science Engineering Inc, Tokyo

resnet50???

blocks = [
    resnet_utils.Block('block1', bottleneck,
                       [(256, 64, 1)] * 2 + [(256, 64, 1)]),
    resnet_utils.Block('block2', bottleneck,
                       [(512, 128, 2)] + [(512, 128, 1)] * 3),
    resnet_utils.Block('block3', bottleneck,
                       [(1024, 256, 2)] + [(1024, 256, 1)] * 5),
    resnet_utils.Block('block4', bottleneck,
                       [(2048, 512, 2)] + [(2048, 512, 1)] * 2)
]

Do these blocks define ResNet-50? If so, why does the code compute net, net2, net3, and net4 and return

resnet_features = [net, net2, net3, net4]
return resnet_features

What does that mean?

self.lr_dec_epoch.index(e)

In config.py, the learning-rate code is:

if epoch < self.lr_dec_epoch[-1]:
    i = self.lr_dec_epoch.index(e)

What is lr_dec_epoch.index(e)? lr_dec_epoch is [90, 120], but what is e, and what does index(e) return here?

python dependencies should be updated

There are some unused dependencies that should be removed from requirement.txt:

  • easydict: unused, and it will downgrade Python to 2.7 if you use conda install
  • opencv-python: I think opencv is enough

Some dependencies should be added:

  • pillow
  • matplotlib
  • opencv

Here is my conda list. It works but may contain unused dependencies. You can copy it to a file and use conda env create -n TF-SimpleHumanPose -f your_file_name to create an identical conda environment. Hope it will be helpful:

name: vac_py3_n
channels:
  - conda-forge
  - anaconda
  - defaults
dependencies:
  - _tflow_180_select=1.0=gpu
  - absl-py=0.6.1=py36_0
  - arrow-cpp=0.11.1=py36h5c3f529_0
  - astor=0.7.1=py36_0
  - bleach=1.5.0=py36_0
  - c-ares=1.15.0=h7b6447c_1
  - cudatoolkit=9.0=h13b8566_0
  - cudnn=7.1.2=cuda9.0_0
  - cupti=9.0.176=0
  - gast=0.2.0=py36_0
  - gflags=2.2.2=he6710b0_0
  - glog=0.3.5=hf484d3e_1
  - html5lib=0.9999999=py36_0
  - intel-openmp=2019.1=144
  - keras-applications=1.0.6=py36_0
  - keras-base=2.2.4=py36_0
  - keras-gpu=2.2.4=0
  - keras-preprocessing=1.0.5=py36_0
  - libboost=1.67.0=h46d08c1_4
  - libedit=3.1.20170329=h6b74fdf_2
  - libffi=3.2.1=hd88cf55_4
  - libgcc-ng=8.2.0=hdf63c60_1
  - libgfortran-ng=7.3.0=hdf63c60_0
  - libopenblas=0.3.3=h5a2b251_3
  - libprotobuf=3.6.1=hd408876_0
  - libsodium=1.0.16=h1bed415_0
  - libstdcxx-ng=8.2.0=hdf63c60_1
  - lz4-c=1.8.1.2=h14c3975_0
  - markdown=3.0.1=py36_0
  - mkl=2019.1=144
  - ncurses=6.1=he6710b0_1
  - numpy-base=1.15.4=py36h2f8d375_0
  - olefile=0.46=py36_0
  - pandas=0.23.4=py36h04863e7_0
  - pillow=5.3.0=py36h34e0f95_0
  - pip=18.1=py36_0
  - protobuf=3.6.1=py36he6710b0_0
  - pyarrow=0.11.1=py36he6710b0_0
  - python-dateutil=2.7.5=py36_0
  - pytz=2018.7=py36_0
  - pyyaml=3.13=py36h14c3975_0
  - pyzmq=17.1.2=py36h14c3975_0
  - readline=7.0=h7b6447c_5
  - setuptools=40.6.2=py36_0
  - six=1.11.0=py36_1
  - snappy=1.1.7=hbae5bb6_3
  - tabulate=0.8.2=py36_0
  - tensorboard=1.8.0=py36hf484d3e_0
  - tensorflow=1.8.0=hb11d968_0
  - tensorflow-base=1.8.0=py36hc1a7637_0
  - tensorflow-gpu=1.8.0=h7b35bdc_0
  - termcolor=1.1.0=py36_1
  - tk=8.6.8=hbc83047_0
  - tqdm=4.28.1=py36h28b3542_0
  - werkzeug=0.14.1=py36_0
  - wheel=0.32.3=py36_0
  - xz=5.2.4=h14c3975_4
  - yaml=0.1.7=had09818_2
  - zeromq=4.2.5=hf484d3e_1
  - zlib=1.2.11=h7b6447c_3
  - zstd=1.3.3=h84994c4_0
  - atk=2.25.90=hf2eb9ee_1001
  - blas=1.1=openblas
  - boost-cpp=1.68.0=h11c811c_1000
  - bzip2=1.0.6=h14c3975_1002
  - ca-certificates=2018.11.29=ha4d7672_0
  - cairo=1.14.12=h80bd089_1005
  - certifi=2018.11.29=py36_1000
  - cycler=0.10.0=py_1
  - dbus=1.13.0=h4e0c4b3_1000
  - expat=2.2.5=hf484d3e_1002
  - ffmpeg=4.1=h6dce934_1000
  - fontconfig=2.13.1=h2176d3f_1000
  - freetype=2.9.1=h3cfcefd_1004
  - gdk-pixbuf=2.36.12=h4f1c04b_1001
  - gettext=0.19.8.1=h9745a5d_1001
  - giflib=5.1.4=h14c3975_1001
  - glib=2.56.2=had28632_1001
  - gmp=6.1.2=hf484d3e_1000
  - gnutls=3.6.5=hd3a4fd2_1001
  - gobject-introspection=1.56.1=py36h9e29830_1001
  - graphite2=1.3.13=hf484d3e_1000
  - grpcio=1.16.0=py36h4f00d22_1000
  - gst-plugins-base=1.12.5=h3865690_1000
  - gstreamer=1.12.5=h0cc0488_1000
  - gtk2=2.24.31=h5baeb44_1000
  - h5py=2.9.0=py36h31fdc65_1000
  - harfbuzz=1.9.0=he243708_1001
  - hdf5=1.10.4=nompi_h11e915b_1105
  - icu=58.2=hf484d3e_1000
  - jasper=1.900.1=h07fcdf6_1005
  - jpeg=9c=h14c3975_1001
  - kiwisolver=1.0.1=py36h6bb024c_1002
  - libevent=2.0.22=hb7f436b_1002
  - libiconv=1.15=h14c3975_1004
  - libpng=1.6.36=h84994c4_1000
  - libtiff=4.0.10=h648cc4a_1001
  - libuuid=2.32.1=h14c3975_1000
  - libwebp=1.0.1=h576950b_1000
  - libxcb=1.13=h14c3975_1002
  - libxml2=2.9.8=h143f9aa_1005
  - matplotlib=3.0.2=py36h8a2030e_1001
  - matplotlib-base=3.0.2=py36h167e16e_1001
  - mkl_fft=1.0.10=py36_0
  - mkl_random=1.0.2=py36_0
  - msgpack-python=0.6.0=py36h6bb024c_1000
  - nettle=3.4.1=h14c3975_1002
  - numpy=1.16.0=py36_blas_openblash1522bff_1000
  - openblas=0.3.3=h9ac9557_1001
  - opencv=3.4.4=py36_blas_openblash85ad109_1203
  - openh264=1.8.0=hdbcaa40_1000
  - openssl=1.0.2p=h14c3975_1002
  - pango=1.40.14=hf0c64fd_1003
  - parsedatetime=2.4=py_1
  - pcre=8.41=hf484d3e_1003
  - pixman=0.34.0=h14c3975_1003
  - pthread-stubs=0.4=h14c3975_1001
  - pyparsing=2.3.1=py_0
  - pyqt=5.6.0=py36h13b7fb3_1008
  - python=3.6.6=hd21baee_1003
  - qt=5.6.2=hf516382_1011
  - scikit-learn=0.20.2=py36_blas_openblashebff5e3_1400
  - scipy=1.2.0=py36_blas_openblash1522bff_1200
  - setproctitle=1.1.10=py36h14c3975_1001
  - sip=4.18.1=py36hf484d3e_1000
  - thrift-cpp=0.11.0=h23e226f_1003
  - tornado=5.1.1=py36h14c3975_1000
  - x264=1!152.20180717=h14c3975_1001
  - xorg-kbproto=1.0.7=h14c3975_1002
  - xorg-libice=1.0.9=h14c3975_1004
  - xorg-libsm=1.2.3=h4937e3b_1000
  - xorg-libx11=1.6.6=h14c3975_1000
  - xorg-libxau=1.0.8=h14c3975_1006
  - xorg-libxdmcp=1.1.2=h14c3975_1007
  - xorg-libxext=1.3.3=h14c3975_1004
  - xorg-libxrender=0.9.10=h14c3975_1002
  - xorg-libxt=1.1.5=h14c3975_1002
  - xorg-renderproto=0.11.1=h14c3975_1002
  - xorg-xextproto=7.3.0=h14c3975_1002
  - xorg-xproto=7.0.31=h14c3975_1007
  - cython=0.29.2=py36he6710b0_0
  - sqlite=3.26.0=h7b6447c_0
  - pip:
    - keras==2.2.4
    - msgpack==0.6.0
    - msgpack-numpy==0.4.4.2
prefix: /data1/anaconda3/envs/vac_py3_n

strange results

Hello, I have trained on my own dataset with this code, but the results are very strange; it seems the model can't learn anything. I have checked my dataset carefully and it is correct. Could you give me some advice on how to solve this? The result shown below is from about 40 epochs of training. Any suggestions would help me a lot, thank you very much.

Evaluate annotation type keypoints
DONE (t=52.07s).
Accumulating evaluation results...
DONE (t=11.49s).
Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets= 20 ] = 0.001
Average Precision (AP) @[ IoU=0.50 | area= all | maxDets= 20 ] = 0.001
Average Precision (AP) @[ IoU=0.75 | area= all | maxDets= 20 ] = 0.001
Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets= 20 ] = 0.001
Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets= 20 ] = 0.001
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 20 ] = 0.057
Average Recall (AR) @[ IoU=0.50 | area= all | maxDets= 20 ] = 0.080
Average Recall (AR) @[ IoU=0.75 | area= all | maxDets= 20 ] = 0.060
Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets= 20 ] = 0.056
Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets= 20 ] = 0.061

When should I stop training?

Are there other indicators besides the epoch loss that tell me when to stop?
The epoch loss fluctuates heavily, and it's hard to determine whether the model has converged.

OOM while training

Hi mks,

First of all, thanks for making such an amazing implementation of SHP. However, I encounter an OOM issue while training the model: the longer I train, the more memory is occupied. The way I fixed it was to simply add a "graph.finalize" before the training iterations.

Please just let me know If you need more info.
Thanks

test.py outputs -1 for Average Precision and Average Recall metrics

I ran test.py on the COCO test2017 dataset with human_detection_test-dev2017.json renamed to human_detection.json. However, the output values of all the Average Precision and Average Recall metrics are -1. Here is the snapshot:

Screenshot from 2019-07-23 11-05-01

I am unsure whether the problem is that human_detection_test-dev2017.json is not the right file to use.

Could you please help? Thanks in advance.

Transfer Learning on Custom Data

Hey,
Thanks for the pre-trained models.
I want to do transfer learning on top of them with my custom dataset.
I have the custom training images tagged in COCO format and saved in JSON format.
Can you please guide me on how to go ahead from here?

Getting the same results as announced on the github

Hello, thanks for your work.
I am currently struggling to get the same results as the ones announced on the GitHub page.
Here are the results I get using your code and your pretrained model:

Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets= 20 ] = 0.278
Average Precision (AP) @[ IoU=0.50 | area= all | maxDets= 20 ] = 0.666
Average Precision (AP) @[ IoU=0.75 | area= all | maxDets= 20 ] = 0.172
Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets= 20 ] = 0.290
Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets= 20 ] = 0.285
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 20 ] = 0.341
Average Recall (AR) @[ IoU=0.50 | area= all | maxDets= 20 ] = 0.726
Average Recall (AR) @[ IoU=0.75 | area= all | maxDets= 20 ] = 0.271
Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets= 20 ] = 0.324
Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets= 20 ] = 0.365

With the useGTbbox flag on, I get:

Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets= 20 ] = 0.281
Average Precision (AP) @[ IoU=0.50 | area= all | maxDets= 20 ] = 0.673
Average Precision (AP) @[ IoU=0.75 | area= all | maxDets= 20 ] = 0.169
Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets= 20 ] = 0.292
Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets= 20 ] = 0.284
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 20 ] = 0.324
Average Recall (AR) @[ IoU=0.50 | area= all | maxDets= 20 ] = 0.700
Average Recall (AR) @[ IoU=0.75 | area= all | maxDets= 20 ] = 0.253
Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets= 20 ] = 0.314
Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets= 20 ] = 0.340

Is there something I misunderstood about the way to evaluate the model?
Should I pass the result.pkl into the COCO API to get new scores?

Thank you very much for your time.

Could you share the training details on Posetrack 18?

I am sorry to ask this question, but I barely know TF. Could you please describe what you did and what parameters you used during training? Did you follow the training technique used on PoseTrack 17 in the original paper?

human detection json

How do I get the human detection JSON? Does the MPII dataset include it?

Does it mean getting the people's bounding boxes from the images?

CPU only support

Hello. Since you seem to be Korean, I am asking in Korean.

I am trying to run this project on macOS and am running into difficulties, so I have a few questions.

  1. Was a CPU-only environment not considered for this project?
  2. If a CPU-only environment was not considered, may I add support for running CPU-only and submit a PR?

Thank you :-)

License

Hello!

What license is this released under?

Some confusion about the repo

Thank you for your work. You are very cool. I have watched both of your projects. I have some questions about this repo. Does this repo only implement 2D human pose estimation with the bboxes already in place? The Simple Baselines paper has a lot of material about detector bboxes, propagated bboxes, optical-flow tracking, and so on, which neither the original repo nor yours seems to cover. So the tracking implementation is not open source?

Inference with Pre-trained Models

Hello, and thank you for this amazing work.
I have an image, and I used a pre-trained object detection model to get class IDs, scores, and bounding boxes.
Using these, how can I now run the pose estimation model?

sess run with dynamic batch size

Hi mks,
I am running inference by loading the ckpt file; the input node name I chose is "tower_0/Placeholder", based on the tester.predict function you wrote. The ideal shape should be (None, 256, 192, 3), but I don't know why the shape I get is (32, 256, 192, 3). The code is the following.

saver = tf.train.import_meta_graph('location of meta file',clear_devices=True)
sess = tf.Session() 
saver.restore(sess, 'location of ckpt file') 
_input = sess.graph.get_tensor_by_name("tower_0/Placeholder:0")
print(_input)
<tf.Tensor 'tower_0/Placeholder:0' shape=(32, 256, 192, 3) dtype=float32>

Any idea how to change the batch size to None? Thanks

Tracking Code

Did you implement Flow Based Pose Tracking algorithm in the paper?

Could you please share the code?

Pre-Trained Model on Google Drive

Thanks for your work! Is it possible to upload the pre-trained model to Google Drive? The currently provided link is too slow for downloading (less than 100 KB/s here in Germany).

FileNotFoundError: [Errno 2] No such file or directory: 'tmp_result_0.pkl' and IndexError: list index out of range

Hello,

I am trying to run test.py on the provided pre-trained model 256x192_resnet50_coco, but I am getting the following errors:
Screenshot from 2019-07-22 17-02-53
Though I don't really understand what it does yet, I skimmed over the code and you seem to try to delete that file. Since I don't have that pickle file, I tried to bypass those blocks of code by setting the argument dump_method=2 in test.py, like so: MultiGPUFunc = MultiProc(len(args.gpu_ids.split(',')), func, dump_method=2), but then I encountered another error:
Screenshot from 2019-07-22 17-11-45
The command I used to run the program is python test.py --gpu 0 --test_epoch 140, as I have only one GPU card. I also tried changing the gpu argument to 1 and to 0-1, but the error stays the same.

As an aside, I am not sure which file is right for dets/human_detection.json, so I used the human_detection_test-dev2017.json from "Human detection result on test-dev2017 (57.2 AP on human class)" and renamed it to human_detection.json. I wonder if that is the cause of the error?

I need your help.

thanks in advance.

PoseTrack Training

Hi! Thanks for your repo.
The number of joints in PoseTrack is a little different from COCO. How did you handle the missing joints like left_ear and right_ear, and the different joints head_bottom and head_top?

eval ???

In model.py there is:

def make_network(self, is_train):
    backbone = eval(cfg.backbone)
    resnet_fms = backbone(image, is_train, bn_trainable=True)
    heatmap_outs = self.head_net(resnet_fms, is_train)

What does eval do in backbone = eval(cfg.backbone)? I do not understand this line of code.

PoseTrack18 AP=38.5 and provided pose download link only has 74 json files?

Hi @mks0601, I used the provided 256x192_resnet50 model and human detection results to test the accuracy on the PoseTrack 2018 dataset. The result is as follows:

Loading data
gt frames : 8923
pred frames: 8923
Evaluation of per-frame multi-person pose estimation
saving results to ./out/total_AP_metrics.json
Average Precision (AP) metric:
& Head & Shou & Elb & Wri & Hip & Knee & Ankl & Total\
& 39.1 & 41.3 & 39.6 & 35.6 & 38.5 & 38.9 & 36.5 & 38.5 \

Furthermore, I downloaded the pose results from the provided link (https://cv.snu.ac.kr/research/TF-SimpleHumanPose/PoseTrack/pose_result/person_keypoints_256x192_resnet50_val_results.zip), and the number of included json files is only 74, while the number of ground-truth files for the val set is 170, so the evaluation cannot be finished. I am not sure where the problem is. Could you give some suggestions? Thanks in advance.

issue__start training....

Hello,
First of all, thank you very much for sharing such excellent results.
When I run train.py on Windows 10, it stops at "start training......" as shown below and does not proceed any further. Have you experienced an error similar to this while implementing it?

[error screenshot]

Why is Encoder part trainable True?

In the original paper, the encoder part is frozen. Is there a reason why you chose trainable=True in the encoder (ResNet-50) of the code?
Thanks.

Unable to visualize

I can't see the visualization in the vis output after running the test. The test generated results and saved them in result.pkl. I commented out the following from lib/nms/nms.py:
from .gpu_nms import gpu_nms
to bypass the __cudaRegisterFatBinaryEnd error. Do you have any suggestions for why the vis folder is empty?

An example to inference on an image

Hi,

Thank you very much for the work. Can you give an example of how to run inference on an image with your pretrained weights? Thanks a lot.

Unable to get the dataset

The data folder has 3 subfolders, each with a dataset.py in it. I tried running the code but I couldn't get the datasets. The train and test code also does not work without the data.

tfflat

Hi,
I wonder who the author of the tfflat modules is. I have seen similar code in another repo. It reminds me a bit of the tensorpack style, but simpler. Is it part of some general library for training? Just curious.

Regards,
