---
title: 'OpenPoseCMU'
tags: CS
---

# Table of Contents
[TOC]

## References
[github](https://github.com/CMU-Perceptual-Computing-Lab/openpose)
[demo](https://github.com/CMU-Perceptual-Computing-Lab/openpose/blob/master/doc/demo_overview.md)
[openpose documentation (ENG)](https://github.com/CMU-Perceptual-Computing-Lab/openpose/blob/master/python/openpose/openpose_python.cpp#L194)
[openpose documentation (CHI)](https://blog.csdn.net/weixin_40802676/article/details/100830688)

## Human Pose Estimation Papers
[Hung-Chih Chiu](https://medium.com/@williamchiu0127)

## Lightweight OpenPose
[Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose](https://arxiv.org/pdf/1811.12004.pdf)

## OpenPose Windows Installation
### Some problems
1. When installing the Python API, `pyopenpose` cannot be found:
![](https://i.imgur.com/gFnDQqq.png)
Open Visual Studio and build the `pyopenpose` project:
![](https://i.imgur.com/i5ETKt2.png)
The `pyopenpose` library will then appear in the folder:
![](https://i.imgur.com/p84q1tc.png)

### Adding environment variables
1. Add the folder containing `openpose.dll` to the user variable `Path`:
![](https://i.imgur.com/7FN5RSa.png)
2. Add the folders containing the following three files to the system variable `PYTHONPATH`:
![](https://i.imgur.com/vORmIbK.png)
![](https://i.imgur.com/ZazerEx.png)

Once these are set, you can import openpose directly:
```python
import pyopenpose as op
```
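To verify the binding works end to end, here is a minimal sketch modeled on the official `01_body_from_image.py` tutorial. `model_folder` must point at your own models directory, and the sample image path assumes you run it from the openpose repo root; depending on the OpenPose version, `emplaceAndPop` takes either a plain Python list or an `op.VectorDatum`.
```python
import cv2
import pyopenpose as op

params = {"model_folder": "models/"}  # adjust to your own build

opWrapper = op.WrapperPython()
opWrapper.configure(params)
opWrapper.start()

datum = op.Datum()
datum.cvInputData = cv2.imread("examples/media/COCO_val2014_000000000192.jpg")
opWrapper.emplaceAndPop(op.VectorDatum([datum]))  # older builds: opWrapper.emplaceAndPop([datum])

print(datum.poseKeypoints)             # (people, parts, 3) array: x, y, confidence
cv2.imshow("OpenPose", datum.cvOutputData)
cv2.waitKey(0)
```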
## OpenPose: Save Video in a Different Format
- The default video save format is `.avi`. Saving as `.mp4` raises the error shown below:
![](https://i.imgur.com/Sz0OYXw.png)
- Solution: install `ffmpeg`
> sudo apt-get install ffmpeg

## OpenPose Models
[model link](https://github.com/CMU-Perceptual-Computing-Lab/openpose_train/tree/master/experimental_models)
![](https://i.imgur.com/At0eplV.png)
1. 100_135AlmostSameBatchAllGPUs
    - [Paper link (Whole-Body, 2019)](https://arxiv.org/pdf/1909.13423.pdf)
    - Feature extraction: 10 VGG layers
    - 4 wider & deeper PAF stages and 1 CM stage
2. 1_25BBkg
    - [Paper link (Whole-Body, 2019)](https://arxiv.org/pdf/1909.13423.pdf)
    - Feature extraction: 10 VGG layers
3. 1_25BSuperModel11FullVGG
    - [Paper link (Whole-Body, 2019)](https://arxiv.org/pdf/1909.13423.pdf)
    - Feature extraction: complete VGG layers
4. body_25
    - [Paper link (the TPAMI paper)](https://arxiv.org/pdf/1812.08008.pdf)
    - Feature extraction: 10 VGG layers

## OpenPose src code
[prototext](https://github.com/CMU-Perceptual-Computing-Lab/openpose_caffe_train/blob/master/src/caffe/proto/caffe.proto)
[oPDataTransformer](https://github.com/CMU-Perceptual-Computing-Lab/openpose_caffe_train/blob/master/src/caffe/openpose/oPDataTransformer.cpp)
[dataAugmentation](https://github.com/CMU-Perceptual-Computing-Lab/openpose_caffe_train/blob/master/src/caffe/openpose/dataAugmentation.cpp)

## OpenPose Detection Parts
### COCO (original) (17)
![](https://i.imgur.com/wi7vQmL.png)
### COCO as used in OpenPose (18)
![](https://i.imgur.com/r4zHc8w.png)
### BODY25 (25: COCO + mid-hip + 6 foot parts)
![](https://i.imgur.com/w4LTKiO.png)

## OpenPose Linux (Ubuntu) Installation Issues
1. After generation finished, running the src code displayed no image at all.
    + Sol: the OpenCV version was too new (4.3.0); downgrading to 4.2.0 restored normal display.
2. After generation finished, running the src code showed only the original image, with no skeleton drawn on it.
    + Sol: some of the model downloads had failed earlier (coco and mpi); re-downloading them and placing them in the correct location (models/......) produced normal results.

## OpenPose Training - Provided Dataset
[github](https://github.com/CMU-Perceptual-Computing-Lab/openpose_train)
### [Training Process](https://github.com/CMU-Perceptual-Computing-Lab/openpose_train/blob/master/training/README.md)
1. Get images
2. Annotate
3. Generate LMDB files
4. GPUing...

### Prepare OpenPose Training Files
[openpose_train.md](https://github.com/CMU-Perceptual-Computing-Lab/openpose_train/blob/master/training/README.md)
1. Generate LMDB files using the COCO dataset ([REF](https://www.immersivelimit.com/tutorials/create-coco-annotations-from-scratch/#coco-dataset-format))
![](https://i.imgur.com/jMo8C86.png)
    - Step b finds images without people.
    - Step e produces the json file used for training.
    - For the original COCO annotation (x, y, v):
        - v=0: not labeled (in which case x=y=0)
        - v=1: labeled but not visible
        - v=2: labeled and visible
    - For the refined json file:
        - v=0: labeled but not visible
        - v=1: labeled and visible
        - v=2: not labeled (in which case x=y=0)
    - ex. 000000345507.jpg
    ![](https://i.imgur.com/r8zFKyJ.png)
        - keypoints
            - "keypoints": ["nose","left_eye","right_eye","left_ear","right_ear","left_shoulder","right_shoulder","left_elbow","right_elbow","left_wrist","right_wrist","left_hip","right_hip","left_knee","right_knee","left_ankle","right_ankle"]
        - original
            - [115.000,170.000,2.000], [0.000,0.000,0.000], [107.000,161.000,2.000], [0.000,0.000,0.000], [74.000,165.000,2.000], [63.000,225.000,2.000], [76.000,240.000,2.000], [81.000,325.000,2.000], [94.000,342.000,2.000], [109.000,390.000,1.000], [140.000,413.000,2.000], [80.000,391.000,1.000], [92.000,416.000,2.000], [0.000,0.000,0.000], [0.000,0.000,0.000], [0.000,0.000,0.000], [0.000,0.000,0.000]
        - refined
            - [115.000,170.000,1.000], [0.000,0.000,2.000], [107.000,161.000,1.000], [0.000,0.000,2.000], [74.000,165.000,1.000], [63.000,225.000,1.000], [76.000,240.000,1.000], [81.000,325.000,1.000], [94.000,342.000,1.000], [109.000,390.000,0.000], [140.000,413.000,1.000], [80.000,391.000,0.000], [92.000,416.000,1.000], [0.000,0.000,2.000], [0.000,0.000,2.000], [0.000,0.000,2.000], [0.000,0.000,2.000]
    - In step f, run the command below:
    > python2 c_generateLmdbs.py
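    As a sanity check, the original→refined conversion above is just the permutation {0→2, 1→0, 2→1} applied to every third value. A minimal sketch for intuition only (the real conversion happens inside a4_coco_matToRefinedJson.m):

    ```python
    # Visibility re-mapping for the refined json (sketch, not the actual script).
    # original: 0 = not labeled, 1 = labeled/invisible, 2 = labeled/visible
    # refined:  2 = not labeled, 0 = labeled/invisible, 1 = labeled/visible
    ORIG_TO_REFINED = {0: 2, 1: 0, 2: 1}

    def refine_visibility(keypoints):
        """keypoints: flat COCO list [x1, y1, v1, x2, y2, v2, ...]."""
        out = list(keypoints)
        for i in range(2, len(out), 3):  # every v sits at index 2, 5, 8, ...
            out[i] = ORIG_TO_REFINED[int(out[i])]
        return out

    # e.g. [115.0, 170.0, 2.0, 0.0, 0.0, 0.0] -> [115.0, 170.0, 1, 0.0, 0.0, 2]
    ```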
2. Generate LMDB files (foot dataset)
    1. Follow the steps below:
    ![](https://i.imgur.com/aJarPLl.png)
        + step a: use the COCO dataset downloaded before. The foot annotation json files must be downloaded separately ([link](https://cmu-perceptual-computing-lab.github.io/foot_keypoint_dataset/)); their "keypoints" field contains 23 body parts (the original 17 + 6 foot keypoints).
        + step b:
            + Modify lines 32 and 33 to enable the foot option.
            + Modify lines 83 and 87 to match the downloaded foot annotation files.
        + step c:
            + Modify lines 15 and 17.
        + step d:
        > python2 c_generateLmdbs.py
3. Generate LMDB files using the MPII dataset ([link](http://human-pose.mpi-inf.mpg.de/#download))
    1. Follow the steps below:
    ![](https://i.imgur.com/I8SK4Gc.png)
    - For the original MPII annotation (x, y, is_visible):
        - is_visible=0 or []: labeled but not visible
        - is_visible=1: labeled and visible
    - For the refined json file:
        - v=0: labeled but not visible
        - v=1: labeled and visible
        - v=2: not labeled (in which case x=y=0)
    - ex. 070755336.jpg
    ![](https://i.imgur.com/p8XScfh.png)
        - keypoints
            - joint ids: 0 - r ankle, 1 - r knee, 2 - r hip, 3 - l hip, 4 - l knee, 5 - l ankle, 6 - pelvis, 7 - thorax, 8 - upper neck, 9 - head top, 10 - r wrist, 11 - r elbow, 12 - r shoulder, 13 - l shoulder, 14 - l elbow, 15 - l wrist
        - original
            - [{"x":181,"y":303,"id":6,"is_visible":true}, {"x":166,"y":150,"id":7,"is_visible":false}, {"x":167.546,"y":132.3504,"id":8,"is_visible":[]}, {"x":176.454,"y":30.6496,"id":9,"is_visible":[]}, {"x":305,"y":453,"id":0,"is_visible":false}, {"x":301,"y":355,"id":1,"is_visible":true}, {"x":160,"y":317,"id":2,"is_visible":true}, {"x":201,"y":289,"id":3,"is_visible":true}, {"x":332,"y":291,"id":4,"is_visible":false}, {"x":342,"y":395,"id":5,"is_visible":false}, {"x":253,"y":251,"id":10,"is_visible":true}, {"x":173,"y":259,"id":11,"is_visible":true}, {"x":138,"y":155,"id":12,"is_visible":true}, {"x":194,"y":145,"id":13,"is_visible":false}, {"x":199,"y":213,"id":14,"is_visible":false}, {"x":258,"y":239,"id":15,"is_visible":false}]
        - refined
            - [[305.0, 453.0, 0.0], [301.0, 355.0, 1.0], [160.0, 317.0, 1.0], [201.0, 289.0, 1.0], [332.0, 291.0, 0.0], [342.0, 395.0, 0.0], [181.0, 303.0, 1.0], [166.0, 150.0, 0.0], [167.546, 132.3504, 0.0], [176.454, 30.6496, 0.0], [253.0, 251.0, 1.0], [173.0, 259.0, 1.0], [138.0, 155.0, 1.0], [194.0, 145.0, 0.0], [199.0, 213.0, 0.0], [258.0, 239.0, 0.0]]
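    Putting the two conventions together, the MPII→refined conversion is roughly the function below — a sketch for intuition only (the actual logic lives in the openpose_train preprocessing scripts). Joints are re-ordered by id, is_visible true maps to 1, false or [] maps to 0, and joints absent from the annotation get (0, 0, 2):

    ```python
    # MPII -> refined-json conversion (sketch, not the actual script).
    NUM_JOINTS = 16

    def mpii_to_refined(joints):
        """joints: list of dicts like {"x": 181, "y": 303, "id": 6, "is_visible": True}."""
        refined = [[0.0, 0.0, 2.0] for _ in range(NUM_JOINTS)]  # 2 = not labeled
        for j in joints:
            visible = 1.0 if j["is_visible"] is True else 0.0   # False or [] -> 0
            refined[j["id"]] = [float(j["x"]), float(j["y"]), visible]
        return refined
    ```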
### Image Preprocessing Problem
1. When executing a3_coco_matToMasks.m, I got the error "Unable to open file **"/openpose-train/openpose_train/dataset/COCO/cocoapi/images/segmentation2017/train2017/xxxxxxxxxx.jpg"** for writing. You may not have write permission."
    + Sol: At first I assumed it was a write-permission problem with the MATLAB folder, so I reinstalled the entire MATLAB under /home/cmw/. The same error remained. :sweat::sweat: Then I changed the permissions of MATLAB and openpose-train from 755 to 775 — again, nothing. :sob::sob: Finally I noticed there was nothing under segmentation2017 (a train2017 folder should be there). I manually created the train2017 folder and, guess what, the problem was solved. :laughing::laughing:

### Generate LMDB files problems
```cmd
# caffe was installed for Python 2.7, so use python2 here as well
python2 c_generateLmdbs.py
```
1. ![](https://i.imgur.com/DeX54Mf.png)
    - Sol: In Python 2, indexing a `str` returns a `str`; in Python 3, `str` and `bytes` are distinct types, and indexing `bytes` returns an `int`. **It's a Python 2 vs Python 3 difference.**
    - [[Ref]](https://www.cnblogs.com/lshedward/p/9926150.html)
    - Ex.
    ```python=
    >>> a = "COCO"
    >>> type(a)
    <class 'str'>
    >>> type(a[0])
    <class 'str'>
    >>> b = b'COCO'
    >>> type(b)
    <class 'bytes'>
    >>> type(b[0])
    <class 'int'>
    ```
2. ![](https://i.imgur.com/LSGDnru.png)
    - Sol: [[Stackoverflow solution]](https://stackoverflow.com/questions/43805999/python3-and-not-is-python2-typeerror-wont-implicitly-convert-unicode-to-byt)

### Caffe Installation (please use python2.7)
![](https://i.imgur.com/S1iIrEk.png)
#### [Reference](https://blog.csdn.net/oJiMoDeYe12345/article/details/72900948)
1. "fatal error: hdf5.h: No such file or directory"
2. nccl.hpp:5:18: fatal error: nccl.h: No such file or directory
3. error: 'accumulate' is not a member of 'std'
![](https://i.imgur.com/aMdegsX.png)
    + Sol: https://stackoverflow.com/questions/7899237/function-for-calculating-the-mean-of-an-array-double-using-accumulate
4. recipe for target '.build_debug/lib/libcaffe.so.1.0.0' failed
![](https://i.imgur.com/MRThIGL.png)
    + Sol:
        - Uncomment `OPENCV_VERSION := 3`
        - ![](https://i.imgur.com/LnsH1sC.png)
5. ![](https://i.imgur.com/hgvh6fT.png)
    + Sol: Fuck no!

#### Reinstalling
- Alright, alright — even after solving so many problems, I STILL COULD NOT INSTALL OPENPOSE-TRAIN CAFFE!!
    + Finally, I deleted the entire folder and downloaded the whole repository again.
    + This time I followed this [tutorial](https://mc.ai/installing-caffe-on-ubuntu-18-04-with-cuda-and-cudnn/). Regarding the instructions in that blog: do not run `make clean` after a successful build (only use it when you want to rebuild).
    * If errors about missing packages show up, just install them with pip3.

### Generate the Caffe ProtoTxt and shell file for training
```
python2 d_setLayers.py
```
![](https://i.imgur.com/inC3vv4.png)
1. [Error when using python3.6](https://github.com/CMU-Perceptual-Computing-Lab/openpose_train/issues/28)
    - Sol: Use python2.7 instead.

### Resume Training
1. Modify the snapshot parameter in resume_train_pose.sh. The default path of the pretrained model is ***/home/cmw/openpose-train/openposetrain/training_results/pose/model/pose_iter_668000.solverstate***
![](https://i.imgur.com/cmoDhNh.png)
2. Then, on the command line, just run ***bash resume_train_pose.sh 0*** (the script is generated by d_setLayers.py) to resume training on a single GPU (GPU 0).

## OpenPose Training - Custom Dataset
### Understanding Dataset Annotation
- foot dataset: **the foot annotations labeled by the OpenPose team.** Compared with the 17 body parts provided by the original COCO dataset, it adds 6 more (23 in total). The annotation method: in the original COCO json, under annotations → keypoints, the x, y coordinates and visibility of the 6 new parts are appended at the end (as in the image below; a sketch of this append step follows it).
![](https://i.imgur.com/ZB8IipZ.png)
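A minimal sketch of that append step, assuming you already have the 6 foot points on hand (`foot_kps` below is a hypothetical list of (x, y, v) tuples, not a field of the real dataset):
```python
# Extend a flat COCO "keypoints" list (17 parts * [x, y, v]) with 6 foot
# keypoints appended at the end, as the foot dataset does (sketch only).
def append_foot_keypoints(coco_keypoints, foot_kps):
    assert len(coco_keypoints) == 17 * 3
    assert len(foot_kps) == 6
    extended = list(coco_keypoints)
    for x, y, v in foot_kps:
        extended.extend([x, y, v])
    return extended  # 23 parts * [x, y, v]
```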
### Understanding src code
(VS Code is recommended for tracing code :thumbsup::thumbsup:)
- Please follow the steps mentioned in **OpenPose Training - Provided Dataset**.
- Some code needs to be modified; see the following:
    - a4_coco_matToRefinedJson.m: generates the json file used for training, which is then used to build the LMDB files.
        - function reshapeKeypoints (*line 334*): in the COCO dataset every body part comes with x, y coordinates plus a visibility flag; this function rewrites the visibility values (for details see the explanation under [Openpose Preprocessing(openpose_train.md)](#Openpose-Preprocessing-openpose_trainmd) above, or the comments inside the function itself).
    - d_setLayers.py: generates the training, deploy, and solver .prototxt files needed for training. The actual file generation is done by calling generateProtoTxt.py; d_setLayers.py mainly assembles all the parameters (so when adding a custom dataset, modify this file — generateProtoTxt.py only receives the parameters and produces the .prototxt files).
        - e.g. which datasets are used, the paths of the LMDB folders, the network's shorthand names (translated into the corresponding Caffe layer names inside generateProtoTxt.py), etc.
    - c_generateLmdbs.py: generates the LMDB files; the training images and json files must be prepared beforehand. It mainly calls generateLmdbFile.py, but since that file handles many dataset cases and a custom dataset only needs the COCO-foot-style processing, I refactored it into a new file, generateCustomLmdbFile.py — currently foot-dataset only; we'll see what else needs changing later.
    - generateCustomLmdbFile.py: check which json fields it actually uses, and only annotate those fields when labeling data:
        1. dataset
        2. img_height
        3. img_width
        4. numOtherPeople
        5. people_index
        6. annolist_index
        7. objpos
        8. scale_provided
        9. joint_self
        10. joint_others (only read when an image contains multiple people)
        11. objpos_other (only read when an image contains multiple people)
        12. scale_provided_other (only read when an image contains multiple people)
- Generating the LMDB also uses masks. COCO has segmentation data while MPII does not, so you can follow MPII's approach and build the mask from bounding boxes.
- **Continuing the previous point: a bbox is not actually required for the mask either. Masks exist because datasets can be incompletely labeled; if your own dataset is fully labeled, the mask can simply be an image of the same size as the original with every pixel set to grayscale value 255 (white).** (See the sketch below.)
> Note: in COCO, if segmentation annotations exist, the bbox is generated from them.
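A minimal sketch of that all-white mask using OpenCV/NumPy (the file names are placeholders):
```python
import cv2
import numpy as np

# For a fully-labeled custom dataset, the mask can simply be an all-white
# (grayscale 255) single-channel image of the same size as the input image.
img = cv2.imread("your_image.jpg")
mask = np.full(img.shape[:2], 255, dtype=np.uint8)
cv2.imwrite("your_image_mask.png", mask)
```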
- If you want to customize the number of keypoints:
    - If you want to use your own labeled keypoints, i.e. a different skeleton, you must modify the files inside openpose-caffe and rebuild. **If you only reuse the same set of keypoints (e.g. the 23 keypoints of the foot dataset), it's enough to set "COCO_25B_23" in the models tag of training.prototxt.** If you use the same skeleton but a different dataset, just repeat it in the models tag, as in the figure (COCO_25B_23 repeated twice): ![](https://i.imgur.com/QesOAzU.png)
    - openpose_Caffe_train/src/caffe/openpose/poseModel.cpp
        - Update the function under Auxiliary functions (int poseModelToIndex) and the parameters of every array under "Parameters and functions to change if new PoseModel".
    - openpose_Caffe_train/include/caffe/openpose/poseModel.hpp
        - Add your own model inside enum class PoseModel (note: it must go below the existing models and above Size; the order affects how oPDataTransformer later computes the channels).
    - poseModel.cpp: defines the body-part keypoints corresponding to each dataset, how the skeleton is assembled (which parts are connected), which body parts the final model outputs, and so on.
    - oPDataTransformer.cpp: where the G.T. (ground truth) is generated.

### Custom dataset annotation
- [Annotation tool](https://github.com/jsbroks/coco-annotator)
- [Annotation tool setup](https://github.com/jsbroks/coco-annotator/wiki/Getting-Started)
- [Annotation tool tutorial](https://hackmd.io/@cmwchw/H1XVv7jB_)

#### Prepare Training LMDB file
1. a2_coco_jsonToMat.m
    + Add your custom dataset, e.g. lines 40, 89, 130, 288.
    + Put your custom coco-style annotation file into the folder: **openpose_train/dataset/COCO/cocoapi/annotations/your_dataset.json**
    + The file **your_dataset.mat** will be created at: **openpose_train/dataset/COCO/mat/your_dataset.mat**
2. a3_coco_matToMasks.m
    + Add your custom dataset, e.g. lines 40, 111, 128, 184.
    + Put your custom dataset into the folder: **openpose_train/dataset/COCO/cocoapi/images/your_dataset/**
    + The folders **mask2017** and **segmentation2017** will be created. (The folder names don't matter and can be changed in the code.)
    > **Caution**: according to line 155 of the src code, an annotation is dropped if it has fewer than 5 annotated keypoints.
3. a4_coco_matToRefinedJson.m
    + Add your custom dataset, e.g. lines 24, 136, 241.
    + The file your_dataset.json will be created at: **openpose_train/dataset/COCO/json/your_dataset.json**
4. c_generateLmdbs.py
    + Add your custom dataset, e.g. lines 33, 161.
    + Make sure the following files are ready:
        1. lmdb files: /openpose_train/dataset/your_dataset/![](https://i.imgur.com/Uc1qvvq.png)
        2. images: /openpose_train/dataset/COCO/cocoapi/images/![](https://i.imgur.com/ljOLz2z.png)
        3. annotation json file: /openpose_train/dataset/COCO/json/your_dataset.json![](https://i.imgur.com/vOUFydZ.png)
    + A folder **lmdb_your_dataset** will be created, with 2 files inside![](https://i.imgur.com/pI6E8tc.png)

## OpenPose with tracking
[Project Page](https://cmu-perceptual-computing-lab.github.io/spatio-temporal-affinity-fields/)
[Github](https://github.com/soulslicer/openpose/tree/staf)
1. Use the same method as when building openpose; remember to switch to the **staf** branch.
> git clone https://github.com/soulslicer/openpose.git -b staf
2. Currently, only the C++ version is available. Try the command below:
`./openpose.bin --model_pose BODY_21A --tracking 1 --render_pose 1 --video your_video`
3. Trying the Python API gives:
![](https://i.imgur.com/XyUjqCO.png)
It may be solved by the links below:
    - https://github.com/soulslicer/openpose/issues/5
    Go to this issue and check **sh0w**'s comment. sh0w modifies (1) the pybind part, to fix the data type conversion error, and (2) enables the tracking flag.
        - For the first modification, ***check the [link](https://github.com/sh0w/openpose/blob/41ee8a0621f2b44be1afa3b9c9283abf8b880a65/python/openpose/openpose_python.cpp#L436) and see line 436.***
        - For the second modification, ***check the [link](https://github.com/sh0w/openpose/blob/41ee8a0621f2b44be1afa3b9c9283abf8b880a65/python/openpose/openpose_python.cpp#L154) and see line 154.***
    - https://github.com/CMU-Perceptual-Computing-Lab/openpose/issues/1162

## OpenPose C++ API Extension
https://github.com/CMU-Perceptual-Computing-Lab/openpose/blob/master/doc/04_cpp_api.md
https://github.com/CMU-Perceptual-Computing-Lab/openpose/blob/master/examples/user_code/README.md

## Caffe: Draw the Loss Curve
- File paths:
    - caffe-master/tools/extra/parse_log.sh
    - caffe-master/tools/extra/extract_seconds.py
    - caffe-master/tools/extra/plot_training_log.py.example
- Parse the training log:
```cmd
./parse_log.sh your.log
```
- Draw the curve:
```cmd
python2 plot_training_log.py.example yourflag yourimg.png your.log
```
- yourflag:
```cmd
Notes:
    1. Supporting multiple logs.
    2. Log file name must end with the lower-cased ".log".
Supported chart types:
    0: Test accuracy vs. Iters
    1: Test accuracy vs. Seconds
    2: Test loss vs. Iters
    3: Test loss vs. Seconds
    4: Train learning rate vs. Iters
    5: Train learning rate vs. Seconds
    6: Train loss vs. Iters
    7: Train loss vs. Seconds
```
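For example, using chart type 6 from the table above to plot training loss against iterations into a file named train_loss.png (the output file name is arbitrary):
```cmd
python2 plot_training_log.py.example 6 train_loss.png your.log
```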