[1] Tompson J J, Jain A, LeCun Y, et al. Joint training of a convolutional network and a graphical model for human pose estimation[C]//Advances in Neural Information Processing Systems. 2014:1799-1807.
[2] Toshev A, Szegedy C, DeepPose G. Human pose estimation via deep neural networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Columbus, OH, USA. 2014:24-27.
[3] Sapp B, Taskar B. Modec:multimodal decomposable models for human pose estimation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2013:3674-3681.
[4] Gkioxari G, Toshev A, Jaitly N. Chained predictions using convolutional neural networks[C]//European Conference on Computer Vision. Springer, Cham, 2016:728-743.
[5] He K, Zhang X, Ren S, et al. Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. IEEE Transactions On Pattern Analysis and Machine Intelligence, 2015, 37(9):1904-1916.
[6] Girshick R. Fast r-cnn[C]//Proceedings of the IEEE International Conference on Computer Vision. 2015:1440-1448.
[7] Gkioxari G, Girshick R, Malik J. Contextual action recognition with r* cnn[C]//Proceedings of the IEEE International Conference on Computer Vision. 2015:1080-1088.
[8] He K, Gkioxari G, Dollár P, et al. Mask r-cnn[C]//Proceedings of the IEEE International Conference on Computer Vision. 2017:2961-2969.
[9] Ren S, He K, Girshick R, et al. Faster r-cnn:Towards real-time object detection with region proposal networks[C]//Advances in Neural Information Processing Systems. 2015:91-99.
[10] Everingham M, Van Gool L, Williams C K I, et al. The pascal visual object classes (voc) challenge[J]. International Journal of Computer Vision, 2010, 88(2):303-338.
[11] Johnson S, Everingham M. Clustered pose and nonlinear appearance models for human pose estimation[C]//BMVC 2010. doi:10.5244/c.24.12.
[12] MPI human pose dafaset[DB/OL].[2019-05-05]. http://human-pose.mpi-inf.mpg.de/#evaluation.
[13] Tompson J, Goroshin R, Jain A, et al. Efficient object localization using convolutional networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2015:648-656.
[14] Pishchulin L, Insafutdinov E, Tang S, et al. Deepcut:joint subset partition and labeling for multi person pose estimation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016:4929-4937.
[15] Wei S E, Ramakrishna V, Kanade T, et al. Convolutional pose machines[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016:4724-4732. |