Selected papers
2021
Danni. Xu, R.M. Hu, Z.X. Xiong, Z. Wang , et al. Trajectory is not Enough: Hidden Follower Detection,ACM Multimedia 2021. (CCF A会 accept)
Li X, Hu R, Wang Z, et al. Location Prediction via Bi-direction Speculation and Dual-level Association[J]. arXiv preprint arXiv:2106.15070, 2021.(CCF A会 accept)
Chenhao Hu, Ruimin Hu, Xiaochen Wang, Yulin Wu, Spatial Audio Object Coding Based on Time-Frequency Shifting and Scheduling, July 2021, Conference: 2021 IEEE International
Conference on Multimedia and Expo (ICME), DOI:10.1109/ICME51207.2021.9428297 (CCF B oral)
Chenhao Hu, Ruimin Hu, Xiaochen Wang, Yulin Wu, Efficient Multi-Step Audio Object Coding with Limited Residual Information, July 2021, Conference: 2021 IEEE International Conference on Multimedia and Expo (ICME), DOI:10.1109/ICME51207.2021.9428471 (CCF B oral)
Wenxin Huang, Dongyang Li, Ruimin Hu, Chao Liang, Person Retrieval in Physical World, July 2021, Conference: 2021 IEEE International Conference on Multimedia and Expo (ICME), DOI:10.1109/ICME51207.2021.9428411 (CCF B)
Yulin Wu, Ruimin Hu, Chenhao Hu, Shanfa Ke, Low Bitrates Audio Object Coding Using Convolutional Auto-Encoder and Densenet Mixture Model, July 2021, Conference: 2021 IEEE
International Conference on Multimedia and Expo (ICME), DOI:10.1109/ICME51207.2021.9428227 (CCF B oral)
Gang Li, Xiaochen Wang, Ruimin Hu, Huyin Zhang, Intelligibility Enhancement via Normal-to-Lombard Speech Conversion with Long Short-Term Memory Network and Bayesian
Gaussian Mixture Model, March 2021IEEE Transactions on Multimedia PP(99):1-1, DOI:10.1109/TMM.2021.3068565 (CCF B SCI 2区)
Wenqian Zhu, Zhongyuan Wang, Ruimin Hu, Dengshi Li, From Semantic to Spatial Awareness: Vehicle Re-Identification with Multiple Attention Mechanisms, January 2021IEEE Multimedia PP(99):1-1,DOI:10.1109/MMUL.2021.3052897 (SCI 2)
2020
Li G , Hu R , Zhang R , et al. A mapping model of spectral tilt in normal-to-Lombard speech conversion for intelligibility enhancement[J]. Multimedia Tools and Applications, 2020:1-21. (SCI,EI,中国计算机学会C类期刊)
Li D , Hu R , Huang W , et al. HMM-Based Person Re-identification in Large-Scale Open Scenario[M]// MultiMedia Modeling. 2020.
Hu C , Hu R , Wang X , et al. Multi-step Coding Structure of Spatial Audio Object Coding[M]// MultiMedia Modeling. 2020.
Chen, Wei & Hu, Ruimin & Wang, Xiaochen & Li, Dengshi. (2020). HRTF Representation with Convolutional Auto-encoder. MultiMedia Modeling, 605-616.
Li D, Hu R, Wang X, et al. Loudspeaker triplet selection based on low distortion within head for multichannel conversion of smart 3D home theater[J]. Concurrency and Computation: Practice and Experience, 2020, 32(13): e4796.
胡瑞敏,张亚浩,李登实,王晓晨,王超.基于逐阶共识计算的虚假物理身份属性检测方法[J].武汉大学学报(理学版),2020,66(02):103-110.
2013-2019
Wu T , Hu R , Wang X , et al. Audio object coding based on optimal parameter frequency resolution[J]. Multimedia Tools and Applications, 2019, 78(15):20723-20738. (SCI,EI,中国计算机学会C类期刊)
Zhu W, Hu R, Wang Z, et al. Deep Structural Feature Learning: Re-Identification of simailar vehicles In Structure-Aware Map Space.[C]. acm multimedia, 2019. (EI,中国计算机学会A类会议)
Wang X, Hu R, Wang Z, et al. Long Term Background Reference Based Satellite Video Coding[C]. international conference on acoustics speech and signal processing, 2019: 1822-1826. (EI,中国计算机学会B类会议 )
Chen Y, Hu R, Xiao J, et al. Multisource Surveillance Video Coding by Exploiting 3D and 2D Knolwedge[C]. international conference on acoustics speech and signal processing, 2019: 1787-1791.(EI,中国计算机学会B类会议 )
Chen Y, Hu R, Xiao J, et al. Multisource surveillance video coding with synthetic reference frame[J]. Journal of Visual Communication and Image Representation, 2019. (EI,中国计算机学会B类期刊 )
Chen Y, Hu R Xiao J, et al. Multisource surveillance video data coding with hierarchical knowledge library[J]. Multimedia Tools and Applications, 2019, 78(11): 14705-14731. (SCI,EI,中国计算机学会C类期刊)
Ke S, Hu R, Li G, et al. Multi-speakers Speech Separation Based on Modified Attractor Points Estimation and GMM Clustering[C]. international conference on multimedia and expo, 2019: 1414-1419. (EI,中国计算机学会B类会议)
Xu Z , Hu R, Chen J , et al. Semisupervised Discriminant Multimanifold Analysis for Action Recognition[J]. IEEE Transactions on Neural Networks and Learning Systems, 2019:1-12. (EI,中国计算机学会B类期刊)
Zhang R, Hu R, Li G, et al. Spectral Tilt Estimation for Speech Intelligibility Enhancement Using RNN Based on All-Pole Model[C]. conference on multimedia modeling, 2019: 144-156.
Lu S, Hu R, Liu J, et al. Structure Preserving Convolutional Attention for Image Captioning[J]. Applied Sciences, 2019, 9(14).
Zhang M, Hu R, Jiang L, et al. Three‐dimensional sound reproduction in vehicle based on data mining technique[J]. Concurrency and Computation: Practice and Experience, 2019, 31(4).
Li Q, Hu R,, Chen Y, et al. Vehicle Pose Estimation Using Mask Matching[C]. international conference on acoustics speech and signal processing, 2019: 1972-1976. (EI,中国计算机学会B类会议 )
Li G, Hu R,, Wang X, et al. A near-end listening enhancement system by RNN-based noise cancellation and speech modification[J]. Multimedia Tools and Applications, 2019, 78(11): 15483-15505. (SCI,EI,中国计算机学会C类期刊)
Ding X, Hu R,, Han Z, et al. A novel frontal facial synthesis algorithm based on individual residual face[C]//International Conference on Multimedia Modeling. Springer, Cham, 2018: 14-22. (EI)
Liao L, Hu R,, Xiao J, et al. Edge-aware context encoder for image inpainting[C]//2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2018: 3156-3160. (EI)
Li C, Hu R,, Liang C, et al. Faster seam carving for video retargeting[C]//2018 25th IEEE International Conference on Image Processing (ICIP). IEEE, 2018: 823-827. (EI,中国计算机学会C类会议)
Wang X, Hu R,, Xiao J. Frame Rate Conversion Based High Efficient Compression Method for Video Satellite[C]//Pacific Rim Conference on Multimedia. Springer, Cham, 2018: 35-44. (EI,中国计算机学会C类会议)
Chen W, Hu R,, Wang X, et al. Individualization of head related impulse responses using division analysis[J]. China Communications, 2018, 15(5): 92-103.(SCI)
Huang Z, Hu R,, Thierry B, et al. Multi-feature fusion based background subtraction for video sequences with strong background changes[C]//2017 IEEE International Conference on Image Processing (ICIP). IEEE, 2017: 3370-3374.
Wang Z, Hu R,, Chen C, et al. Person reidentification via discrepancy matrix and matrix metric[J]. IEEE transactions on cybernetics, 2017, 48(10): 3006-3020. (中国计算机学会B类期刊高引用)
Wang Z, Hu R, Yu Y, et al. Statistical Inference of Gaussian-Laplace Distribution for Person Verification[C]. acm multimedia, 2017: 1609-1617. (EI,中国计算机学会A类会议)
Jing X Y , Zhu X , Wu F , et al. Super-Resolution Person Re-Identification With Semi-Coupled Low-Rank Discriminant Dictionary Learning[J]. IEEE Transactions on Image Processing, 2017, 26(3):1363-1378. (SCI, 中国计算机学会A类期刊)
Wu T, Hu R, Wang X, et al. High quality audio object coding framework based on non-negative matrix factorization[J]. China Communications, 2017, 14(9): 32-41.
Jiang J, Hu R, Wang Z, et al. Facial Image Hallucination Through Coupled-Layer Neighbor Embedding[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2016, 26(9): 1674-1684.
Wang Z, Hu R, Yu Y, et al. Taichi distance for person re-identification[C]. international conference on acoustics, speech, and signal processing, 2017: 2052-2056. (EI,中国计算机学会C类会议)
Li Q, Hu R, Chen Y, et al. A Fine-Grained Filtered Viewpoint Informed Keypoint Prediction from 2D Images[C]. pacific rim conference on multimedia, 2017: 172-181.
Wang S, Hu R, Chen S, et al. 3D Sound Field Reproduction at Non Central Point for NHK 22.2 System[C]. conference on multimedia modeling, 2017: 3-14.
Huang W, Hu R, Liang C, et al. Structural superpixel descriptor for visual tracking[C]. international joint conference on neural network, 2017: 3146-3152.
Chen L, Hu R, Han Z, et al. A joint learning based Face Super Resolution approach via contextual topological structure[C]. international conference on acoustics, speech, and signal processing, 2017: 1088-1092. (EI,中国计算机学会C类会议)
Wang S, Hu R, Chen S, et al. Sound physical property matching between non central listening point and central listening point for NHK 22.2 system reproduction[C]. international conference on acoustics, speech, and signal processing, 2017: 436-440. (EI,中国计算机学会C类会议)
Hu R, Bao C, Zhao Q, et al. Recent development of speech and audio signal processing in network communication[J]. China Communications, 2017, 14(9).
Huang K, Hu R, Jiang J, et al. HRM graph constrained dictionary learning for face image super-resolution[J]. Multimedia Tools and Applications, 2017, 76(2): 3139-3162. (SCI,EI,中国计算机学会C类期刊)
Chen L, Hu R, Han Z, et al. Face super resolution based on parent patch prior for VLQ scenarios[J]. Multimedia Tools and Applications, 2017, 76(7): 10231-10254. (SCI,EI,中国计算机学会C类期刊)
Chen H, Chen J, Hu R, et al. Action recognition with temporal scale-invariant deep learning framework[J]. China Communications, 2017, 14(2): 163-172.
Chen L, Hu R, Liang C, et al. A novel face super resolution approach for noisy images using contour feature and standard deviation prior[J]. Multimedia Tools and Applications, 2017, 76(2): 2467-2493. (SCI,EI,中国计算机学会C类期刊)
Wang Z, Hu R, Yu Y, et al. Scale-adaptive low-resolution person re-identification via learning a discriminating surface[C]. international joint conference on artificial intelligence, 2016: 2669-2675. (EI,中国计算机学会A类会议)
Wu F, Jing X, You X, et al. Multi-view low-rank dictionary learning for image classification[J]. Pattern Recognition, 2016: 143-154. (EI,中国计算机学会B类期刊)
Ruan W , Chen J , Wang J , et al. Boosted local classifiers for visual tracking[C]// IEEE International Conference on Multimedia & Expo. IEEE Computer Society, 2016. (EI,中国计算机学会B类会议)
Gao L , Hu R , Wang X , et al. JND-based spatial parameter quantization of multichannel audio signals[J]. Eurasip Journal on Audio Speech & Music Processing, 2016, 2016(1).(A刊)
Xiao J, Hu R, Liao L, et al. Knowledge-Based Coding of Objects for Multisource Surveillance Video Data[J]. IEEE Transactions on Multimedia, 2016, 18(9): 1691-1706.
Xiong M, Chen J, Wang Z, et al. Person Re-Identification via Multiple Coarse-to-Fine Deep Metrics.[C]. european conference on artificial intelligence, 2016: 355-362. (EI,中国计算机学会B类会议)
Li D, Hu R, Wang X, et al. Multichannel reduction based on sound field within two ears[C]. international conference on multimedia and expo, 2016: 1-6. (EI,中国计算机学会B类会议)
Liao L, Hu R, Xiao J, et al. An Analysis-Oriented ROI Based Coding Approach on Surveillance Video Data[C]. pacific rim conference on multimedia, 2016: 428-438.
Lin J, Ruimin H, Xiaochen W, et al. Audio Bandwidth Extension Using Audio Super-Resolution[C]. pacific rim conference on multimedia, 2016: 540-549.
Wu T, Hu R, Gao L, et al. Analysis and Comparison of Inter-Channel Level Difference and Interaural Level Difference[C]. conference on multimedia modeling, 2016: 586-595.
Wang Z, Hu R, Liang C, et al. Zero-Shot Person Re-identification via Cross-View Consistency[J]. IEEE Transactions on Multimedia, 2016, 18(2): 260-272.(EI)
Wu T, Hu R, Gao L, et al. Analysis and Comparison of Inter-Channel Level Difference and Interaural Level Difference[C]. conference on multimedia modeling, 2016: 586-595.
Xu Z, Hu R, Chen J, et al. Global Contrast Based Salient Region Boundary Sampling for Action Recognition[C]. conference on multimedia modeling, 2016: 187-198.
Jiang J, Hu R, Wang Z, et al. CDMMA: Coupled discriminant multi-manifold analysis for matching low-resolution face images[J]. Signal Processing, 2016: 162-172.(SCI,中国计算机学会C类期刊)
Huang W, Hu R, Liang C, et al. Camera Network Based Person Re-identification by Leveraging Spatial-Temporal Constraint and Multiple Cameras Relations[C]. conference on multimedia modeling, 2016: 174-186.
Huang K, Hu R, Jiang J, et al. Face Image Super-Resolution Through Improved Neighbor Embedding[C]. conference on multimedia modeling, 2016: 409-420.
Zhang L, Hu R, Li D, et al. Adaptive Multichannel Reduction Using Convex Polyhedral Loudspeaker Array[C]. conference on multimedia modeling, 2016: 421-431.
Yang Y, Wang Y, Hu R, et al. Level Ratio Based Inter and Intra Channel Prediction with Application to Stereo Audio Frame Loss Concealment[C]. conference on multimedia modeling, 2016: 654-661.
Jiang J, Hu R, Wang Z, et al. Facial Image Hallucination Through Coupled-Layer Neighbor Embedding[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2016, 26(9): 1674-1684.
Wang Z, Hu R, Yu Y, et al. Multi-Level Fusion for Person Re-identification with Incomplete Marks[C]. acm multimedia, 2015: 1267-1270. (EI,中国计算机学会A类会议)
Wang Z, Hu R, Liang C, et al. Person Re-identification Using Data-Driven Metric Adaptation[C]. conference on multimedia modeling, 2015: 195-207.
Wang S, Hu R, Chen S, et al. 3D Panning Based Sound Field Enhancement Method for Ambisonics[C]. pacific rim conference on multimedia, 2015: 135-145.
Wang S, Hu R, Chen S, et al. A down-mixing method for 22.2 multichannel system reproduction[C]. international conference on acoustics, speech, and signal processing, 2015: 634-638. (EI,中国计算机学会C类会议)
Zhang M, Hu R, Chen S, et al. Spatial perception reproduction of sound events based on sound property coincidences[C]. international conference on multimedia and expo, 2015: 1-6. (EI,中国计算机学会B类会议)
Yin L, Hu R, Chen S, et al. A Block-Based Background Model for Surveillance Video Coding[C]. data compression conference, 2015: 476-476. (EI,中国计算机学会B类会议)
Hu J, Hu R, Chen Y, et al. Joint Weighted Sparse Representation Based Median Filter for Depth Video Coding[C]. data compression conference, 2015: 450-450. (EI,中国计算机学会B类会议)
Gao L, Hu R, Yang Y, et al. Azimuthal Perceptual Resolution Model Based Adaptive 3D Spatial Parameter Coding[C]. conference on multimedia modeling, 2015: 534-545.
Jiang L, Hu R, Wang X, et al. Low Bitrates Audio Bandwidth Extension Using a Deep Auto-Encoder[C]. pacific rim conference on multimedia, 2015: 528-537.
Yang C, Hu R, Su L, et al. Multi-channel Object-Based Spatial Parameter Compression Approach for 3D Audio[C]. pacific rim conference on multimedia, 2015: 354-364.
Li D, Hu R, Wang X, et al. Multichannel Simplification Based on Deviation of Loudspeaker Positions[C]. advances in multimedia, 2015: 544-553.
Xie S, Yang Y, Hu R, et al. Signal-Aware Parametric Quality Model for Audio and Speech over IP Networks[C]. conference on multimedia modeling, 2015: 487-497.
Xiao J, Liao L, Hu J, et al. Exploiting global redundancy in big surveillance video data for efficient coding[J]. Cluster Computing, 2015, 18(2): 531-540.
Xiao J, Chen Y, Liao L, et al. Global Coding of Multi-source Surveillance Video Data[C]. data compression conference, 2015: 33-42. (EI,中国计算机学会B类会议)
Zhong R, Hu R, Wang Z, et al. 3D hybrid just noticeable distortion modeling for depth image-based rendering[J]. Multimedia Tools and Applications, 2015, 74(23): 10457-10478. (SCI,EI,中国计算机学会C类期刊)
Wang S, Hu R, Chen S, et al. A down-mixing method for 22.2 multichannel system reproduction[C]. international conference on acoustics, speech, and signal processing, 2015: 634-638. (EI,中国计算机学会C类会议)
Liao L, Hu R, Xiao J, et al. Exploiting effects of parts in fine-grained categorization of vehicles[C]. international conference on image processing, 2015: 745-749.
Xu Z, Hu R, Chen J, et al. How much bandwidth does surveillance system require[C]. international conference on image processing, 2015: 1762-1766. (EI,中国计算机学会C类会议)
Zhang M, Hu R, Chen S, et al. Spatial perception reproduction of sound events based on sound property coincidences[C]. international conference on multimedia and expo, 2015: 1-6. (EI,中国计算机学会B类会议)
Jing X, Zhu X, Wu F, et al. Super-resolution Person re-identification with semi-coupled low-rank discriminant dictionary learning[C]. computer vision and pattern recognition, 2015: 695-704. (EI, 中国计算机学会A类会议)
Qu S, Hu R, Chen S, et al. Face hallucination via Cauchy regularized sparse representation[C]. international conference on acoustics, speech, and signal processing, 2015: 1216-1220. (EI,中国计算机学会C类会议)
Gao L, Hu R, Yang Y, et al. Azimuthal Perceptual Resolution Model Based Adaptive 3D Spatial Parameter Coding[C]. conference on multimedia modeling, 2015: 534-545
Jiang J, Hu R, Han Z, et al. Coupled Discriminant Multi-Manifold Analysis with Application to Low-Resolution Face Recognition[C]. conference on multimedia modeling, 2015: 37-48. (EI,中国计算机学会C类会议)
[20] Jiang J, Hu R, Wang Z, et al. Face Super-Resolution via Multilayer Locality-Constrained Iterative Neighbor Embedding and Intermediate Dictionary Learning[J]. IEEE Transactions on Image Processing, 2014, 23(10): 4220-4231.(SCI, 中国计算机学会A类期刊)
Zhong R, Hu R, Wang Z, et al. 3D hybrid just noticeable distortion modeling for depth image-based rendering[J]. Multimedia Tools and Applications, 2015, 74(23): 10457-10478. (SCI,EI,中国计算机学会C类期刊)
Jiang J, Hu R, Han Z, et al. Low-Resolution and Low-Quality Face Super-Resolution in Monitoring Scene via Support-Driven Sparse Coding[C]. signal processing systems, 2014, 75(3): 245-256. (SCI)
Hu J, Hu R, Wang Z, et al. Adaptive Learning Based View Synthesis Prediction for Multi-View Video Coding[C]. signal processing systems, 2014, 74(1): 115-126.(SCI)
Jiang J , Hu R , Wang Z , et al. Noise Robust Face Hallucination via Locality-Constrained Representation[J]. IEEE Transactions on Multimedia, 2014, 16(5):1268-1281. (SCI,中国计算机学会C类会议)
Huang Z, Hu R, Wang Z, et al. Background Subtraction With Video Coding[J]. IEEE Signal Processing Letters, 2013, 20(11): 1058-1061. (SCI)
Gao L, Hu R, Yang Y, et al. A spatial priority based scalable audio coding[C]. international conference on acoustics speech and signal processing, 2014: 3670-3674. (EI,中国计算机学会B类会议 )
Leng Q, Hu R, Liang C, et al. Bidirectional ranking for person re-identification[C]. international conference on multimedia and expo, 2013: 1-6. (EI,中国计算机学会B类会议)
Wang Y, Hu R, Liang C, et al. Camera compensation using feature projection matrix for person re-identification[C]. international conference on multimedia and expo, 2013: 1-6. (EI,中国计算机学会B类会议)
Lan C, Hu R, Huang K, et al. Face hallucination with shape parameters projection constraint[C]. acm multimedia, 2010: 883-886. (EI,中国计算机学会A类会议)
Chen H, Hu R, Mao D, et al. Video coding using dynamic texture synthesis[C]. international conference on multimedia and expo, 2010: 203-208. (EI,中国计算机学会B类会议)
Chen H, Hu R, Hu J, et al. Temporal color Just Noticeable Distortion model and its application for video coding[C]. international conference on multimedia and expo, 2010: 713-718. (EI,中国计算机学会B类会议)
Hu R, Hang B, Ma Y, et al. A bottom-up audio attention model for surveillance[C]. international conference on multimedia and expo, 2010: 564-567.(EI,中国计算机学会B类会议)
Books and Edited Books
多媒体信源编码技术与安防监控应急系统,胡瑞敏,湖北科学技术出版,2007
avs技术创新报告(2002-2010),数字音视频编解码技术标准工作组,人民邮电出版社,2011