Ayuda
Ir al contenido

Dialnet


Low complexity hevc intra coding

  • Autores: José Damián Ruiz Coll
  • Directores de la Tesis: José Luis Martínez Martínez (dir. tes.), Gerardo Fernández-Escribano (dir. tes.)
  • Lectura: En la Universidad de Castilla-La Mancha ( España ) en 2016
  • Idioma: inglés
  • Tribunal Calificador de la Tesis: José Antonio Gámez Martín (presid.), Manuel Pérez Malumbres (secret.), Velibor Adzic (voc.)
  • Programa de doctorado: Programa de Doctorado en Tecnologías Informáticas Avanzadas por la Universidad de Castilla-La Mancha
  • Materias:
  • Enlaces
    • Tesis en acceso abierto en: RUIdeRA
  • Resumen
    • 1. MOTIVATION Over the last few decades, much research has focused on the development and optimization of video codecs for media distribution to end-users via the Internet, broadcasts or mobile networks, but also for videoconferencing and for the recording on optical disks for media distribution. Most of the video coding standards for delivery are characterized by using a high efficiency hybrid schema, based on inter-prediction coding for temporal picture decorrelation, and intra-prediction coding for spatial picture decorrelation. As is well known, this schema achieves a high performance at the expense of several drawbacks, such as a high coding latency, low stream robustness, and a high coding complexity due to the motion-estimation process of the inter-prediction, among others. Nevertheless, high efficiency video and image codecs are also required in other fields with quite different requirements, such as still-picture photography storage, live TV interviews where low latency is needed for natural communication between the interlocutors, and also for professional edition and post-production tasks commonly used in the TV and cinema industries, where high quality and fast access to the individual pictures are required. These production codecs, named mezzanine codecs, are highly relevant for the audio-visual industry, which demands two things: very high compression efficiency and a low computational burden. A high video coding performance allows a reduction in the storage space for archiving applications, and also a long recording capability on small physical supports, i.e. optical disks and solid state memories. Regarding low computational complexity, this is mainly motived by the fact that these mezzanine codecs are used on portable devices, mainly camcorders, and thus one wishes that the codec can be implemented on low cost processors, and just as importantly, with low power consumption, allowing a long operational autonomy. The latest video coding standard was approved in January of 2013, and it is the result of the collaboration between the ITU-T and the ISO/IEC international organizations grouped into the Joint Collaborative Team Video Coding group (JCT-VC). This standard is officially called Recommendation ITU-T H.265 and ISO/IEC 23008-2, and informally known as HEVC (High Efficiency Video Coding). Exhaustive quality assessments have proved that HEVC can reduce the bit rate of its predecessor, namely H.264/AVC, by half for the same perceptual quality. The high compression efficiency of HEVC has led to this standard being adopted for future TV services using the new UHD format by the two worldwide broadcasting organisations, Digital Video Broadcasting (DVB) and the Advanced Television Systems Committee (ATSC). It has also been adopted in other application fields, such as by the Blu-ray Disc Association (BDA) for the next generation of Ultra-HD Blu-ray. This fact will probably accelerate HEVC's expansion and development, prompting a rapid replacement of the H.264/AVC standard in the consumer markets and also in massive media services such as video streaming, VoD and OTT (Over The Top). HEVC is also considered by the industry as the best candidate to replace the current mezzanine compression codecs, due to the high performance of the novel HEVC intra-prediction coding. This schema significatively improves H.264/AVC's performance, mostly by using a high density of angular predictors. For this reason, the HEVC standard has approved a set of specific profiles which exclusively use the intra-prediction schema, known as ¿Main still Picture¿ and the ¿Main Intra¿ profiles, with support for different bit-depths and chroma sampling formats. However, the high compression performance of HEVC comes at the expense of a huge computational complexity increase compared with other codecs, which is hindering the rapid adoption of HEVC by the professional market. The speeding up of the intra-prediction coding can be achieved by applying advanced techniques that allow the taking of decisions that are needed in the different stages of intra-prediction with low complexity. This approach constitutes the basis of this thesis, which addresses the complexity reduction of HEVC intra-coding by using non-traditional techniques used in the video coding standards, such as Machine Learning and image processing algorithms for texture orientation detection. 2. RESEARCH OBJECTIVES In this dissertation four main objectives have been proposed for the computational reduction of the real time implementation of the intra-prediction coding in HEVC, and these are the following: Study of state-of-the-art of HEVC and fast intra-prediction approaches. As an initial aim we have defined an in-depth study of the HEVC architecture and the new coding tools introduced in this standard, paying special attention to the performance differences between H.264/AVC and HEVC. One of the main objectives is the detailed study of the different algorithms that comprises the intra-prediction in HEVC, and the analysis of the computational complexities required for these. Then, we will conduct a detailed study of the most relevant approaches proposed in the literature for fast partitioning and mode decision in intra-prediction, with particular attention to the algorithms already adopted in the Rate Distortion Optimization (RDO) stage of the test model proposed for the HEVC reference software. The conclusion of this study will constitute the basis for the design of the efficient low complexity algorithms proposed in this thesis. Development and evaluation of a fast partitioning decision algorithm. Being aware that the partitioning of the coding units is the most critical decision that the encoder has to take, in terms of computational burden but also regarding the impact on quality, the second objective proposed in this thesis is the complexity reduction of the partitioning decision. With the aim of tackling this task with high efficiency, the use of Machine Learning techniques for the design of a decision tree will be studied. Special efforts will be made in the training stage of the decision trees, selecting the optimal number and type of attributes, allowing a high precision classifier with the minimum computational cost. It required that the architecture of the proposed approach permit a scalable implementation with different decision nodes, in order to achieve different levels of speed-up and different levels of performance reductions, compared with the HEVC reference software. Development and evaluation of a fast mode decision algorithm. With the aim of exploiting the strong correlation between the texture orientation of the image and the angular modes defined in HEVC, the development and evaluation of a fast mode decision algorithm based on texture orientation will be studied. In order to achieve this objective, an in-depth study of the different texture and edge detection techniques published in the literature will be carried out. The proposal will be highly efficient for the detection of the dominant gradient in all the range of coding units, from 64x64 to 4x4 pixels, with a low computation complexity. The aim of the approach will be to achieve a significant speed-up of intra-prediction coding by reducing the number of modes to be evaluated, with a very low performance penalty. Combination of both proposals in a full fast intra-prediction algorithm. As a last aim, the integration of both fast decision approaches in a unified architecture is proposed. The analysis and performance evaluation of the combined proposal will allow us to know the best performance by speeding up both decisions. The analysis of the results will also show the mutual inference of the wrong classification decisions of both algorithms in terms of bit coding efficiency degradation. That is, how a wrong mode decision affects the global encoding performance when a wrong partitioning decision is taken, or vice versa, how a wrong partitioning decision affects the global encoding performance when a wrong mode decision is taken. ¿ 3. CONCLUSIONS The major conclusions obtained in this thesis are summarized in the following lines: Regarding HEVC Intra-prediction encoding. Nowadays, multimedia services are widely delivered through wireless and broadcast networks to a broad range of devices with very diverse computational capabilities. With the aim of achieving high compression efficiency, these services commonly use a long GOP encoding structure, which applies an inter-picture prediction scheme, but intra-prediction encoding also has to be used at least in one picture of each GOP. Intra-prediction is currently an active research topic due to the practical application of the intra encoding scheme in several fields, such as: professional media production, where high quality and random access to the individual pictures are required; very low delay communications, in which intra-coding appears to be the only available solution; and also delivery through networks with transmission error, typically packet loss, because the non-temporal dependency between pictures avoids temporal error propagation. Nowadays, the high efficiency of HEVC and its performance supremacy over previous video coding standards, such as H.264/AVC, is unquestionable. However, the extremely high computational burden demanded by the HEVC algorithms makes HEVC encoding a challenge for developers, and also for the research community. In the last few years, wireless and portable devices have experienced an incredible evolution, their multimedia capabilities increasing with very high resolution picture and video cameras for real time personal communications. The evolution of wireless networks has followed a similar trend, especially the cellular networks supporting 3G, 4G and the forthcoming 5G technologies, which allow huge bandwidths of tens, and soon hundreds, of Mbps, which were unthinkable a few years ago. However, video communication using these new multimedia functionalities imposes a huge computational burden that becomes a severe problem for battery-powered portable devices. The novel approaches presented in this thesis prove that the non-conventional techniques used in the video coding field, such as Machine Learning and image processing tools, make it possible to achieve a considerable complexity reduction when they are efficiently applied, which can be regarded as a solution for the above mentioned scenarios under computational or consumption constraints. The results reported in Chapter 5 show that the combination of both approaches can obtain a wide range of speed-ups, from 30% to nearly 70%, in a trade-off with slight performance losses in the range of 0.4% to 4%, when it is compared with the HM reference software. This noteworthy complexity reduction for HEVC intra-prediction coding will encourage the quick introduction and development of the new generation of video coding standards in a new generation of multimedia devices such as smartphones, tablets and smart TVs. Regarding a fast partitioning decision for HEVC intra-prediction encoding There is no doubt that the optimal CTU partitioning decision is the most computationally intensive process in intra-prediction, because of the high number of available PU sizes evaluated by the RDO stage. The proposal presented in this thesis addresses this problem by introducing a fast PU classifier which has been designed using Machine Learning models. ML techniques are commonly used to solve big data problems where a big number, which can be tens or sometimes hundreds, of attributes are known. These techniques make it possible to carry out a high accuracy data classification or infer a behavior with high reliability. However, the application of ML techniques to the intra-prediction scenario, where initial features of the CTU are unavailable or cannot be gathered from other stages of the encoder, constitutes one of the most challenging aspects addressed in this thesis due to the critical balance that has to be reached between the number of attributes to be computed, and the cost of computing these attributes. The approach presented in this thesis is based on decision trees whose rules were obtained by training a model using ML techniques supported by the WEKA tool. It is worth highlighting that the size of the training data set and the type of data set constituted the key elements in achieving a high accuracy classifier. Regarding the selection of the data set, this was assisted by the SI and TI metrics, which have proven to be very effective criteria for image complexity characterization. While attending to these parameters, the data type was selected by covering the full range of possible CU patterns, from very low textured images to high complexity images. As for the size of the data set, just 8 frames were used for the training stage, which is 0.081% of the total frames (9,780 frames belonging to the 22 test sequences) used for the validation test. The results obtained confirm that a high accuracy classifier can be achieved by training decision trees with a small data set, as long as a high quality data type selection is performed. Regarding the attributes used by the classifier, it has been found that first order statistics such as variance and the mean of PUs are not sufficient to classify a PU as Split or Not-Split with high precision. However, by introducing as an attribute the variance of these statistics of mean and variance but computed over the four sub-PUs, classifier accuracy is significantly increased. In addition, these statistics can be computed with a very low computational burden, thus the balance between the number and effectiveness of attributes and their cost is very profitable. Results for the training and non-training sequences have demonstrated that a significant speed-up, of over 52%, of the CTU partitioning decision can be achieved in intra-prediction encoding, with only a slight bit rate increase of less than 2%, favoring real-time software and hardware implementation. Finally, a comparative study with the most prominent fast CTU partitioning decision algorithms proposed in the literature has been conducted. The results confirm that the approach presented in this thesis achieves better performance for the full node implementation, with a higher complexity reduction and a lower bit rate penalty. Regarding fast mode decision for HEVC intra-prediction encoding The optimal mode decision constitutes the second critical parameter that has to be selected for intra-prediction, in terms of computational burden. In this thesis, the correlation between the optimal mode decision and the edge and texture orientation is exploited. Based on that observation, a novel texture orientation detection approach has been proposed which computes the MDV along a set of co-lines with rational slopes. The first accomplishment reached in this proposal is the definition of the reduced number of orientations, just 12 directions, which can estimate a set of modes close to the optimal mode in-between the 33 angular intra modes of HEVC. A noteworthy feature of this approach is that these orientations with rational slopes are exclusively defined in an integer position of the discrete lattice ¿, thus no pixel interpolation is required, significatively reducing the computational burden. The second key point of this proposal has been the demonstration that the pixel correlation is maximal in the dominant texture orientation, and consequently the variance computed in that direction obtains the lowest value, compared with the variance computed in other directions. This observation constitutes the basis of the proposed MDV metric, which presents several advantages such as the low complexity computation of the variance, and a scalability implementation in co-segments, allowing the computation of the all PU sizes in just one pass. The next achievement has been the improvement of the MDV metric by introducing the concept of Sliding Window. In this new approach, the directional variance computation for an NxN PU is expanded to an (N+1)x(N+1) window, thus the neighbouring pixels used as reference samples for the construction of the predictor are also included in the calculation of the directional variance. MDV-SW makes it possible to reduce the bit rate penalty of the MDV proposal by half. The results show that the proposed fast mode decision achieves a significant time saving of 30% with a negligible impact on the encoding performance, obtaining an average BD-rate of just 0.4%. Finally, a comparison with the 5 most noteworthy fast mode decision algorithms proposed in the literature has been presented. The results show that the MDV-SW proposal obtains the best performance with the lowest ¿ Rate¿¿ Time ratio. Regarding the combination of fast partitioning and mode decision approaches for HEVC intra-prediction encoding Finally, both proposals presented in this thesis, the fast partitioning and fast mode decision, are combined, creating an overall fast intra-prediction algorithm, called FPMD, which achieves a very high overall performance. The key point of the combined approach is that both decisions are taken by computing low complexity algorithms, first using decision trees (if-else statements) for the partitioning decision, and then optimal orientation is estimated by computing the MDV in 12 directions. The combined approach is a flexible solution because it can also be implemented using any combination of nodes including the fast mode decision, and thus better performance is achieved compared with the standalone partitioning approach. The simulation results for FPMD show a wide range of speed-ups, from 44% for the N64+MDV-SW to 67% for the overall implementation, namely N64+N32+N16+MDV-SW, at the expense of slight penalties in terms of BD-rate, from 1.1% to 4.6%, respectively. ¿ 4. REFERENCES [Bai11] Jing Bai; Huaji Zhou, "Edge detection approach based on directionlet transform," in Multimedia Technology (ICMT), 2011 International Conference on , vol., no., pp.3512-3515, 26-28 July 2011. [Bam92] Bamberger, R.H.; Smith, M.J.T., "A filter bank for the directional decomposition of images: theory and design," in Signal Processing, IEEE Transactions on , vol.40, no.4, pp.882-893, Apr 1992. [Bha97] Vasudev Bhaskaran and Konstantinos Konstantinides. 1997. Image and Video Compression Standards: Algorithms and Architectures (2nd ed.). Kluwer Academic Publishers, Norwell, MA, USA. [Bos11] F. Bossen, On Software Complexity, HM 6.0 Reference Software, JCTVC-G757, Geneva, Switzerland, Nov. 2011. [Boy15] Boyce, J.M.; Ye, Y.; Chen, J.; Ramasubramonian, A.K., "Overview of SHVC: Scalable Extensions of the High Efficiency Video Coding (HEVC) Standard," in Circuits and Systems for Video Technology, IEEE Transactions on , vol.PP, no.99, pp.1-1, 2015. [Bri06] Britanak, V., et al.: Discrete Cosine and Sine Transforms: General Properties, Fast Algorithms and Integer Approximations. Academic Press, New York (2006). [Can99] E. J. Cand`es and D. L. Donoho, ¿Curvelets - a surprisingly effective nonadaptive representation for objects with edges,¿ in Curve and Surface Fitting, A. Cohen, C. Rabut, and L. L. Schumaker, Eds. Saint-Malo:Vanderbilt University Press, 1999. [Car10] Carrillo, P.; Tao Pin; Kalva, H., "Low complexity H.264 video encoder design using machine learning techniques," Consumer Electronics (ICCE), 2010 Digest of Technical Papers International Conference on , vol., no., pp.461,462, 9-13 Jan. 2010. [Cas96] Castleman K. R., 1996. Digital Image Processing. Englewood Cliffs, NJ: Prentice-Hall. [Cen15] Y.-F. Cen et al., A fast CU depth decision mechanism for HEVC, Inf Process. Lett. (2015). [Cha79] Tony F. Chan, Gene H. Golub, and Randall J. LeVequ. ¿Updating Formulae and a Pairwise Algorithm for Computing Sample Variances¿. Technical Report. Stanford University, Stanford, CA, USA, 1979. [Che13] Gaoxing Chen; Zhenyu Pei; Lei Sun; Zhenyu Liu; Ikenaga, T., "Fast intra prediction for HEVC based on pixel gradient statistics and mode refinement," Signal and Information Processing (ChinaSIP), IEEE China Summit & International Conference on, pp. 514-517, 6-10 Jul. 2013. [Che14] Keji Chen; Yizhou Duan; Jun Sun; Zongming Guo, "Towards efficient wavefront parallel encoding of HEVC: Parallelism analysis and improvement," in Multimedia Signal Processing (MMSP), 2014 IEEE 16th International Workshop on , vol., no., pp.1-6, 22-24 Sept. 2014. [Chi12] Chih-Ming Fu; Alshina, E.; Alshin, A.; Yu-Wen Huang; Ching-Yeh Chen; Chia-Yang Tsai; Chih-Wei Hsu; Shaw-Min Lei; Jeong-Hoon Park; Woo-Jin Han, "Sample Adaptive Offset in the HEVC Standard," in Circuits and Systems for Video Technology, IEEE Transactions on , vol.22, no.12, pp.1755-1764, Dec. 2012. [Cho13] Seunghyun Cho; Munchurl Kim, "Fast CU Splitting and Pruning for Suboptimal CU Partitioning in HEVC Intra Coding," Circuits and Systems for Video Technology, IEEE Transactions on , vol.23, no.9, pp.1555,1564, Sept. 2013. [Con98] J.H. Conway and N.J.A. Sloane. Sphere packing, lattices and groups. Springer-Verlag, 1998. [Cor15] Correa, G.; Assuncao, P.A.; Volcan Agostini, L.; da Silva Cruz, L.A., "Fast HEVC Encoding Decisions Using Data Mining," Circuits and Systems for Video Technology, IEEE Transactions on , vol.25, no.4, pp.660,673, April 2015. [Do05] M. N. Do and M. Vetterli, ¿The contourlet transform: An efficient directional multiresolution image representation,¿ IEEE Trans. Image Process. vol. 14, no. 12, pp. 2091-2106, Dec. 2005. [Duf09] F. Dufaux, G. J. Sullivan, and T. Ebrahimi, ¿The JPEG XR image coding standard [Standards in a Nutshell]¿ IEEE Signal Processing Magazine, Vol. 26, Nov. 2009. [Efr83] Efron, B., & Gong, G. (1983). A leisurely look at the bootstrap, the jackknife, and cross-validation. American Statistician, 37, 36¿48. [Fay96] Fayyad, U. M., Piatetsky-Shapiro, G., Smyth, P., Uthurusamy, R, Editors (1996), Advances in Knowledge Discovery and Data Mining, MIT Press, Cambridge, MA [Fer08a] Gerardo Fernández-Escribano, Rashid Jillani, Christopher Holder, Hari Kalva, Jose Luis Martinez Martinez, and Pedro Cuenca, ¿Video encoding and transcoding using machine learning,¿ Multimedia Data Mining: held in conjunction with the ACM SIGKDD 2008 (MDM '08), 9th International Workshop on, pp. 53-62, Las Vegas, Nevada, 24-27 Aug. 2008. [Fer08b] Gerardo Fernández-Escribano, Hari Kalva, Pedro Cuenca, Luis Orozco-Barbosa, Antonio Garrido, ¿A Fast MB Mode Decision Algorithm for MPEG-2 to H.264 P-Frame Transcoding,¿ Circuits and Systems for Video Technology, IEEE Trans on, vol. 18, no. 2, pp. 172¿185, Feb. 2008. [Fer10] Fernandez-Escribano, G.; Kalva, H.; Martinez, J.L.; Cuenca, P.; Orozco-Barbosa, L.; Garrido, A., "An MPEG-2 to H.264 Video Transcoder in the Baseline Profile," Circuits and Systems for Video Technology, IEEE Transactions on , vol.20, no.5, pp.763,768, May 2010. [Fly15] Flynn, D.; Marpe, D.; Naccari, M.; Nguyen, T.; Rosewarne, C.; Sharman, K.; Sole, J.; Xu, J., "Overview of the Range Extensions for the HEVC Standard: Tools, Profiles and Performance," in Circuits and Systems for Video Technology, IEEE Transactions on , vol.PP, no.99, pp.1-1, 2015. [Gab46] D. Gabor, ¿Theory of Communication,¿ J. Institute of Electrical Engineers, 93, 1946, 429¿457. [Gup13] S. Gupta and S. G. Mazumdar, ¿Sobel Edge Detection Algorithm,¿ vol. 2, no. 2, pp. 1578¿1583, 2013. [Han98] Hand, D. J. (1998), ¿Data Mining: Statistics and More?¿, The American Statistician, May (52:2), 112-118. [Han10] Han, W.-J.; Min, J.; Kim, I.-K.; Alshina, E.; Alshin, A.; Lee, T.; Chen, J.; Seregin, V.; Lee, S.; Hong, Y. M.; Cheon, M.-S.; Shlyakhov, N.; McCann, K.; Davies, T.; Park, J.-H.,"Improved Video Compression Efficiency Through Flexible Unit Representation and Corresponding Extension of Coding Tools," IEEE Transactions on, vol.20, no.12, pp.1709-1720, Dec. 2010. [Han12] Philippe Hanhart ; Martin Rerabek ; Francesca De Simone ; Touradj Ebrahimi; Subjective quality evaluation of the upcoming HEVC video compression standard . Proc. SPIE 8499, Applications of Digital Image Processing XXXV, 84990V (October 15, 2012). [Hon13] Honghai Yu; Winkler, S., "Image complexity and spatial information," Quality of Multimedia Experience (QoMEX), 2013 Fifth International Workshop on , vol., no., pp.12,17, 3-5 July 2013. [Hua13] Han Huang; Yao Zhao; Chunyu Lin; Huihui Bai, "Fast bottom-up pruning for HEVC intraframe coding," Visual Communications and Image Processing (VCIP), 2013 , vol., no., pp.1,5, 17-20 Nov. 2013. [Hul07] J.V. Hulse, T.M. Khoshgoftaar, and A. Napolitano, ¿Experimental Perspectives on Learning from Imbalanced Data,¿ Proc. 24th Int¿l Conf. Machine Learning, pp. 935-942, 2007. [ISO-11172] ISO/IEC 11172-2. ¿ Information technology ¿ Coding of moving pictures and associated audio for digital storage media at up to about 1.5 Mbit/s, 1988. [ISO-13818-2] ISO/IEC 13818-2. ¿ Information technology ¿ Generic coding of moving pictures and associated audio information ¿ Part 2: Video, 1996. [ISO-14496-2] ISO/IEC 14496-2. ¿ Information technology ¿ Coding of audio-visual objects ¿ Part 2: Visual, 1999. [ISO-14496-10] ISO/IEC 14496-10. ¿ Information technology ¿ Coding of audio-visual objects, 2003. [ISO-23008-2] ISO/IEC DIS 23008-2. Information technology ¿ High efficiency coding and media delivery in heterogeneous environments ¿ Part 2: High efficiency video coding, 2013. [ITU-T H.261] ITU-T Recommendation H.261, Video codec for audiovisual services at px64 kbits/s, 1988. [ITU-T H.262] ITU-T Recommendation H.262, Information technology - Generic coding of moving pictures and associated audio information: Video, 1996. [ITU-T H.263] ITU-T Recommendation H.263, Video coding for low bit rate communication, Version 2, 1998. [ITU-T H.264] ITU-T Recommendation H.264, Advanced video coding for generic audiovisual services, 2003. [ITU-T H.265] ITU-T Recommendation H.265, High efficiency video coding, 2013. [ITU-T T. T.81] ITU-T Rec. T.81, ¿Information technologies ¿ Digital compression and coding of continuous-tone still images ¿ requirements and guidelines¿, 1992. [ITU-T T.800] ITU-T Rec. T.800, ¿JPEG2000 Image Coding System: Core Coding System¿ (JPEG2000 Part 1), 2000. [ITU-T T.832] ITU-T Rec. T.832, ¿Information technology ¿ JPEG XR image coding system ¿ Image coding specification¿, 2012. [ITU-T P.910] ITU-T Recommendation P.910, ¿Subjective Video Quality Assessment Methods for Multimedia Applications,¿ International Telecommunication Union, Geneva (1999). [Jap00] N. Japkowicz, editor, Proceedings of the AAAI'2000. Workshopon Learning from Imbalanced Data Sets,. AAAI Tech Report WS-00-05. [Jay10] Jayachandra, D.; Makur, A., "Directional Variance: A measure to find the directionality in a given image segment," Circuits and Systems (ISCAS), Proceedings of 2010 IEEE International Symposium on , vol., no., pp.1551,1554, May 30 2010-June 2 2010. [JCTVC- D122] C.-M. Fu, C.-Y. Chen, Y.-W. Huang, and S. Lei, ¿CE8 Subset 3: Picture Quadtree Adaptive Offset¿, document JCTVC-D122, Jan. 2011. [JCTVC-H0012] Gary Sullivan, Koohyar Minoo, ¿Objective quality metric and alternative methods for measuring coding efficiency¿, document JCTVC-H0012, ITU-T/ISO/IEC Joint Collaborative Team on Video Coding (JCT-VC), 8th Meeting: San Jose, CA, USA, 1 ¿ 10 February 2012. [JCTVC-K1002] Il-Koo Kim, Ken McCann, Kazuo Sugimoto, Benjamin Bross, Woo-Jin Han, ¿HM9: High Efficiency Video Coding (HEVC) Test Model 9 Encoder Description¿, document JCTVC-K1002, ITU-T/ISO/IEC Joint Collaborative Team on Video Coding (JCT-VC), 11th Meeting: Shanghai, CN, 10¿19 October 2012. [JCTVC-L1100] F. Bossen, ¿Common Test Conditions and Software Reference Configurations,¿ document JCTVC-L1100, ITU-T/ISO/IEC Joint Collaborative Team on Video Coding (JCT-VC), 12th Meeting: Geneve, CH 14 ¿ 23 Jan. 2013. [JCT-VC Reference Software] Joint Collaborative Team on Video Coding Reference Software, version HM 16.6 [Online]. Available: https://hevc.hhi.fraunhofer.de/ [Jia12] Wei Jiang; Ma, Hanjie; Yaowu Chen, "Gradient based fast mode decision algorithm for intra prediction in HEVC," Consumer Electronics, Communications and Networks (CECNet), 2012 2nd International Conference on, pp.1836-1840, 21-23 Apr. 2012. [Jil09] Jillani, R.; Kalva, H., "Low complexity intra MB encoding in AVC/H.264," Consumer Electronics, IEEE Transactions on , vol.55, no.1, pp.277,285, February 2009. [Kan11] S. Kanumuri, T. K. Tan, and F. Bossen, Enhancements to Intra Coding, JCTVC-D235, Daegu, Korea, Jan. 2011. [Kha13] Khan, Muhammad Usman Karim; Shafique, Muhammad; Henkel, Jorg, "An adaptive complexity reduction scheme with fast prediction unit decision for HEVC intra encoding," Image Processing (ICIP), 2013 20th IEEE International Conference on , vol., no., pp.1578,1582, 15-18 Sept. 2013. [Kim13] Younhee Kim, DongSan Jun, Soon-heung Jung, Jin Soo Choi, and Jinwoong Kim, "A Fast Intra-Prediction Method in HEVC Using Rate-Distortion Estimation Based on Hadamard Transform," ETRI Journal, vol. 35, no. 2, pp. 270-280, Apr. 2013. [Kor10] J. Korhonen and J. You, ¿Improving objective video quality assessment with content analysis,¿ in Proc. VPQM, Scottsdale, AZ, USA, Jan. 2010. [Lai12] Lainema, J.; Bossen, F.; Woo-Jin Han; Junghye Min; Ugur, K., "Intra Coding of the HEVC Standard," in Circuits and Systems for Video Technology, IEEE Transactions on , vol.22, no.12, pp.1792-1801, Dec. 2012 [Lei08] Zhang Lei; Makur, A., "Enumeration of Downsampling Lattices in Two-Dimensional Multirate Systems," Signal Processing, IEEE Transactions on , vol.56, no.1, pp.414,418, Jan. 2008. [Li11] B. Li, G. J. Sullivan, and J. Xu, ¿Comparison of Compression Performance of HEVC Working Draft 4 with AVC High Profile,¿ Doc. JCTVC-G399, Nov. 2011 [Lia11] Liang Zhao; Li Zhang; Ma, Siwei; Debin Zhao, "Fast mode decision algorithm for intra prediction in HEVC," Visual Communications and Image Processing (VCIP), 2011 IEEE , vol., no., pp.1,4, 6-9 Nov. 2011. [Loh11] Loh, W.-Y. (2011), Classification and regression trees. WIREs Data Mining Knowl Discov, 1: 14¿23. [Lok10] Kar Seng Loke, "Wedgelets-based automatic object contour detection," in Natural Computation (ICNC), 2010 Sixth International Conference on , vol.7, no., pp.3664-3668, 10-12 Aug. 2010. [Lon13] Long-Sheng Chen; Jui-Yu Lin, "A study on review manipulation classification using decision tree," Service Systems and Service Management (ICSSSM), 2013 10th International Conference on, vol., no., pp.680,685, 17-19 July 2013. [Mal89] S. G. Mallat, ¿A Theory for Multiresolution Signal Decomposition: The Wavelet Representation,¿ IEEE Trans. Pattern Analysis and Machine Intelligence, PAMI-11, 7, July 1989, 674¿693. [Mar03] Marpe, D.; Schwarz, H.; Wiegand, T., "Context-based adaptive binary arithmetic coding in the H.264/AVC video compression standard," in Circuits and Systems for Video Technology, IEEE Transactions on , vol.13, no.7, pp.620-636, July 2003. [Mar04] J. M. Martínez, ¿MPEG-7 Overview (version 10),¿ MPEG Document, ISO/IEC JTC1/SC29/WG11 N6828, Palma de Mallorca, October 2004. [Mar05] Marpe, D.; Wiegand, T.; Gordon, S., "H.264/MPEG4-AVC fidelity range extensions: tools, profiles, performance, and application areas," in Image Processing, 2005. ICIP 2005. IEEE International Conference on , vol.1, no., pp.I-593-6, 11-14 Sept. 2005. [Mar08] Martinez, J.L.; Fernandez-Escribano, G.; Kalva, H.; Weerakkody, W. A R J; Fernando, W. A C; Garrido, A., "feedback free DVC architecture using machine learning," Image Processing, 2008. ICIP 2008. 15th IEEE International Conference on, vol., no., pp.1140,1143, 12-15 Oct. 2008. [Mar10] Marpe, D.; Schwarz, H.; Bosse, S.; Bross, B.; Helle, P.; Hinz, T.; Kirchhoffer, H.; Lakshman, H.; Tung Nguyen; Oudin, S.; Siekmann, M.; Su¿hring, K.; Winken, M.; Wiegand, T., "Video Compression Using Nested Quadtree Structures, Leaf Merging, and Improved Techniques for Motion Representation and Entropy Coding," in Circuits and Systems for Video Technology, IEEE Transactions on , vol.20, no.12, pp.1676-1687, Dec. 2010. [Nem¿10] Nem¿ic¿, O.; Rimac-Drlje, S.; Vranjes, M., "Multiview Video Coding extension of the H.264/AVC standard," in ELMAR, 2010 PROCEEDINGS , vol., no., pp.73-76, 15-17 Sept. 2010. [Ngu12] Tung Nguyen; Marpe, D., "Performance analysis of HEVC-based intra coding for still image compression," Picture Coding Symposium (PCS), 2012 , vol., no., pp.233,236, 7-9 May 2012. [Min08] J. Min, S. Lee, I. Kim, W.-J. Han, J. Lainema, and K. Ugur, ¿Unification of the Directional Intra Prediction Methods in TMuC,¿ JCTVC-B100, Geneva, Switzerland, Jul. 2010. [Min10] J. Min, S. Lee, I. Kim, W.-J. Han, J. Lainema, and K. Ugur, ¿Unification of the Directional Intra Prediction Methods in TMuC,¿ JCTVC-B100, Geneva, Switzerland, Jul. 2010. [Min11]A. Minezawa, K. Sugimoto, and S. Sekiguchi, An Improved Intra Vertical and Horizontal Prediction, JCTVC-F172, Torino, Italy, Jul. 2011. [Mis13] Misra, K.; Segall, A.; Horowitz, M.; Shilin Xu; Fuldseth, A.; Minhua Zhou, "An Overview of Tiles in HEVC," in Selected Topics in Signal Processing, IEEE Journal of , vol.7, no.6, pp.969-977, Dec. 2013. [Mit02] Mitra Basu, 2002. Gaussian-Based Edge-Detection Methods¿A Survey. IEEE Transactions on Systems, Man, and Cybernetic, 252-260. [Nor12] Norkin, A.; Bjontegaard, G.; Fuldseth, A.; Narroschke, M.; Ikeda, M.; Andersson, K.; Minhua Zhou; Van der Auwera, G., "HEVC Deblocking Filter," in Circuits and Systems for Video Technology, IEEE Transactions on , vol.22, no.12, pp.1746-1754, Dec. 2012. [Ohm12] Ohm, J.; Sullivan, G.J.; Schwarz, H.; Thiow Keng Tan; Wiegand, T., "Comparison of the Coding Efficiency of Video Coding Standards¿Including High Efficiency Video Coding (HEVC)," in Circuits and Systems for Video Technology, IEEE Transactions on , vol.22, no.12, pp.1669-1684, Dec. 2012. [Ohm13] Ohm, J.; Sullivan, G.J., "High efficiency video coding: the next frontier in video compression [Standards in a Nutshell]," in Signal Processing Magazine, IEEE, vol.30, no.1, pp.152-158, Jan. 2013. [Ort99] Ortega A, Ramchandran K (1999) Rate-distortion methods for image and video compression: an overview. IEEE Signal Process J 23¿50. [Pan05] F. Pan, X. Lin, S. Rahardja, K. P. Lim, Z. G. Li, D. Wu, and S. Wu, ¿Fast mode decision algorithm for intra prediction in H.264/AVC video coding,¿ IEEE Trans. Circuits Syst. Video Technol., vol. 15, no. 7, pp. 813¿822, Jul. 2005. [Pia10] Y. Piao, J. H. Min, and J. Chen, Encoder Improvement of Unified Intra Prediction, JCTVC-C207, JCT-VC of ISO/IEC and ITU-T, Guangzhou, Chnia, Oct. 2010. [Pin04] M H Pinson and S Wolf, ¿A new standardized method for objectively measuring video quality,¿ IEEE Trans. Broadcasting, vol. 50, no. 3, pp. 312¿322, 2004. [Pra01] William K. Pratt. 2001. Digital Image Processing: PIKS Inside (3rd ed.). John Wiley & Sons, Inc., New York, NY, USA. [Pre70] J. M. S. Prewitt, ¿Object Enhancement and Extraction,¿ in Picture Processing and Psychopictorics, B. S. Lipkin and A. Rosenfeld, Eds., Academic Press, New York. 1970. [Qui93] J.R. Quinlan. ¿C4.5: Programs for Machine Learning¿. Morgan Kaufmann, 1993. [Rao74] Ahmed, N.; Natarajan, T.; Rao, K.R., "Discrete Cosine Transform," in Computers, IEEE Transactions on , vol.C-23, no.1, pp.90-93, Jan. 1974. [Rao76] Rao, K.R.; Ahmed, N., "Orthogonal transforms for digital signal processing," in Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '76. , vol.1, no., pp.136-140, Apr 1976. [Rob65] L. G. Roberts, ¿Machine Perception of Three-Dimensional Solids,¿ in Optical and Electro-Optical Information Processing, J. T. Tippett et al., Eds., MIT Press, Cambridge, MA, 1965, 159¿197. [Rui14a] Ruiz, D.; Adzic, V.; Fernandez-Escribano, G.; Kalva, H.; Martinez, J.L.; Cuenca, P., "Fast partitioning algorithm for HEVC Intra frame coding using machine learning," in Image Processing (ICIP), 2014 IEEE International Conference on , vol., no., pp.4112-4116, 27-30 Oct. 2014. [Rui14b] Damián Ruiz, Velibor Adzic, Hari Kalva, Gerardo Fernández-Escribano, ¿Análisis estadístico del particionado de bloques en la predicción Intra-frame de HEVC.¿ Jornadas de Paralelismo (JP2014), Valladolid, Spain, Sep. 2014. [Rui15a] Ruiz-Coll1, D.; Fernandez-Escribano, G.; Martinez, J.L.; Cuenca, P., "Dominant gradient detection using Mean Directional Variance for intra-picture prediction in HEVC,¿ 15th International Conference on Computational and Mathematical Methods in Science and Engineering (CMMSE 2015), Rota, Spain, Volume: 1233-1244, Ju. 2015. [Rui15b] Damián Ruiz, Gerardo Fernández-Escribano, Velibor Adzic, Hari Kalva, José Luis Martínez, Pedro Cuenca, ¿Fast CU partitioning algorithm for HEVC intra coding using Data Mining.¿ Accepted to be published in the Multimedia Tools and Application Journal in 2016. [Rui15c] Damián Ruiz, Gerardo Fernández-Escribano, José Luis Martínez, Pedro Cuenca, ¿Fast intra mode decision algorithm based on texture orientation detection in HEVC.¿ Submitted to the Journal of Signal Processing: Image Communication. In process of major revisions. [Rui15d] Damián Ruiz, Gerardo Fernández-Escribano, José Luis Martínez, Pedro Cuenca, ¿Análisis estadístico de la eficiencia de la predicción direccional intra-cuadro en HEVC.¿ Jornadas de Paralelismo (JP2015), Cordoba, Spain, Sep. 2015. [Sax11] A. Saxena and F. C. Fernandes, CE7: Mode-Dependent DCT/DST Without 4*4 Full Matrix Multiplication for Intra Prediction, JCTVCE125, Geneva, Switzerland, Mar. 2011. [Sch07] Schwarz, H.; Marpe, D.; Wiegand, T., "Overview of the Scalable Video Coding Extension of the H.264/AVC Standard," in Circuits and Systems for Video Technology, IEEE Transactions on , vol.17, no.9, pp.1103-1120, Sept. 2007. [She13] Liquan Shen; Zhaoyang Zhang; Ping An, "Fast CU size decision and mode decision algorithm for HEVC intra coding," Consumer Electronics, IEEE Transactions on, vol.59, no.1, pp.207-213, Feb. 2013. [Sil12] da Silva, T.L.; Agostini, L.V.; da Silva Cruz, L.A., "Fast HEVC intra prediction mode decision based on EDGE direction information," Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European, vol., no., pp.1214,1218, 27-31 Aug. 2012. [Sol12] Sole, J.; Joshi, R.; Nguyen Nguyen; Tianying Ji; Karczewicz, M.; Clare, G.; Henry, F.; Duenas, A., "Transform Coefficient Coding in HEVC," in Circuits and Systems for Video Technology, IEEE Transactions on , vol.22, no.12, pp.1765-1777, Dec. 2012. [Sto01] Stolberg, H.-J.; Berekovic, M.; Pirsch, P.; Runge, H., "Implementing the MPEG-4 advanced simple profile for streaming video applications," in Multimedia and Expo, 2001. ICME 2001. IEEE International Conference on , vol., no., pp.230-233, 22-25 Aug. 2001. [Sul99] Sullivan GJ, Wiegand T (1999) Rate-distortion optimization for video compression. IEEE Signal Process J 74 -90. [Sul12] Sullivan, G.J.; Ohm, J.; Woo-Jin Han; Wiegand, T., "Overview of the High Efficiency Video Coding (HEVC) Standard," in Circuits and Systems for Video Technology, IEEE Transactions on , vol.22, no.12, pp.1649-1668, Dec. 2012. [Sun12] Heming Sun; Dajiang Zhou; Goto, S., "A Low-Complexity HEVC Intra Prediction Algorithm Based on Level and Mode Filtering," Multimedia and Expo (ICME), 2012 IEEE International Conference on , vol., no., pp.1085,1090, 9-13 July 2012 [Sze12] Sze, V.; Budagavi, M., "High Throughput CABAC Entropy Coding in HEVC," in Circuits and Systems for Video Technology, IEEE Transactions on , vol.22, no.12, pp.1778-1791, Dec. 2012. [Sze13] Sze, V.; Budagavi, M., "A comparison of CABAC throughput for HEVC/H.265 VS. AVC/H.264," Signal Processing Systems (SiPS), 2013 IEEE Workshop on , vol., no., pp.165,170, 16-18 Oct. 2013. [Tia12] Guifen Tian; Goto, S., "Content adaptive prediction unit size decision algorithm for HEVC intra coding," Picture Coding Symposium (PCS), 2012, vol., no., pp.405,408, 7-9 May 2012. [Tou05] Tourapis, A.M.; Feng Wu; Shipeng Li, "Direct mode coding for bipredictive slices in the H.264 standard," in Circuits and Systems for Video Technology, IEEE Transactions on , vol.15, no.1, pp.119-126, Jan. 2005. [Ugu10] K. Ugur, K. R. Andersson, and A. Fuldseth, Video Coding Technology Proposal by Tandberg, Nokia, and Ericsson, JCTVC-A119, Dresden, Germany, Apr. 2010. [VCEG-AM91]VCEG and MPEG. Joint call for proposals on video compression technology. Doc. VCEG-AM91. ITU-T SG16/Q6 VCEG, Kyoto, JP (2010). [VCEG-M33] G. Bjøntegaard, "Calculation of average PSNR differences between RD-curves", ITU-T SG16 Q.6 Document, VCEG-M33, Austin, US, Apr. 2001. [Vel06] Velisavljevic, V.; Beferull-Lozano, B.; Vetterli, M.; Dragotti, P.L., "Directionlets: anisotropic multidirectional representation with separable filtering," Image Processing, IEEE Transactions on , vol.15, no.7, pp.1916,1933, July 2006. [Was10] Wasikowski, M.; Xue-wen Chen, "Combating the Small Sample Class Imbalance Problem Using Feature Selection," Knowledge and Data Engineering, IEEE Transactions on , vol.22, no.10, pp.1388,1400, Oct. 2010. [Wie03] Wiegand, T.; Sullivan, G.J.; Bjontegaard, G.; Luthra, A., "Overview of the H.264/AVC video coding standard," in Circuits and Systems for Video Technology, IEEE Transactions on , vol.13, no.7, pp.560-576, July 2003. [Wie03] M. Wien, ¿Variable block-size transform for H.264/AVC,¿ IEEE Trans. Circuits Syst. Video Technol., vol. 13, no. 7, pp. 604¿619, Jul. 2003. [Wit05] Ian H. Witten and Eibe Frank. ¿Data Mining: Practical Machine Learning Tools and Techniques¿. 2nd Edition, Morgan Kaufmann, San Francisco, 2005. [Yan12] Shunqing Yan; Liang Hong; Weifeng He; Qin Wang, "Group-Based Fast Mode Decision Algorithm for Intra Prediction in HEVC," Signal Image Technology and Internet Based Systems (SITIS), 2012 Eighth International Conference on , vol., no., pp.225,229, 25-29 Nov. 2012. [Yao14] Yingbiao Yao;, Xiaojuan Li,Yu Lu, "Fast intra mode decision algorithm for HEVC based on dominant edge assent distribution," Multimedia Tools and Applications journal, Springer US, vol., pp. 1,19, 2014. [Zha14] Liang Zhao, Xiaopeng Fan, Siwei Ma, Debin Zhao, ¿Fast intra-encoding algorithm for High Efficiency Video Coding,¿ Signal Processing: Image Communication, Volume 29, Issue 9, October 2014, Pages 935-944.


Fundación Dialnet

Dialnet Plus

  • Más información sobre Dialnet Plus

Opciones de compartir

Opciones de entorno