Going Deep in Point Cloud Coding

Fernando Pereira

Instituto Superior Técnico, Universidade de Lisboa - Instituto de Telecomunicações


With the rising popularity of virtual and augmented reality applications, 3D visual representation formats such as point clouds (PCs) have become a hot research topic. Since a PC is essentially a set of points in 3D space with associated features, PCs are naturally suited to facilitating user interaction and offering a high level of immersion. However, as providing realistic, interactive and immersive experiences typically requires PCs with a rather large number of points, efficient coding is critical, as recognized by standardization groups such as MPEG and JPEG, which have been developing PC coding standards. Scalability is a requirement for many PC applications where the access time to a PC is relevant: a first version of the PC, even if at lower quality or resolution, is obtained quickly, usually by partially decoding a bitstream structured in multiple layers. Scalability may come at the cost of reduced compression efficiency, and scalable PC coding remains a relatively unexplored coding paradigm in the literature.

The popularity of deep learning in multimedia processing tasks has grown considerably in recent years due to its impressive performance. In terms of coding, recent deep learning-based image coding solutions offer very promising results, even outperforming state-of-the-art image codecs. Part of this success may be attributed to convolutional neural networks, which exploit spatial redundancy by hierarchically detecting patterns to obtain a more meaningful latent representation. It is therefore natural to extend the deep learning-based coding approach to PCs, for example by coding 3D blocks of voxels instead of the 2D blocks of pixels used in image and video coding.
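As an illustration of the "3D blocks of voxels" idea, the following is a minimal sketch, not taken from the talk, of how PC geometry might be quantized to a voxel grid and split into cubic blocks, each of which can then be expanded into a binary occupancy grid (the 3D analogue of a block of pixels). The function names, the voxel size and the block size are all assumptions chosen for illustration.

```python
from collections import defaultdict


def voxelize_into_blocks(points, voxel_size=1.0, block_size=4):
    """Quantize 3D points to a voxel grid and group occupied voxels into cubic blocks.

    Returns a dict mapping a block index (bx, by, bz) to the set of occupied
    local voxel coordinates (x, y, z), each in range(block_size).
    Hypothetical helper for illustration only.
    """
    blocks = defaultdict(set)
    for x, y, z in points:
        # Integer voxel coordinates; floor division also handles negative values.
        vx, vy, vz = (int(c // voxel_size) for c in (x, y, z))
        block = (vx // block_size, vy // block_size, vz // block_size)
        local = (vx % block_size, vy % block_size, vz % block_size)
        blocks[block].add(local)  # points falling in the same voxel merge
    return dict(blocks)


def block_to_occupancy(local_voxels, block_size=4):
    """Expand a block's occupied voxels into a dense 0/1 grid indexed [x][y][z]."""
    grid = [[[0] * block_size for _ in range(block_size)] for _ in range(block_size)]
    for x, y, z in local_voxels:
        grid[x][y][z] = 1
    return grid


# Toy point cloud: the first two points fall into the same voxel.
points = [(0.2, 0.7, 0.1), (0.4, 0.9, 0.3), (5.5, 1.0, 2.2), (3.9, 3.9, 3.9)]
blocks = voxelize_into_blocks(points)
grid = block_to_occupancy(blocks[(1, 0, 0)])
```

Each dense occupancy grid could serve as the input to a 3D convolutional autoencoder, in the same way that a block of pixels is the input in learned image coding; the network itself is outside the scope of this sketch.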

This talk will address the emerging developments in point cloud coding, notably the recent MPEG and JPEG standardization projects as well as the very recent deep learning-based coding approach, with a special focus on scalability.


Fernando Pereira is currently with the Department of Electrical and Computer Engineering of Instituto Superior Técnico and with Instituto de Telecomunicações, Lisbon, Portugal.

He is Area Editor of the Signal Processing: Image Communication Journal and Associate Editor of the EURASIP Journal on Image and Video Processing, and is or has been a member of the Editorial Board of the Signal Processing Magazine and Associate Editor of IEEE Transactions on Circuits and Systems for Video Technology, IEEE Transactions on Image Processing, IEEE Transactions on Multimedia, and IEEE Signal Processing Magazine. From 2013 to 2015, he was the Editor-in-Chief of the IEEE Journal of Selected Topics in Signal Processing.

He was an IEEE Distinguished Lecturer in 2005 and was elected an IEEE Fellow in 2008 for “contributions to object-based digital video representation technologies and standards”. He was elected to serve on the Signal Processing Society Board of Governors in the capacity of Member-at-Large for a 2012 term and a 2014-2016 term. Since January 2018, he has been the SPS Vice-President for Conferences.

Since 2013, he has also been a EURASIP Fellow for “contributions to digital video representation technologies and standards”. He was elected to serve on the European Signal Processing Society Board of Directors for a 2015-2018 term. Since 2015, he has also been an IET Fellow.

He was the MPEG Requirements Group Chair from 2002 to 2007 and has been the JPEG Requirements Group Chair since February 2016.

He is or has been a member of the Scientific and Program Committees of many international conferences and workshops. He was the General Chair of the Picture Coding Symposium (PCS) in 2007, the Technical Program Co-Chair of the International Conference on Image Processing (ICIP) in 2010 and 2016, the Technical Program Chair of the International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS) in 2008 and 2012, and the General Chair of the International Conference on Quality of Multimedia Experience (QoMEX) in 2016.

He has contributed more than 300 papers to international journals, conferences and workshops, and has given several dozen invited talks at conferences and workshops. His areas of interest are visual data analysis, coding, description, adaptation, quality assessment and advanced multimedia services.


