Reference Software in MPEG

There are more and more research articles on point cloud compression. Some propose new methods or improvements on top of V-PCC and G-PCC, others analyse and discuss the various compression approaches and tools in V-PCC and G-PCC, and others present compression results. A common characteristic of many of these articles is that they use the MPEG Reference Software to generate the V-PCC and G-PCC bitstreams. In general, using the Reference Software is OK; however, the authors should understand what Reference Software means, how it should be used and how it should not be used.

The purpose of this article is to clarify the role of the MPEG PCC Reference Software.

The main objective of MPEG PCC is to standardize the bitstream syntax of an encoded representation of point cloud data. The direct implication of this is that the decoder behaviour is standardized (what the decoder should do when reading a bit from the compressed data). A less direct implication is that the encoder is NOT standardized: any encoder able to output a conformant bitstream is acceptable, whatever means it uses to create it. In this way, MPEG allows competition in the market, with different organisations able to invent new mechanisms and tools that better exploit signal properties and therefore obtain better compression performance. Such a strategy also gives the MPEG standards a longer life, since the same version of the decoder can decode less or more performant bitstreams (several generations of Encoders can be built on the same Decoder).

During the incremental production of the standard (the textual specification), MPEG also produces what is called a Test Model (TM), a pair of Encoder and Decoder software implementations used to evaluate the tools proposed by the various entities participating in the standardisation. All tools bringing benefits matter, and those with normative impact (meaning impact on the bitstream syntax) are included in the TM for the next iteration. Sometimes tools with non-normative impact may also be included in the encoder, but this is not the general rule. In general, tools with non-normative impact are kept by companies for use in their own products, thereby obtaining a competitive advantage in the market.

At the end of the standardisation process, the TM, containing only the tools retained in the standard, becomes the Reference Software. It consists of a Decoder, able to decode conformant bitstreams, and (sometimes) an Encoder, able to produce them. In the case of V-PCC and G-PCC, there are two TMs, each of them containing both the Encoder and the Decoder. The Reference Software is freely distributed by ISO, and many companies use it as a basis for their products, substantially improving it (especially the Encoder) before putting it on the market.

OK, you may say, so what?! This is an old story everybody knows, so what is the point here?

As I mentioned at the beginning of this post, many papers use the V-PCC and G-PCC Reference Software and present compression performance results. The problem is that they forget to mention that what they compare is the performance of a non-optimized Encoder whose roles are to help the development of the standard (validate/invalidate tools) during the two years it takes to create it, and to lower the entry barrier for standard users. A compression performance analysis of the Reference Software is not representative of the full capability of the standard. As an example, for MPEG-4 Video, the third generation of commercial encoders (using the same bitstream syntax) was three times better than the Reference Software. The same is expected to happen for V-PCC and G-PCC, and then such articles become obsolete, even before reaching their readers.
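To make concrete what such comparisons actually measure: papers of this kind typically report Bjøntegaard Delta rate (BD-rate) figures computed from a handful of rate-distortion points per codec, always relative to an anchor. Below is a minimal, illustrative sketch of that computation in Python. It is my own code, not part of the Reference Software, and the rate/PSNR numbers in the example are invented purely for illustration. The thing to notice is that the resulting percentage is entirely relative to the anchor curve, so swapping the Reference Software Encoder for an optimized product encoder as the anchor can flip the conclusion.

```python
import numpy as np

def bd_rate(rate_anchor, psnr_anchor, rate_test, psnr_test):
    """Bjontegaard Delta rate: average bitrate change (%) of the test
    codec versus the anchor codec at equal objective quality (PSNR)."""
    # Fit a cubic polynomial to log(rate) as a function of PSNR for each curve
    p_anchor = np.polyfit(psnr_anchor, np.log(rate_anchor), 3)
    p_test = np.polyfit(psnr_test, np.log(rate_test), 3)
    # Integrate both fits over the PSNR range covered by both curves
    lo = max(min(psnr_anchor), min(psnr_test))
    hi = min(max(psnr_anchor), max(psnr_test))
    int_anchor = np.polyval(np.polyint(p_anchor), hi) - np.polyval(np.polyint(p_anchor), lo)
    int_test = np.polyval(np.polyint(p_test), hi) - np.polyval(np.polyint(p_test), lo)
    # Average log-rate difference over that range, converted to a percentage
    avg_diff = (int_test - int_anchor) / (hi - lo)
    return (np.exp(avg_diff) - 1) * 100

# Invented rate (kbps) / PSNR (dB) points, only to show how the metric behaves:
# the same "proposal" looks very different against a weak vs. a strong anchor.
reference_sw = ([2000, 4000, 8000, 16000], [60.0, 64.0, 68.0, 72.0])
product_enc  = ([1000, 2000, 4000,  8000], [60.5, 64.5, 68.5, 72.5])
proposal     = ([1800, 3600, 7200, 14400], [60.2, 64.2, 68.2, 72.2])

print("proposal vs Reference Software anchor: %.1f%%" % bd_rate(*reference_sw, *proposal))
print("proposal vs optimized encoder anchor : %.1f%%" % bd_rate(*product_enc, *proposal))
```

With these invented numbers, the same proposal shows a bitrate saving against the Reference Software anchor and a large bitrate increase against the stronger anchor, which is exactly why the choice and the nature of the anchor encoder has to be stated and understood.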