AES Convention Papers Forum

Bandwidth Extension Method Based on Generative Adversarial Nets for Audio Compression

The compression ratio of a core encoder can be improved significantly by reducing the bandwidth of the audio signal, but doing so degrades the perceived listening quality. This paper proposes a bandwidth extension method based on generative adversarial nets (GAN) that extends the bandwidth of an audio signal to produce a more natural sound. The method uses a GAN as a generative model to fit the distribution of the high-frequency MDCT coefficients of audio signals. Through the minimax two-player game between generator and discriminator, more natural high-frequency information can be estimated. A codec system is built on this basis. To evaluate the proposed bandwidth extension system, MUSHRA experiments were carried out, and the results show that its performance is comparable to that of HE-AAC.
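The abstract names two ingredients, MDCT coefficients split into low and high bands, and a minimax game between a generator and a discriminator. The following is a minimal NumPy sketch of both, not the authors' implementation. The function names (`mdct`, `split_bands`, `gan_losses`), the frame length, and the cutoff bin are assumptions made for illustration. The MDCT is written in direct form without a window, and the losses follow the standard GAN objective (E[log D(x)] + E[log(1 - D(G(z)))]), which may differ from the loss used in the paper.

```python
import numpy as np

def mdct(frame):
    """Direct-form MDCT (no window): 2N time samples -> N coefficients."""
    two_n = len(frame)
    n = two_n // 2
    k = np.arange(n)[:, None]        # coefficient index, column vector
    t = np.arange(two_n)[None, :]    # time index, row vector
    basis = np.cos(np.pi / n * (t + 0.5 + n / 2) * (k + 0.5))
    return basis @ frame

def split_bands(coeffs, cutoff_bin):
    """Split MDCT coefficients into the low band and the high band.

    In a bandwidth extension codec the core encoder would carry only the
    low band; the high band is what a generator would try to estimate.
    """
    return coeffs[:cutoff_bin], coeffs[cutoff_bin:]

def gan_losses(d_real, d_fake, eps=1e-12):
    """Standard GAN losses from discriminator output probabilities.

    d_real: D's outputs on true high-band coefficients.
    d_fake: D's outputs on generated high-band coefficients.
    The discriminator maximizes log D(x) + log(1 - D(G(z))), so its loss
    is the negative of that; the generator minimizes log(1 - D(G(z))).
    """
    d_loss = -np.mean(np.log(d_real + eps) + np.log(1.0 - d_fake + eps))
    g_loss = np.mean(np.log(1.0 - d_fake + eps))
    return d_loss, g_loss

# Hypothetical usage: one 16-sample frame, cutoff at bin 5 (both arbitrary).
frame = np.sin(2 * np.pi * 3 * np.arange(16) / 16)
low_band, high_band = split_bands(mdct(frame), cutoff_bin=5)
d_loss, g_loss = gan_losses(np.array([0.9, 0.8]), np.array([0.2, 0.1]))
```

In a full system the generator would be a trained network mapping the decoded low band to an estimate of `high_band`, and the discriminator would score real against generated high-band coefficients; neither network is specified in the abstract, so both are left out here.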
