Repository logo
  • Log In
    Log in via Symplectic to deposit your publication(s).
Repository logo
  • Communities & Collections
  • Research Outputs
  • Statistics
  • Log In
    Log in via Symplectic to deposit your publication(s).
  1. Home
  2. Faculty of Engineering
  3. Dyson School of Design Engineering
  4. HRTF spatial upsampling in the spherical harmonics domain employing a generative adversarial network
 
  • Details
HRTF spatial upsampling in the spherical harmonics domain employing a generative adversarial network
File(s)
DAFx24_paper_10.pdf (1.89 MB)
Published version
Author(s)
Hu, Xuyi
Li, Jian
Picinali, Lorenzo
Hogg, Aidan
Type
Conference Paper
Abstract
A Head-Related Transfer Function (HRTF) is able to capture alterations a sound wave undergoes from its source before it reaches the entrances of a listener’s left and right ear canals, and is imperative for creating immersive experiences in virtual and augmented reality (VR/AR). Nevertheless, creating personalized HRTFs demands sophisticated equipment and is hindered by time-consuming data acquisition processes. To counteract these challenges, various techniques for HRTF interpolation and up-sampling have been proposed. This paper illustrates how Generative Adversarial Networks (GANs) can be applied to HRTF data upsampling in the spherical harmonics domain. We propose using Autoencoding Generative Adversarial Networks (AE-GAN) to upsample low-degree spherical harmonics coefficients and get a more accurate representation of the full HRTF set. The proposed method is bench-marked against two baselines: barycentric interpolation and HRTF
selection. Results from log-spectral distortion (LSD) evaluation suggest that the proposed AE-GAN has significant potential for upsampling very sparse HRTFs, achieving 17% improvement over baseline methods.
Date Issued
2024-09-03
Date Acceptance
2024-05-08
Citation
Proceedings of the 27th International Conference on Digital Audio Effects (DAFx24), 2024, pp.396-403
URI
http://hdl.handle.net/10044/1/112397
URL
https://www.axdesign.co.uk/publications/hrtf-spatial-upsampling-in-the-spherical-harmonics-domain-employing-a-generative-adversarial-network
Publisher
University of Surrey
Start Page
396
End Page
403
Journal / Book Title
Proceedings of the 27th International Conference on Digital Audio Effects (DAFx24)
Copyright Statement
Copyright: © 2024 Xuyi Hu*
et al. This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License, which
permits unrestricted use, distribution, adaptation, and reproduction in any medium,
provided the original author and source are credited.
License URL
https://creativecommons.org/licenses/by/4.0/
Identifier
https://www.axdesign.co.uk/publications/hrtf-spatial-upsampling-in-the-spherical-harmonics-domain-employing-a-generative-adversarial-network
Source
27th International Conference on Digital Audio Effects (DAFx24)
Publication Status
Published
Start Date
2024-09-03
Finish Date
2024-09-07
Coverage Spatial
Guildford, UK
Date Publish Online
2024-09-03
About
Spiral Depositing with Spiral Publishing with Spiral Symplectic
Contact us
Open access team Report an issue
Other Services
Scholarly Communications Library Services
logo

Imperial College London

South Kensington Campus

London SW7 2AZ, UK

tel: +44 (0)20 7589 5111

Accessibility Modern slavery statement Cookie Policy

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement
  • Send Feedback