{"product_id":"19aes-eb08-applications-in-audio","title":"19AES-EB08: Applications in Audio","description":"\u003cp\u003eSaturday, October 19, 3:30 pm — 4:30 pm (1E11)\u003c\/p\u003e\n\u003cp\u003eChair:\u003cbr\u003e\u003cem\u003e\u003ca href=\"http:\/\/www.aes.org\/events\/147\/presenters\/?ID=8936\"\u003eSunil G. Bharitkar\u003c\/a\u003e\u003c\/em\u003e, HP Labs., Inc. - San Francisco, CA, USA\u003cbr\u003e\u003cbr\u003eEB8-1 Vibrary: A Consumer-Trainable Music Tagging Utility—\u003cem\u003e\u003ca href=\"http:\/\/www.aes.org\/events\/147\/presenters\/?ID=8586\"\u003eScott Hawley\u003c\/a\u003e\u003c\/em\u003e, Belmont University - Nashville, TN, USA; \u003cem\u003e\u003ca href=\"http:\/\/www.aes.org\/events\/147\/presenters\/?ID=8927\"\u003eJason Bagley\u003c\/a\u003e\u003c\/em\u003e, Art+Logic - Pasadena, CA, USA; \u003cem\u003e\u003ca href=\"http:\/\/www.aes.org\/events\/147\/presenters\/?ID=8928\"\u003eBrett Porter\u003c\/a\u003e\u003c\/em\u003e, Art+Logic - Fanwood, NJ, USA; \u003cem\u003e\u003ca href=\"http:\/\/www.aes.org\/events\/147\/presenters\/?ID=8929\"\u003eDaisey Traynham\u003c\/a\u003e\u003c\/em\u003e, Art+Logic - Pasadena, CA, USA\u003cbr\u003eWe present the engineering underlying a consumer application to help music industry professionals find audio clips and samples of personal interest within their large audio libraries typically consisting of heterogeneously-labeled clips supplied by various vendors. We enable users to train an indexing system using their own custom tags (e.g., instruments, genres, moods), by means of convolutional neural networks operating on spectrograms. Since the intended users are not data scientists and may not possess the required computational resources (i.e., Graphics Processing Units, GPUs), our primary contributions consist of (i) designing an intuitive user experience for a local client application to help users create representative spectrogram datasets, and (ii) \"seamless\" integration with a cloud-based GPU server for efficient neural network training.\u003cbr\u003e\u003cbr\u003eEB8-2 Casualty Accessible and Enhanced (A\u0026amp;E) Audio: Trialling Object-Based Accessible TV Audio—\u003cem\u003e\u003ca href=\"http:\/\/www.aes.org\/events\/147\/presenters\/?ID=8930\"\u003eLauren Ward\u003c\/a\u003e\u003c\/em\u003e, University of Salford - Salford, UK; BBC R\u0026amp;D - Salford, UK; \u003cem\u003e\u003ca href=\"http:\/\/www.aes.org\/events\/147\/presenters\/?ID=8931\"\u003eMatthew Paradis\u003c\/a\u003e\u003c\/em\u003e, BBC Research and Development - London, UK; \u003cem\u003e\u003ca href=\"http:\/\/www.aes.org\/events\/147\/presenters\/?ID=8932\"\u003eBen Shirley\u003c\/a\u003e\u003c\/em\u003e, University of Salford - Salford, Greater Manchester, UK; Salsa Sound Ltd - Salford, Greater Manchester, UK; \u003cem\u003e\u003ca href=\"http:\/\/www.aes.org\/events\/147\/presenters\/?ID=8933\"\u003eLaura Russon\u003c\/a\u003e\u003c\/em\u003e, BBC Studios - Cardiff, Wales, UK; \u003cem\u003e\u003ca href=\"http:\/\/www.aes.org\/events\/147\/presenters\/?ID=8934\"\u003eRobin Moore\u003c\/a\u003e\u003c\/em\u003e, BBC Research \u0026amp; Development - Salford, UK; \u003cem\u003e\u003ca href=\"http:\/\/www.aes.org\/events\/147\/presenters\/?ID=8935\"\u003eRhys Davies\u003c\/a\u003e\u003c\/em\u003e, BBC Studios - Cardiff, Wales, UK\u003cbr\u003eCasualty Accessible and Enhanced (A\u0026amp;E) Audio is the first public trial of accessible audio technology using a narrative importance approach. This trial allows viewers to personalize the audio of an episode of the BBC’s \"Casualty\" drama series based on their hearing needs. Using a simple interface the audio can be varied between the broadcast mix and an accessible mix containing narratively important non-speech sounds, enhanced dialogue, and attenuated background sounds. This paper describes the trial’s development, implementation, and it’s evaluation by normal and hard of hearing listeners (n=5209 on 20\/8\/2019). 299 participants also completed a survey, rating the technology 3.6\/5 stars. 73% reported the technology made the content more enjoyable or easier to understand.\u003cbr\u003e\u003cbr\u003eEB8-3 Generative Modeling of Metadata for Machine Learning Based Audio Content Classification—\u003cem\u003e\u003ca href=\"http:\/\/www.aes.org\/events\/147\/presenters\/?ID=8936\"\u003eSunil G. Bharitkar\u003c\/a\u003e\u003c\/em\u003e, HP Labs., Inc. - San Francisco, CA, USA\u003cbr\u003eAutomatic content classification technique is an essential tool in multimedia applications. Present research for audio-based classifiers look at short- and long-term analysis of signals, using both temporal and spectral features. In this paper we present a neural network to classify between the movie (cinematic, TV shows), music, and voice using metadata contained in either the audio\/video stream. Towards this end, statistical models of the various metadata are created since a large metadata dataset is not available. Subsequently, synthetic metadata are generated from these statistical models, and the synthetic metadata is input to the ML classifier as feature vectors. The resulting classifier is then able to classify real-world content (e.g., YouTube) with an accuracy ˜ 90% with very low latency (viz., ˜ on an average 7 ms) based on real-world metadata.\u003cbr\u003e\u003cbr\u003eEB8-4 Individual Headphone Equalization at the Eardrum with New Apps for Computers and Cellphones—\u003cem\u003e\u003ca href=\"http:\/\/www.aes.org\/events\/147\/presenters\/?ID=8425\"\u003eDavid Griesinger\u003c\/a\u003e\u003c\/em\u003e, David Griesinger Acoustics - Cambridge, MA, USA\u003cbr\u003eEar canal resonances that concentrate energy on the eardrum are highly individual, and headphones alter or eliminate them. The result is inaccurate timbre and in-head localization. We have developed computer apps that use an equal loudness test to match the sound spectrum at the eardrum from a pair of headphones to the spectrum at the eardrums from a frontal loudspeaker. The result is precise timbre and frontal localization. The improvement in sound is startling. In this presentation we will demonstrate the process and the easy to use software that is now available for VST, AAX, Windows, MAC, Android and IOS cellphones. \u003cem\u003e[Presentation only; not available in E-Library]\u003c\/em\u003e\u003c\/p\u003e","brand":"Audio Engineering Society","offers":[{"title":"Default Title","offer_id":49970094735675,"sku":"19AES-EB08","price":18.0,"currency_code":"USD","in_stock":true}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0903\/3560\/9147\/files\/19AESPic_b5df2008-dca9-4fe6-8682-2ab813bf16b5.jpg?v=1737908009","url":"https:\/\/mobiltape.com\/products\/19aes-eb08-applications-in-audio","provider":"Mobiltape","version":"1.0","type":"link"}