- Research Article
- Open access
- Published:
Audio Key Finding: Considerations in System Design and Case Studies on Chopin's 24 Preludes
EURASIP Journal on Advances in Signal Processing volume 2007, Article number: 056561 (2006)
Abstract
We systematically analyze audio key finding to determine factors important to system design, and the selection and evaluation of solutions. First, we present a basic system, fuzzy analysis spiral array center of effect generator algorithm, with three key determination policies: nearest-neighbor (NN), relative distance (RD), and average distance (AD). AD achieved a 79% accuracy rate in an evaluation on 410 classical pieces, more than 8% higher RD and NN. We show why audio key finding sometimes outperforms symbolic key finding. We next propose three extensions to the basic key finding system—the modified spiral array (mSA), fundamental frequency identification (F0), and post-weight balancing (PWB)—to improve performance, with evaluations using Chopin's Preludes (Romantic repertoire was the most challenging). F0 provided the greatest improvement in the first 8 seconds, while mSA gave the best performance after 8 seconds. Case studies examine when all systems were correct, or all incorrect.
References
Chew E: Towards a mathematical model of tonality, Doctoral dissertation.
Chew E: Modeling tonality: applications to music cognition. Proceedings of the 23rd Annual Meeting of the Cognitive Science Society (CogSci '01), August 2001, Edinburgh, Scotland, UK 206–211.
Chuan C-H, Chew E: Fuzzy analysis in pitch-class determination for polyphonic audio key finding. Proceedings of the 6th International Conference on Music Information Retrieval (ISMIR '05), September 2005, London, UK 296–303.
Longuet-Higgins HC, Steedman MJ: On interpreting bach. In Machine Intelligence. Volume 6. Edinburgh University Press, Edinburgh, Scotland, UK; 1971:221–241.
Krumhansl CL: Quantifying tonal hierarchies and key distances. In Cognitive Foundations of Musical Pitch. Oxford University Press, New York, NY, USA; 1990:16–49. chapter 2
Temperley D: What's key for key? the Krumhansl-Schmuckler key-finding algorithm reconsidered. Music Perception 1999,17(1):65–100.
Chuan C-H, Chew E: Polyphonic audio key finding using the spiral array CEG algorithm. Proceedings of IEEE International Conference on Multimedia and Expo (ICME '05), July 2005, Amsterdam, The Netherlands 21–24.
Gómez E, Herrera P: Estimating the tonality of polyphonic audio files: cognitive versus machine learning modelling strategies. Proceedings of 5th International Conference on Music Information Retrieval (ISMIR '04), October 2004, Barcelona, Spain 92–95.
Pauws S: Musical key extraction from audio. Proceedings of 5th International Conference on Music Information Retrieval (ISMIR '04), October 2004, Barcelona, Spain 96–99.
1st Annual Music Information Retrieval Evaluation eXchange, MIREX 2005, https://doi.org/www.music-ir.org/mirex2005/index.php/Main_Page
Chuan C-H, Chew E: Audio key finding using FACEG: fuzzy analysis with the CEG algorithm. Abstract of the 1st Annual Music Information Retrieval Evaluation eXchange (MIREX '05), September 2005, London, UK
Gómez E: Key estimation from polyphonic audio. Abstract of the 1st Annual Music Information Retrieval Evaluation eXchange (MIREX '05), September 2005, London, UK
İzmirli Ö: An algorithm for audio key finding. Abstract of the 1st Annual Music Information Retrieval Evaluation eXchange (MIREX '05), September 2005, London, UK
Pauws S: KEYEX: audio key extraction. Abstract of the 1st Annual Music Information Retrieval Evaluation eXchange (MIREX '05), September 2005, London, UK
Purwins H, Blankertz B: Key finding in audio. Abstract of the 1st Annual Music Information Retrieval Evaluation eXchange (MIREX '05), September 2005, London, UK
Zhu Y: An audio key finding algorithm. Abstract of the 1st Annual Music Information Retrieval Evaluation eXchange (MIREX '05), September 2005, London, UK
Chew E, François ARJ: Interactive multi-scale visualizations of tonal evolution in MuSA.RT Opus 2. Computers in Entertainment 2005,3(4):1–16. special issue on Music Visualization
Chew E, Chen Y-C: Mapping MIDI to the spiral array: disambiguating pitch spellings. Proceedings of the 8th INFORMS Computing Society Conference (ICS '03), January 2003, Chandler, Ariz, USA 259–275.
Chew E, Chen Y-C: Real-time pitch spelling using the spiral array. Computer Music Journal 2005,29(2):61–76. 10.1162/0148926054094378
İzmirli Ö: Template based key finding from audio. Proceedings of the International Computer Music Conference (ICMC '05), September 2005, Barcelona, Spain
Electronic Music Studios in the University of Iowa, https://doi.org/theremin.music.uiowa.edu/MIS.html
Klapuri AP: Multiple fundamental frequency estimation based on harmonicity and spectral smoothness. IEEE Transactions on Speech and Audio Processing 2003,11(6):804–816. 10.1109/TSA.2003.815516
Klapuri A: A perceptually motivated multiple-F0 estimation method. Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, October 2005, New Paltz, NY, USA
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Chuan, CH., Chew, E. Audio Key Finding: Considerations in System Design and Case Studies on Chopin's 24 Preludes. EURASIP J. Adv. Signal Process. 2007, 056561 (2006). https://doi.org/10.1155/2007/56561
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1155/2007/56561