Skip to main content
  • Research Article
  • Open access
  • Published:

An Overcomplete Signal Basis Approach to Nonlinear Time-Tone Analysis with Application to Audio and Speech Processing

Abstract

Although a beating tone and the two pure tones which give rise to it are linearly dependent, the ear considers them to be independent as tone sensations. A linear time-frequency representation of acoustic data is unable to model these phenomena. A time-tone sensation approach is proposed for inclusion within audio analysis systems. The proposed approach extends linear time-frequency analysis of acoustic data, by accommodating the nonlinear phenomenon of beats. The method replaces the one-dimensional tonotopic axis of linear time-frequency analysis with a two-dimensional tonotopic plane, in which one direction corresponds to tone, and the other to its frequency of modulation. Some applications to audio prostheses are discussed. The proposed method relies on an intuitive criterion of optimal representation which can be applied to any overcomplete signal basis, allowing for many signal processing applications.

References

  1. Rabiner L, Juang B-H: Fundamentals of Speech Recognition, Signal Processing Series. Prentice-Hall, Englewood Cliffs, NJ, USA; 1993.

    Google Scholar 

  2. Gold B, Morgan N: Speech and Audio Signal Processing: Processing and Perception of Speech and Music. John Wiley & Sons, New York, NY, USA; 2000.

    Google Scholar 

  3. Beyer RT: Sounds of Our Times: Two Hundred Years of Acoustics. American Institute of Physics, New York, NY, USA; 1998.

    Google Scholar 

  4. Moore BCJ: An Introduction to the Psychology of Hearing. 5th edition. Academic Press, London, UK; 2003.

    Google Scholar 

  5. Robles L, Ruggero MA, Rich NC: Two-tone distortion in the basilar membrane of the cochlea. Nature 1991, 349(6308):413–414. 10.1038/349413a0

    Article  Google Scholar 

  6. Rhode WS, Robles L: Evidence from Mossbauer experiments for nonlinear vibration in the cochlea. Journal of the Acoustical Society of America 1974, 55(3):588–596. 10.1121/1.1914569

    Article  Google Scholar 

  7. Robles L, Ruggero MA, Rich NC: Basilar membrane mechanics at the base of the chinchilla cochlea. I. Input-output functions, tuning curves, and response phases. Journal of the Acoustical Society of America 1986, 80(5):1364–1374. 10.1121/1.394389

    Article  Google Scholar 

  8. Murugasu E, Russell IJ: Salicylate ototoxicity: the effects on basilar membrane displacement, cochlear microphonics, and neural responses in the basal turn of the guinea pig cochlea. Auditory Neuroscience 1995, 1: 139–150.

    Google Scholar 

  9. Ruggero MA, Rich NC, Recio A, Narayan SS, Robles L: Basilar-membrane responses to tones at the base of the chinchilla cochlea. Journal of the Acoustical Society of America 1997, 101(4):2151–2163. 10.1121/1.418265

    Article  Google Scholar 

  10. Russell IJ, Nilsen KE: The location of the cochlear amplifier: spatial representation of a single tone on the Guinea pig basilar membrane. Proceedings of the National Academy of Sciences of the United States of America 1997, 94(6):2660–2664. 10.1073/pnas.94.6.2660

    Article  Google Scholar 

  11. Oxenham AJ, Moore BCJ: Additivity of masking in normally hearing and hearing-impaired subjects. Journal of the Acoustical Society of America 1995, 98(4):1921–1934. 10.1121/1.413376

    Article  Google Scholar 

  12. Oxenham AJ, Plack CJ: Suppression and the upward spread of masking. Journal of the Acoustical Society of America 1998, 104(6):3500–3510. 10.1121/1.423933

    Article  Google Scholar 

  13. Plack CJ, Oxenham AJ: Basilar membrane nonlinearity and the growth of forward masking. Journal of the Acoustical Society of America 1998, 103(3):1598–1608. 10.1121/1.421294

    Article  Google Scholar 

  14. Hicks ML, Bacon SP: Psychophysical measures of auditory nonlinearities as a function of frequency in individuals with normal hearing. Journal of the Acoustical Society of America 1999, 105(1):326–338. 10.1121/1.424526

    Article  Google Scholar 

  15. Moore BCJ, Vickers DA, Plack CJ, Oxenham AJ: Inter-relationship between different psychoacoustic measures assumed to be related to the cochlear active mechanism. Journal of the Acoustical Society of America 1999, 106(5):2761–2778. 10.1121/1.428133

    Article  Google Scholar 

  16. Oxenham AJ, Moore BCJ, Vickers DA: Short-term temporal integration: evidence for the influence of peripheral compression. Journal of the Acoustical Society of America 1997, 101(6):3676–3687. 10.1121/1.418328

    Article  Google Scholar 

  17. Yates GK: Basilar membrane nonlinearity and its influence on auditory nerve rate-intensity functions. Hearing Research 1990, 50(1–2):145–162. 10.1016/0378-5955(90)90041-M

    Article  Google Scholar 

  18. Moore BCJ, Glasberg BR: A model of loudness perception applied to cochlear hearing loss. Auditory Neuroscience 1997, 3: 289–311.

    Google Scholar 

  19. Moore BCJ, Oxenham AJ: Psychoacoustic consequences of compression in the peripheral auditory system. Psychological Review 1998, 105(1):108–124.

    Article  Google Scholar 

  20. Bohnke F, Arnold W: Nonlinear mechanics of the organ of Corti caused by Deiters cells. IEEE Transactions on Biomedical Engineering 1998, 45(10):1227–1233. 10.1109/10.720200

    Article  Google Scholar 

  21. Friedman DH: Implementation of a nonlinear wave-digital-filter cochlear model. Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP~'90), April 1990, Albuquerque, NM, USA 1: 397–400.

    Google Scholar 

  22. Hirahara T, Komakine T: A computational cochlear nonlinear preprocessing model with adaptive Q circuits. Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '89), May 1989, Glasgow, UK 1: 496–499.

    Google Scholar 

  23. Deng L, Kheirallah I: Dynamic formant tracking of noisy speech using temporal analysis on outputs from a nonlinear cochlear model. IEEE Transactions on Biomedical Engineering 1993, 40(5):456–467. 10.1109/10.243416

    Article  Google Scholar 

  24. Heil P, Rajan R, Irvine DRF: Topographic representation of tone intensity along the isofrequency axis of cat primary auditory cortex. Hearing Research 1994, 76(1–2):188–202. 10.1016/0378-5955(94)90099-X

    Article  Google Scholar 

  25. Reale RA, Imig TJ: Tonotopic organization in auditory cortex of the cat. Journal of Comparative Neurology 1980, 192(2):265–291. 10.1002/cne.901920207

    Article  Google Scholar 

  26. Morel A, Garraghty PE, Kaas JH: Tonotopic organization, architectonic fields, and connections of auditory cortex in macaque monkeys. Journal of Comparative Neurology 1993, 335(3):437–459. 10.1002/cne.903350312

    Article  Google Scholar 

  27. Cansino S, Williamson SJ, Karron D: Tonotopic organization of human auditory association cortex. Brain Research 1994, 663(1):38–50. 10.1016/0006-8993(94)90460-X

    Article  Google Scholar 

  28. Pantev C, Bertrand O, Eulitz C, et al.: Specific tonotopic organizations of different areas of human auditory cortex revealed by simultaneous magnetic and electric recordings. Electroencephalography and Clinical Neurophysiology 1995, 94(1):26–40. 10.1016/0013-4694(94)00209-4

    Article  Google Scholar 

  29. Lauter JL, Herscovitch P, Formby C, Raichle ME: Tonotopic organization in human auditory cortex revealed by positron emission tomography. Hearing Research 1985, 20(3):199–205. 10.1016/0378-5955(85)90024-3

    Article  Google Scholar 

  30. Lockwood AH, Salvi RJ, Coad ML, et al.: The functional anatomy of the normal human auditory system: responses to 0.5 and 4.0 kHz tones at varied intensities. Cerebral Cortex 1999, 9(1):65–76. 10.1093/cercor/9.1.65

    Article  Google Scholar 

  31. Wessinger CM, Buonocore MH, Kussmaul CL, Mangun GR: Tonotopy in human auditory cortex examined with functional magnetic resonance imaging. Human Brain Mapping 1997, 5(1):18–25. 10.1002/(SICI)1097-0193(1997)5:1<18::AID-HBM3>3.0.CO;2-Q

    Article  Google Scholar 

  32. Bilecen D, Scheffler K, Schmid N, Tschopp K, Seelig J: Tonotopic organization of the human auditory cortex as detected by BOLD-FMRI. Hearing Research 1998, 126(1–2):19–27. 10.1016/S0378-5955(98)00139-7

    Article  Google Scholar 

  33. Talavage TM, Ledden PJ, Benson RR, Rosen BR, Melcher JR: Frequency-dependent responses exhibited by multiple regions in human auditory cortex. Hearing Research 2000, 150(1–2):225–244. 10.1016/S0378-5955(00)00203-3

    Article  Google Scholar 

  34. Schönwiesner M, von Cramon DY, Rübsamen R: Is it tonotopy after all? NeuroImage 2002, 17(3):1144–1161. 10.1006/nimg.2002.1250

    Article  Google Scholar 

  35. Lee N, Schwartz SC: Robust transient signal detection using the oversampled Gabor representation. IEEE Transactions on Signal Processing 1995, 43(6):1498–1502. 10.1109/78.388862

    Article  Google Scholar 

  36. Dallos P, Cheatham MA: Nonlinearities in cochlear receptor potentials and their origins. Journal of the Acoustical Society of America 1989, 86(5):1790–1796. 10.1121/1.398611

    Article  Google Scholar 

  37. Wegel RL, Lane CE: The auditory masking of one pure tone by another and its probable relation to the dynamics of the inner ear. Physical Review 1924, 23(2):266–285. 10.1103/PhysRev.23.266

    Article  Google Scholar 

  38. Lim HH, Anderson DJ: Feasibility experiments for the development of a midbrain auditory prosthesis. Proceedings of 1st Annual International IEEE EMBS Conference on Neural Engineering, March 2003, Capri Island, Italy 193–196.

    Google Scholar 

  39. Snyder RL, Sinex DG, McGee JD, Walsh EW: Acute spiral ganglion lesions change the tuning and tonotopic organization of the cat inferior colliculus neurons. Hearing Research 2000, 147(1–2):200–220. 10.1016/S0378-5955(00)00132-5

    Article  Google Scholar 

  40. Mallat SG, Zhang Z: Matching pursuits with time-frequency dictionaries. IEEE Transactions on Signal Processing 1993, 41(12):3397–3415. 10.1109/78.258082

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Reilly, R.B. An Overcomplete Signal Basis Approach to Nonlinear Time-Tone Analysis with Application to Audio and Speech Processing. EURASIP J. Adv. Signal Process. 2006, 065431 (2006). https://doi.org/10.1155/ASP/2006/65431

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1155/ASP/2006/65431

Keywords