Real-time F0 Estimation of Melody and Bass Lines in Musical Audio Signals
This project is proposed and researched by
 
 Masataka Goto.
Japanese version is here.
We have developed a robust method for estimating 
 the fundamental frequency (F0) of melody and bass lines
 in monaural real-world musical audio signals
 containing sounds of various instruments.
Most previous F0-estimation methods
 had great difficulty dealing with
 such complex audio signals
 because they were designed to deal with mixtures of only a few sounds.
To make it possible to estimate the F0 of the melody and bass lines,
 we propose a predominant-F0 estimation method called PreFEst
 that does not rely on the F0's unreliable frequency component and
 obtains the most predominant F0
 supported by harmonics within an intentionally limited frequency range.
It evaluates the relative dominance of every possible F0
 by using the Expectation-Maximization algorithm and
 considers the temporal continuity of F0s
 by using a multiple-agent architecture.
Experimental results show that our real-time system can detect
 the melody and bass lines in audio signals sampled from
 commercially distributed compact discs.
 
References:
-  Masataka Goto:
	A Real-time Music Scene Description System:
	Predominant-F0 Estimation for Detecting Melody and Bass Lines
	in Real-world Audio Signals,
	Speech Communication (ISCA Journal), Vol.43, No.4, pp.311-329,
	September 2004.
	
	
	
 -  Masataka Goto:
	A Predominant-F0 Estimation Method for Polyphonic Musical Audio Signals,
	Proceedings of
	the 18th International Congress on Acoustics
	(ICA 2004),
	pp.II-1085-1088, April 2004.
	(Invited Paper)
	
	
	
 -  Masataka Goto,
	A Predominant-F0 Estimation Method for
	Real-world Musical Audio Signals:
	MAP Estimation for Incorporating Prior Knowledge
	about F0s and Tone Models,
	Proceedings of CRAC-2001
	(Workshop on Consistent & Reliable Acoustic Cues for Sound Analysis),
	September 2001.
	
	
	
 -  Masataka Goto:
	A Predominant-F0 Estimation Method for CD Recordings:
	MAP Estimation using EM Algorithm for Adaptive Tone Models,
	Proceedings of
	the 2001 IEEE International Conference on Acoustics, Speech, and
	Signal Processing
	(ICASSP 2001),
	pp.V-3365-3368, May 2001.
	
	
	
 -  Masataka Goto:
	F0 Estimation of Melody and Bass Lines
	 in Musical Audio Signals,
	The Transactions of the Institute of Electronics,
	Information and Communication Engineers D-II,
	Vol.J84-D-II, No.1, pp.12-22, January 2001. (in Japanese)
 -  Masataka Goto:
	A Real-time Music Scene Description System:
	System Overview and Extension of F0 Estimation Method,
	Information Processing Society of Japan SIG Notes,
	2000-MUS-37-2, Vol.2000, No.94, pp.9-16, October 2000 (in Japanese).
 -  Masataka Goto:
	A Robust Predominant-F0 Estimation Method
	for Real-time Detection of Melody and Bass Lines in CD Recordings,
	Proceedings of
	the 2000 IEEE International Conference on Acoustics, Speech, and
	Signal Processing
	(ICASSP 2000),
	pp.II-757-760, June 2000.
	
	
	
 -  Masataka Goto:
	Music Scene Description: A Predominant-F0 Estimation Method
	for Detecting Melody and Bass Lines,
	Proceedings of the Seventh Meeting of
	Special Interest Group on AI Challenges, SIG-Challenge-9907-8,
	pp.45-52, November 1999.
 -  Masataka Goto and Satoru Hayamizu:
	A Real-time Music Scene Description System:
	Detecting Melody and Bass Lines in Audio Signals,
	Working Notes of 
	the IJCAI-99 Workshop on Computational Auditory Scene Analysis,
	pp.31-40, August 1999.
	
	
	
 -  Masataka Goto:
	F0 Estimation of Melody and Bass Lines
	in Real-world Musical Audio Signals,
	Information Processing Society of Japan SIG Notes,
	99-MUS-31-16, Vol.99, No.68, August 1999 (in Japanese).
	
	
	
 
Back to:
Please E-mail comments and questions to
 Masataka GOTO
<m.goto [at] aist.go.jp>
last update: May 11, 2000