SmartMusicKIOSK: Music Listening Station with Chorus-Search Function

Snapshot of SmartMusicKIOSK (RWC-MDB-P-2001 No. 18)

This project is proposed and researched by Masataka Goto.

Japanese version is here.
Japanese version

box Introduction:

SmartMusicKIOSK is a new music-playback interface for trial listening. In music stores, customers typically search out the chorus or ``hook'' of a song by repeatedly pressing the fast-forward button, rather than passively listening to the music. This activity is not well-supported by current technology. This research achieves a function for jumping to the chorus section and other key parts of a song, plus a function for visualizing song structure. These functions eliminate the hassle of searching for the chorus and make it easier for a listener to find desired parts of a song, thereby facilitating an active listening experience. This interface, which enables a listener to look for a section of interest by interactively changing the playback position, is useful not only for trial listening but also for more general purposes in selecting and using music. The proposed functions are achieved through an automatic audio-based chorus-section detection method, and the results of implementing them in a listening station have demonstrated their usefulness. While entire songs of no interest to the listener can be skipped on conventional music-playback interfaces, SmartMusicKIOSK is the first interface that allows the user to easily skip sections of no interest even within a song.

SmartMusicKIOSK implemented on a tablet PC
SmartMusicKIOSK implemented on a tablet PC.


box Background:

When ``trial listening'' to prerecorded music on compact discs (CDs) at a music store, a listener often takes an active role in the playback of musical pieces or songs by picking out only those sections of interest. This new type of music interaction differs from passive music appreciation in which people usually listen to entire musical selections. To give some background, music stores in recent years have installed music listening stations to allow customers to listen to CDs on a trial basis to facilitate a purchasing decision. In general, the main objective of listening to music is to appreciate it, and it is common for a listener to play a musical selection from start to finish. In trial listening, however, the objective is to quickly determine whether a selection is the music one has been looking for and whether one likes it, so listening to entire selections in the above manner is rare. In the case of popular music, for example, customers often want to listen to the most representative, uplifting part of a song, i.e., the chorus or refrain, to pass judgment on that song. This desire produces a special way of listening in which the trial listener first listens briefly to a song's ``intro'' and then jumps ahead in search of the chorus by repeatedly pushing the fast-forward button, eventually finding the chorus and listening to it.

The functions provided by conventional listening stations for music CDs, however, do not support this unique way of trial listening very well. These listening stations are equipped with playback-operation buttons typical of an ordinary CD player, and among these, only the fast-forward and rewind buttons can be used to find the chorus section of a song. On the other hand, the digital listening stations that have recently been installed in music stores enable playback of musical selections from a hard disk or over a network. Here, however, only one part (e.g., the beginning) of each musical selection (an interval of about 30 to 45 seconds) is mechanically excerpted and stored, which means that a trial listener may not necessarily hear the chorus section.

Against the above background, I developed SmartMusicKIOSK, in which a trial listener can jump to the beginning of a song's chorus (perform an instantaneous fast-forward to the chorus) by simply pushing the button for this function.


box Interface:

Snapshot of SmartMusicKIOSK (RWC-MDB-P-2001 No. 18)
SmartMusicKIOSK screen display.

For music that would normally not be understood unless some time was taken for listening, the problem here is how to enable changing between specific playback positions before actual listening. To solve this problem, I propose the following two methods assuming the main target to be popular music.

  1. ``Jump to chorus'' function: Automatic jumping to the beginning of sections relevant to a song's structure (lower window)
    The structure of a song is automatically analyzed beforehand and functions are provided enabling automatic jumping to sections that would be of interest to listeners. These functions are ``jump to chorus (NEXT CHORUS button),'' ``jump to previous section in song (PREV SECTION button),'' and ``jump to next section in song (NEXT SECTION button).'' With these functions, a listener can directly jump to and listen to chorus sections, or jump to the previous or next repeated section of the song.
  2. ``Music map'' function: Visualization of song contents (upper window)
    A function is provided to enable the contents of a song to be visualized to help the listener decide where to jump next. Specifically, this function provides a visual representation of the song's structure consisting of chorus sections and repeated sections (results of automatic chorus-section detection). In the above figure, the horizontal axis is the time axis covering the entire song; the top row shows chorus sections, the five lower rows show repeated sections, and the bottom thin horizontal bar is a playback slider. On each row, colored sections indicate similar (repeated) sections. For example, the bottom row with two short colored sections indicates the similarity between the ``intro'' and ``ending'' of this song.


box Chorus-Section Detection:

To enable the handling of a large number of songs, this research aims for a general and robust chorus-section detection method using no prior information on acoustic features unique to choruses. To this end, I focus on the fact that chorus sections are usually the most repeated sections of a song and adopt the following basic strategy: find sections that repeat and output those that appear most often. It must be pointed out, however, that it is difficult for a computer to judge repetition because it is rare for repeated sections to be exactly the same.

I therefore developed a method that overcomes this difficulty and automatically detects the beginning and end points of chorus sections and repeated sections in compact-disc recordings of popular music. Most previous methods detected as a chorus a repeated section of a given length and had difficulty in identifying both ends of a chorus section and in dealing with modulations (key changes). By analyzing relationships between various repeated sections, my method can detect all the chorus sections in a song and estimate both ends of each section. It can also detect modulated chorus sections by introducing a similarity measure that enables modulated repetition to be judged correctly. Experimental results with a popular-music database show that this method detects the correct chorus sections in 80 of 100 songs.


box Video Clips:


box Screen Snapshots:

"NEXT CHORUS" button (jump to chorus)
Snapshot of SmartMusicKIOSK (RWC-MDB-P-2001 No. 18)
(A-1) When a user pushes the "PLAY" button, SmartMusicKIOSK starts playing.
Snapshot of SmartMusicKIOSK (RWC-MDB-P-2001 No. 18)
(A-2) SmartMusicKIOSK keeps playing.
Snapshot of SmartMusicKIOSK (RWC-MDB-P-2001 No. 18)
(A-3) When the user pushes the "NEXT CHORUS" button, SmartMusicKIOSK jumps to the start of the next chorus section in the song from the present cursor position.
Snapshot of SmartMusicKIOSK (RWC-MDB-P-2001 No. 18)
(A-4) When the user pushes the "NEXT CHORUS" button again, it jumps to the chorus section next to the previous one.
Snapshot of SmartMusicKIOSK (RWC-MDB-P-2001 No. 18)
(A-5) When the user pushes the "NEXT CHORUS" button again, it jumps to the chorus section next to the previous one.
Snapshot of SmartMusicKIOSK (RWC-MDB-P-2001 No. 18)
(A-6) When the user pushes the "NEXT CHORUS" button after the final chorus section, it returns to the first chorus section.

"NEXT SECTION" button (jump to next section in song)
Snapshot of SmartMusicKIOSK (RWC-MDB-P-2001 No. 18)
(B-1) When a user pushes the "PLAY" button, SmartMusicKIOSK starts playing.
Snapshot of SmartMusicKIOSK (RWC-MDB-P-2001 No. 18)
(B-2) SmartMusicKIOSK keeps playing.
Snapshot of SmartMusicKIOSK (RWC-MDB-P-2001 No. 18)
(B-3) When the user pushes the "NEXT SECTION" button, SmartMusicKIOSK jumps to the start of the next repeated section in the song from the present cursor position.
Snapshot of SmartMusicKIOSK (RWC-MDB-P-2001 No. 18)
(B-4) When the user pushes the "NEXT SECTION" button again, it jumps to the repeated section next to the previous one.
Snapshot of SmartMusicKIOSK (RWC-MDB-P-2001 No. 18)
(B-5) When the user pushes the "NEXT SECTION" button again, it jumps to the repeated section next to the previous one.

"PREV SECTION" button (jump to previous section in song)
Snapshot of SmartMusicKIOSK (RWC-MDB-P-2001 No. 18)
(C-1) When a user pushes the "PLAY" button, SmartMusicKIOSK starts playing.
Snapshot of SmartMusicKIOSK (RWC-MDB-P-2001 No. 18)
(C-2) SmartMusicKIOSK keeps playing.
Snapshot of SmartMusicKIOSK (RWC-MDB-P-2001 No. 18)
(C-3) When the user pushes the "PREV SECTION" button, SmartMusicKIOSK jumps to the start of the previous repeated section in the song from the present cursor position.
Snapshot of SmartMusicKIOSK (RWC-MDB-P-2001 No. 18)
(C-4) When the user pushes the "PREV SECTION" button again, it jumps to the previous repeated section.


box Conclusion:

``SmartMusicKIOSK makes the music listening experience more active and interactive.''

The main contribution of this research is to propose a novel music-playback interface SmartMusicKIOSK, considering that conventional playback-operation buttons on CD players or media-player software have not been improved for a long time. One of the innovations brought by CD players is to enable a listener to immediately skip a song (track) of no interest --- i.e., ``listen to any track of a CD whenever one likes.'' I believe that the SmartMusicKIOSK brings a similar innovation at a different level: it enables a listener to immediately skip a structural section (part) of no interest --- i.e., ``listen to any part of a song whenever one likes'' without having to follow the timeline of the original song. I hope this research opens up new vistas for future research that reexamines the entire functional makeup of music-playback interfaces to make interaction between people and music more active and enriching.


box Acknowledgments:

This research utilized the RWC Music Database "RWC-MDB-P-2001" (Popular Music). The author would like to thank Hideki Asoh (National Institute of Advanced Industrial Science and Technology) for his valuable discussions.


References:

  1. Masataka Goto: A Chorus-Section Detection Method for Musical Audio Signals and Its Application to a Music Listening Station, IEEE Transactions on Audio, Speech and Language Processing, Vol.14, No.5, pp.1783-1794, September 2006.
    PDF (IEEE server)
  2. Masataka Goto, SmartMusicKIOSK: Music-playback interface based on chorus-section detection method, The Journal of the Acoustical Society of America, Vol.115, No.5, Pt.2, p.2494, May 2004. (Invited Paper of The 147th Meeting of the Acoustical Society of America)
  3. Masataka Goto, Lay Language Paper of ``SmartMusicKIOSK: Music Listening Station with Chorus-Search Function'', invited for the Acoustical Society of America 147th Meeting Lay Language Papers and News Conferences, May 2004.
  4. Masataka Goto: SmartMusicKIOSK: Music Listening Station with Chorus-Search Function, Proceedings of the 16th Annual ACM Symposium on User Interface Software and Technology (UIST 2003), pp.31-40, November 2003.
    PDF
  5. Masataka Goto: A Chorus-Section Detecting Method for Musical Audio Signals, Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2003), pp.V-437-440, April 2003.
    PDF
  6. Masataka Goto: SmartMusicKIOSK: Music Listening Station with Chorus-Search Function, Transactions of Information Processing Society of Japan, Vol.44, No.11, pp.2737-2747, November 2003. (in Japanese)
  7. Masataka Goto: SmartMusicKIOSK: Music Listening Station with Chorus-Search Function, Proceedings of Interaction 2003 of IPSJ (Information Processing Society of Japan), pp.9-16, February 2003. (in Japanese)
  8. Masataka Goto: A Real-time Music Scene Description System: A Chorus-Section Detecting Method, IPSJ (Information Processing Society of Japan) SIG Notes, 2002-MUS-47-6, Vol.2002, No.100, pp.27-34, October 2002. (in Japanese)
    PDF

box Awards:


Reports on Newspaper, Television, and Magazine outside JAPAN

  1. The SmartMusicKIOSK was reported in an article "How to listen to old records in the 21st century" on the on-line news site "Inside Science News Service (ISNS)" published by the American Institute of Physics (2004/04/16)
  2. The SmartMusicKIOSK was reported in an article "Smart music system skips to chorus" on the on-line news site "Nature Science Update" published by the Nature Publishing Group (2004/05/28)

Reports on Newspaper, Television, and Magazine in JAPAN

  1. The SmartMusicKIOSK was reported in an article by Kaoru Misaki on the Japanese on-line news site "ITmedia News" (2003/03/06)
  2. The SmartMusicKIOSK was reported in an article by Yutaka Kobayashi on the Japanese on-line news site "ITmedia News" (2003/03/06)
  3. The SmartMusicKIOSK was reported in the Japanese magazine "Robocon Magazine" (2003/04/14)
  4. The SmartMusicKIOSK was reported in the morning edition of the Japanese daily newspaper "Nihon Keizai Shimbun" (2003/08/01)
  5. The SmartMusicKIOSK was reported in the Japanese newspaper "Nikkei Ryutsu Shimbun (Nikkei Marketing Journal)" (2003/09/09)
  6. The SmartMusicKIOSK was reported in an article "Smart music system skips to chorus" on the Japanese on-line news site "Nature BioNews" (2004/06/10)

box Back to:


Masataka GOTO <m.goto [at] aist.go.jp>
All pages are copyrighted by the author. Unauthorized reproduction is strictly prohibited.

last update: June 2, 2004