[ English | Japanese ]


VocaWatcher: Natural Singing Motion Generator for a Humanoid Robot

Shuuji Kajita / Tomoyasu Nakano / Masataka Goto / Yosuke Matsusaka / Shin'ichiro Nakaoka / Kazuhito Yokoi
National Institute of Advanced Industrial Science and Technology (AIST)

Email: vocawatcher [at] m.aist.go.jp


Demonstration: YouTube


Abstract:

This research presents VocaWatcher, a novel robot motion generator that enables a humanoid robot to sing a song with realistic facial expressions as well as natural synthesized singing voices. Such a robot singer is important not only for showing an attractive humanoid robot application in an entertainment scene, but also for promoting the state-of-theart integration of robot engineering, music processing, and image processing. To overcome difficulties generating natural facial expressions precisely synchronized with singing voices, VocaWatcher imitates a human singer by analyzing a video clip of the human singing recorded by a single video camera. It can control mouth, eye, and neck motions by imitating the corresponding human movements estimated without any markers in the video, and also synthesize singing voices by imitating the pitch and dynamics of the human singing in the same video.

VocaWatcher
Fig 1. Overview of VocaWatcher.

Demonstrations:

VocaWatcher Demo

VocaWatcher Demonstration with Detailed Explanations
Song PROLOGUE (RWC-MDB-P-2001 No.7)
Target human singer Sariyajin NOTE: The linked page is written in Japanese.
Humanoid robot HRP-4C
Singing synthesizer Hatsune Miku (Vocaloid2)




VocaWatcher Demo

VocaWatcher Demonstration
Song Packaged (Lyrics/Music/Arrange: kz) NOTE: The linked page is written in Japanese.
Target human singer Sariyajin NOTE: The linked page is written in Japanese.
Humanoid robot HRP-4C
Singing synthesizer Megpoid (Vocaloid2)




VocaWatcher Demo

VocaWatcher Demonstration
Song Ouro (Lyrics/Music/Arrange: luschka) NOTE: The linked page is written in Japanese.
Target human singer luschka NOTE: The linked page is written in Japanese.
Humanoid robot HRP-4C
Singing synthesizer VY1 (Vocaloid2)


Acknowledgments:

This research utilized the RWC Music Database "RWC-MDB-P-2001" (Popular Music). In our current implementation, VocaWatcher uses commercial singing synthesis software based on Yamaha's Vocaloid2 technology.

We thank to members of the Humanoid Robotics Group of AIST, especially, Kanako Miura for her help and advice at an early stage experiment, and Kenta Yonekura for his work of choreographing HRP-4C's arm motions. We also thank to Yoshio Mastumoto for his helpful advice. Finally, we gratefully acknowledge the support of Hirohisa Hirukawa and Satoshi Sekiguchi, directors of Intelligent Systems Research Institute and Information Technology Research Institute of AIST, respectively.


References:

  1. Masataka Goto, Tomoyasu Nakano, Shuuji Kajita, Yosuke Matsusaka, Shin'ichiro Nakaoka, and Kazuhito Yokoi:
    "VocaListener and VocaWatcher: Imitating a Human Singer by Using Signal Processing",
    In Proceedings of the 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012),
    pp.5393-5396, March 2012.
    [PDF]
  2. Shuuji Kajita, Tomoyasu Nakano, Masataka Goto, Yosuke Matsusaka, Shin'ichiro Nakaoka, and Kazuhito Yokoi,
    "VocaWatcher: Natural Singing Motion Generator for a Humanoid Robot,"
    In Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2011),
    pp.2000-2007, September 2011.
    [PDF][DOI (IEEE Xplore)]

The copyright of each publication is retained by the corresponding academic society (copyright holder). Copyright and Consent Form (IEEE).


Shuuji Kajita, Tomoyasu Nakano, Masataka Goto, Yosuke Matsusaka, Shin'ichiro Nakaoka, Yoshio Matsumoto, and Kazuhito Yokoi.

Copyright (C) 2011 by National Institute of Advanced Industrial Science and Technology (AIST).