Masataka Goto: List of Research Projects (Including Collaborative Research)

The following show only major publications of each research project. The web page of "Publications" shows all publications.

Japanese version is here.
Japanese version

box Music Information Processing:

Understanding Musical Audio Signals (Real-time Music Scene Description System)

  1. Sound Source Identification of Drum Sounds in Drum Solo Performances (Masataka Goto, Yoichi Muraoka) [1992-1993]
    Masataka Goto, Yoichi Muraoka: A Sound Source Separation System for Percussion Instruments, The Transactions of the Institute of Electronics, Information and Communication Engineers D-II, Vol.J77-D-II, No.5, pp.901-911, May 1994. (in Japanese)
  2. Beat Tracking for CD Recordings (Masataka Goto, Yoichi Muraoka) [1993-1998]
    Masataka Goto: An Audio-based Real-time Beat Tracking System for Music With or Without Drum-sounds, Journal of New Music Research, Vol.30, No.2, pp.159-171, June 2001.
    ( Best Paper Award for Young Researchers of The 1998 Kansai-Section Joint Convention of Institutes of Electrical Engineering, Japan, 1999. )
  3. F0 Estimation for Bass Solo Performances (Yoshiaki Kikuchi, Masataka Goto, Yoichi Muraoka) [1995-1996]
    Yoshiaki Kikuchi, Masataka Goto, Yoichi Muraoka: A automatic transcription system for bass guiter, Proceedings of the 52th Annual Convention IPS Japan, 5Z-7, March 1996. (in Japanese)
  4. Real-time Music Scene Description System (Masataka Goto) [1998-]
    Masataka Goto: Music Scene Description Project: Toward Audio-based Real-time Music Understanding, Proceedings of the 4th International Conference on Music Information Retrieval (ISMIR 2003), pp.231-232, October 2003.
    Masataka Goto: A Real-time Music-scene-description System: Predominant-F0 Estimation for Detecting Melody and Bass Lines in Real-world Audio Signals, Speech Communication (ISCA Journal), Vol.43, No.4, pp.311-329, September 2004.
  5. F0 Estimation of Melody and Bass Lines in CD Recordings (Masataka Goto) [1998-]
    Masataka Goto: A Real-time Music-scene-description System: Predominant-F0 Estimation for Detecting Melody and Bass Lines in Real-world Audio Signals, Speech Communication (ISCA Journal), Vol.43, No.4, pp.311-329, September 2004.
  6. Sound Source Identification of Musical Instruments (Tetsuro Kitahara, Masataka Goto, Hiroshi G. Okuno) [2000-]
    Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, and Hiroshi G. Okuno: Instrument Identification in Polyphonic Music: Feature Weighting to Minimize Influence of Sound Overlaps, EURASIP Journal on Advances in Signal Processing, Vol.2007, Article ID 51979, 15 pages, 2007.
    Tetsuro Kitahara, Masataka Goto, and Hiroshi G. Okuno: Pitch-dependent Identification of Musical Instrument Sounds, Applied Intelligence (The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies) Vol.23, No.3, pp.267-275, December 2005.
  7. Chord Name Identification for CD Recordings (Hiroko Yamada, Masataka Goto, Hiroshi Saruwatari, Kiyohiro Shikano) [2001-2003]
    Hiroko Yamada, Masataka Goto, Hiroshi Saruwatari, and Kiyohiro Shikano: Multi-Timbre Chord Classification Method for Musical Audio Signals: Application to Musical Pieces, Proceeding of the 2003 Spring Meeting of the Acoustical Society of Japan, pp.835-836, March 2003. (in Japanese)
  8. Chorus Section Detection for CD Recordings (Masataka Goto) [2002-]
    Masataka Goto: A Chorus-Section Detection Method for Musical Audio Signals and Its Application to a Music Listening Station, IEEE Transactions on Audio, Speech and Language Processing, Vol.14, No.5, pp.1783-1794, September 2006.
    Masataka Goto: A Chorus-Section Detecting Method for Musical Audio Signals, Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2003), pp.V-437-440, April 2003.
  9. Sound Source Identification of Drum Sounds in CD Recordings (Kazuyoshi Yoshii, Masataka Goto, Hiroshi G. Okuno) [2003-]
    Kazuyoshi Yoshii, Masataka Goto, and Hiroshi G. Okuno: Automatic Drum Sound Description for Real-World Music Using Template Adaptation and Matching Methods, Proceedings of the 5th International Conference on Music Information Retrieval (ISMIR 2004), pp.184-191, October 2004.
    Kazuyoshi Yoshii, Masataka Goto, and Hiroshi G. Okuno: Drum Sound Recognition for Polyphonic Audio Signals by Adaptation and Matching of Spectrogram Templates with Harmonic Structure Suppression, IEEE Transactions on Audio, Speech, and Language Processing, Vol.15, No.1, pp.333-345, January 2007.
    Kazuyoshi Yoshii, Masataka Goto, and Hiroshi G. Okuno: AdaMast: A Drum Sound Recognizer based on Adaptation and Matching of Spectrogram Templates, Proceedings of the 2nd Music Information Retrieval Evaluation eXchange (MIREX 2005), September 2005.
  10. Instrogram: Probabilistic Representation of Instrument Existence for Polyphonic Music (Tetsuro Kitahara, Masataka Goto, Hiroshi G. Okuno) [2005-]
    Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, and Hiroshi G. Okuno: Instrogram: A New Musical Instrument Recognition Technique Without Using Onset Detection Nor F0 Estimation, Proceedings of the 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), pp.V-229-232, May 2006.
    Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, and Hiroshi G. Okuno: Instrogram: Probabilistic Representation of Instrument Existence for Polyphonic Music, IPSJ (Information Processing Society of Japan) Journal, Vol.48, No.1, pp.214-226, January 2007.

Active Music Listening Interface

  1. SmartMusicKIOSK: Music Listening Station with Chorus-Search Function (Masataka Goto) [2002-]
    Masataka Goto: SmartMusicKIOSK: Music Listening Station with Chorus-Search Function, Proceedings of the 16th Annual ACM Symposium on User Interface Software and Technology (UIST 2003), pp.31-40, November 2003.
    Masataka Goto: A Chorus-Section Detection Method for Musical Audio Signals and Its Application to a Music Listening Station, IEEE Transactions on Audio, Speech and Language Processing, Vol.14, No.5, pp.1783-1794, September 2006.
    ( Interaction 2003 Best Paper Award, IPSJ (The Information Processing Society of Japan) Symposium, 2003. )
    ( IPSJ Best Paper Award, IPSJ (The Information Processing Society of Japan), 2005. )
  2. Musicream: Music Playback Interface for Streaming, Sticking, Sorting, and Recalling Musical Pieces (Takayuki Goto, Masataka Goto) [2004-]
    Masataka Goto and Takayuki Goto: Musicream: New Music Playback Interface for Streaming, Sticking, Sorting, and Recalling Musical Pieces, Proceedings of the 6th International Conference on Music Information Retrieval (ISMIR 2005), pp.404-411, September 2005.
    ( Interaction 2005 Best Interactive Presentation Award, IPSJ (The Information Processing Society of Japan) Symposium, 2005. )
  3. INTER:D and Drumix: Audio Player With Real-Time Drum Part Editing Function (Kazuyoshi Yoshii, Masataka Goto, Hiroshi G. Okuno) [2004-]
    Kazuyoshi Yoshii, Masataka Goto, and Hiroshi G. Okuno: INTER:D: A Drum Sound Equalizer for Controlling Volume and Timbre of Drums, Proceedings of the 2nd European Workshop on the Integration of Knowledge, Semantic and Digital Media Technologies (EWIMT2005), pp.205-212, November 2005.
    Kazuyoshi Yoshii, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, and Hiroshi G. Okuno: Drumix: An Audio Player with Real-time Drum-part Rearrangement Functions for Active Music Listening, IPSJ (Information Processing Society of Japan) Journal, Vol.48, No.3, pp.1229-1239, March 2007.
    ( FIT Paper Award, FIT 2004 (Forum on Information Technology), 2004. )
    ( Interaction 2006 Best Interactive Presentation Award, IPSJ (The Information Processing Society of Japan) Symposium, 2006. )
  4. MusicRainbow: User Interface to Discover Artists Using Audio-based Similarity and Web-based Labeling (Elias Pampalk, Masataka Goto) [2006-]
    Elias Pampalk and Masataka Goto: MusicRainbow: A New User Interface to Discover Artists Using Audio-based Similarity and Web-based Labeling, Proceedings of the 7th International Conference on Music Information Retrieval (ISMIR 2006), pp.367-370, October 2006.

Music Information Retrieval / Recommendation

  1. Query by Humming (QBH) System (Song Retrieval by Singing Voices) (Tomonari Sonoda, Tomonori Kaizuka, Masataka Goto, Yoichi Muraoka) [1996-1998]
    Tomonari Sonoda, Masataka Goto, and Yoichi Muraoka: A WWW-based Melody Retrieval System, Proceedings of the 1998 International Computer Music Conference, pp.349-352, October 1998.
  2. Query by Humming (QBH) System (Music Signal Spotting by Singing Voices) (Takuichi Nishimura, Hiroki Hashiguchi, Junko Takita, Masataka Goto, Ryuichi Oka) [2000-2001]
    Takuichi Nishimura, Hiroki Hashiguchi, Junko Takita, J. Xin Zhang, Masataka Goto, and Ryuichi Oka: Music Signal Spotting Retrieval by a Humming Query Using Start Frame Feature Dependent Continuous Dynamic Programming, Proceedings of the 2nd Annual International Symposium on Music Information Retrieval (ISMIR 2001), pp.211-218, October 2001.
  3. Speech-Recognition Interfaces for Music Information Retrieval (Masataka Goto, Katunobu Itou, Koji Kitayama, Tetsunori Kobayashi) [2000-]
    Masataka Goto, Katunobu Itou, Koji Kitayama, and Tetsunori Kobayashi: Speech-Recognition Interfaces for Music Information Retrieval: ``Speech Completion'' and ``Speech Spotter'', Proceedings of the 5th International Conference on Music Information Retrieval (ISMIR 2004), pp.403-408, October 2004.
  4. Drum Pattern Retrieval by Voice Percussion (Tomoyasu Nakano, Jun Ogata, Masataka Goto, Yuzuru Hiraga) [2003-]
    Tomoyasu Nakano, Jun Ogata, Masataka Goto, and Yuzuru Hiraga: A Drum Pattern Retrieval Method by Voice Percussion, Proceedings of the 5th International Conference on Music Information Retrieval (ISMIR 2004), pp.550-553, October 2004.
  5. Hybrid Collaborative and Content-based Music Recommendation Using Probabilistic Model (Kazuyoshi Yoshii, Masataka Goto, Hiroshi G. Okuno) [2006-]
    Kazuyoshi Yoshii, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, and Hiroshi G. Okuno: Hybrid Collaborative and Content-based Music Recommendation Using Probabilistic Model with Latent User Preferences, Proceedings of the 7th International Conference on Music Information Retrieval (ISMIR 2006), pp.296-301, October 2006.
    Kazuyoshi Yoshii, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, and Hiroshi G. Okuno: An Efficient Hybrid Music Recommender System Using an Incrementally Trainable Probabilistic Generative Model, IEEE Transactions on Audio, Speech, and Language Processing, Vol.16, No.2, pp.435-447, February 2008.

Singing Voice Processing

  1. Lyrics Recognition for Solo Singing Voices (Akira Sasou, Masataka Goto, Satoru Hayamizu, Kazuyo Tanaka, Hironao Ozeki, Takayuki Kamata) [2002-]
    Akira Sasou, Masataka Goto, Satoru Hayamizu, Kazuyo Tanaka: An Auto-Regressive, Non-Stationary Excited Signal Parameter Estimation Method and an Evaluation of a Singing-Voice Recognition, Proceedings of the 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), pp.I-237-240, March 2005.
  2. Discrimination between Singing and Speaking Voices (Yasunori Ohishi, Masataka Goto, Katunobu Itou, Kazuya Takeda) [2004-]
    Yasunori Ohishi, Masataka Goto, Katunobu Itou, and Kazuya Takeda: Discrimination between Singing and Speaking Voices, Proceedings of the 9th European Conference on Speech Communication and Technology (Eurospeech 2005), pp.1141-1144, September 2005.
    Yasunori Ohishi, Masataka Goto, Katunobu Itou, and Kazuya Takeda: On Human Capability and Acoustic Cues for Discriminating Singing and Speaking Voices, Proceedings of the 9th International Conference on Music Perception and Cognition (ICMPC 2006), pp.1831-1837, August 2006.
  3. Singer Identification of Singing Voices in CD Recordings (Hiromasa Fujihara, Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno) [2004-]
    Hiromasa Fujihara, Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, and Hiroshi G. Okuno: Singer Identification Based on Accompaniment Sound Reduction and Reliable Frame Selection, Proceedings of the 6th International Conference on Music Information Retrieval (ISMIR 2005), pp.329-336, September 2005.
    Hiromasa Fujihara, Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, and Hiroshi G. Okuno: F0 Estimation Method for Singing Voice in Polyphonic Audio Signal Based on Statistical Vocal Model and Viterbi Search, Proceedings of the 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), pp.V-253-256, May 2006.
  4. Automatic Singing Skill Evaluation of Solo Singing Voices (Tomoyasu Nakano, Masataka Goto, Yuzuru Hiraga) [2005-]
    Tomoyasu Nakano, Masataka Goto, and Yuzuru Hiraga: Subjective Evaluation of Common Singing Skills Using the Rank Ordering Method, Proceedings of the 9th International Conference on Music Perception and Cognition (ICMPC 2006), pp.1507-1512, August 2006.
    Tomoyasu Nakano, Masataka Goto, and Yuzuru Hiraga: An Automatic Singing Skill Evaluation Method for Unknown Melodies Using Pitch Interval Accuracy and Vibrato Features, Proceedings of the 9th International Conference on Spoken Language Processing (Interspeech 2006 - ICSLP), pp.1706-1709, September 2006.
  5. Automatic Synchronization between Lyrics and Vocal in CD Recordings (Hiromasa Fujihara, Masataka Goto, Jun Ogata, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno) [2006-]
    Hiromasa Fujihara, Masataka Goto, Jun Ogata, Kazunori Komatani, Tetsuya Ogata, and Hiroshi G. Okuno: Automatic Synchronization between Lyrics and Music CD Recordings Based on Viterbi Alignment of Segregated Vocal Signals, Proceedings of the IEEE International Symposium on Multimedia (ISM 2006), pp.257-264, December 2006.

Interactive Systems

  1. Automatic Jazz Accompaniment System Reacting to Solo (Isao Hidaka, Masataka Goto, Yoichi Muraoka) [1993-1995]
    Isao Hidaka, Masataka Goto, and Yoichi Muraoka: An Automatic Jazz Accompaniment System Reacting to Solo, Proceedings of the 1995 International Computer Music Conference, pp.167-170, September 1995.
  2. VirJa Session: Virtual Jazz Session System (Masataka Goto, Isao Hidaka, Hideaki Matsumoto, Yosuke Kuroda, Yoichi Muraoka) [1995-1998]
    Masataka Goto, Isao Hidaka, Hideaki Matsumoto, Yosuke Kuroda, and Yoichi Muraoka: A Jazz Session System for Interplay among All Players --- VirJa Session (Virtual Jazz Session System) ---, Proceedings of the 1996 International Computer Music Conference, pp.346-349, August 1996.
    ( IPSJ SIG Research Award (SIGMUS: The Special Interest Group on MUSic and computer), IPSJ (The Information Processing Society of Japan), 1997. )
  3. Cindy: Interactive Performance of Music-controlled CG Dancer (Masataka Goto, Yoichi Muraoka) [1995-1997]
    Masataka Goto and Yoichi Muraoka: A Virtual Dancer "Cindy" --- Interactive Performance of a Music-controlled CG Dancer ---, Proceedings of the Lifelike Computer Characters '96, p.65, October 1996.
    Masataka Goto and Yoichi Muraoka: Interactive Performance of a Music-danced CG Dancer, Proceedings of the 3rd Workshop on Interactive Systems and Software 1995 (WISS '95), pp.9-18, December 1995. (in Japanese)
  4. Herbie-kun: Jazz Chord Reharmonizer in Deductive Object-oriented Framework (Masataka Goto, Keiji Hirata) [1996]
    Masataka Goto and Keiji Hirata: Herbie-kun: A Jazz Chord Reharmonizer in a Deductive Object-oriented Framework, Information Processing Society of Japan SIG Notes (Technical Report), 96-MUS-16-6, Vol.96, No.75, July 1996. (in Japanese)
  5. Guitarist Simulator: Learning-Based Jam Session System Imitating Player's Personality Model (Masatoshi Hamanaka, Masataka Goto, Hideki Asoh, Nobuyuki Otsu) [1999-2004]
    Masatoshi Hamanaka, Masataka Goto, Hideki Asoh, and Nobuyuki Otsu: A Learning-Based Jam Session System that Imitates a Player's Personality Model, Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence (IJCAI-03), pp.51-58, August 2003.
    Masatoshi Hamanaka, Masataka Goto, and Nobuyuki Otsu: Learning-Based Jam Session System for a Guitar Trio, Proceedings of the 2001 International Computer Music Conference, pp.467-470, September 2001.
  6. Quantization Method for MIDI Performances (Masatoshi Hamanaka, Masataka Goto, Hideki Asoh, Nobuyuki Otsu) [2000-2002]
    Masatoshi Hamanaka, Masataka Goto, Hideki Asoh, and Nobuyuki Otsu: A Learning-Based Quantization: Unsupervised Estimation of the Model Parameters, Proceedings of the 2003 International Computer Music Conference (ICMC 2003), pp.369-372, October 2003.
  7. Mix-Down Assistant Interface with Reuse of Examples (Akio Yatsui, Masataka Goto, Haruhiro Katayose) [2003-]
    Haruhiro Katayose, Akio Yatsui, and Masataka Goto: A Mix-Down Assistant Interface with Reuse of Examples, Proceedings of the 1st International Conference on Automated Production of Cross Media Content for Multi-channel Distribution (AXMEDIS 2005), pp.9-16, November 2005.
    ( FIT Paper Award, FIT 2003 (Forum on Information Technology), 2003. )
  8. Voice Drummer: Music Notation Interface of Drum Sounds Using Voice Percussion Input (Tomoyasu Nakano, Masataka Goto, Jun Ogata, Yuzuru Hiraga) [2004-]
    Tomoyasu Nakano, Masataka Goto, Jun Ogata, and Yuzuru Hiraga: Voice Drummer: A Music Notation Interface of Drum Sounds Using Voice Percussion Input, Proceedings of the 18th Annual ACM Symposium on User Interface Software and Technology (UIST 2005), pp.49-50, October 2005. (Demos)

Network

  1. RMCP: Musical Information Processing based on Remote Music Control Protocol (Masataka Goto, Ryo Neyama, Yoichi Muraoka) [1991-]
    Masataka Goto, Ryo Neyama, and Yoichi Muraoka: RMCP: Remote Music Control Protocol --- Design and Applications ---, Proceedings of the 1997 International Computer Music Conference, pp.446-449, September 1997.
    ( Best Paper Award of The jus 10th Anniversary International UNIX Symposium, jus (Japan UNIX Society), 1992. )
    ( NICOGRAPH Best Paper Award of The CG Education Symposium, NICOGRAPH (Nippon Computer Graphics Conference), 1993. )
  2. VirJa Session on WWW: Jazz Session System With Virtual Players Downloaded from WWW (Ryo Neyama, Masataka Goto, Yoichi Muraoka) [1996-1998]
    Ryo Neyama, Masataka Goto, Yoichi Muraoka: Toward VirJa Session on WWW, Proceedings of the 54th Annual Convention IPS Japan, 7J-05, March 1997. (in Japanese)
  3. Mug: Expandable Multiuser Framework for Java (Ryo Neyama, Masataka Goto, Tomonari Sonoda, Yoichi Muraoka) [1998-1999]
    Ryo Neyama, Masataka Goto, Tomonari Sonoda, Yoichi Muraoka: Mug: An Expandable Multiuser Framework for Java, Proceedings of the 58th Annual Convention IPS Japan, 5M-05, March 1999. (in Japanese)
  4. Open RemoteGIG: Open-to-the-Public Distributed Session System Overcoming Network Latency (Masataka Goto, Ryo Neyama) [1997-2002]
    Masataka Goto and Ryo Neyama: Open RemoteGIG: An Open-to-the-Public Distributed Session System Overcoming Network Latency, Transactions of Information Processing Society of Japan, Vol.43, No.2, pp.299-309, February 2002. (in Japanese)
  5. Openism: Open-to-the-public Distributed Session System with Melody-correction-based Improvisation Support Function (Yuu Misawa, Yutaka Hosono, Akifumi Nishina, Katsuhisa Ishida, Tetsuro Kitahara, Masataka Goto, Masayuki Takeda) [2004-]
    Yuu Misawa, Yutaka Hosono, Akifumi Nishina, Katsuhisa Ishida, Tetsuro Kitahara, Masataka Goto, and Masayuki Takeda: Openism: An Open-to-the-public Distributed Session System with a Melody-correction-based Improvisation Support Function, Proceedings of the 13th Workshop on Interactive Systems and Software 2005 (WISS 2005), pp.87-92, December 2005. (in Japanese)

Computer Graphics

  1. VirStA System: Distributed Computer Graphics Animation System based on Virtual Stage and Virtual Actors (Masataka Goto, Tetsuya Abe, Hideaki Matsumoto, Yoichi Muraoka) [1995-1998]
    Masataka Goto, Tetsuya Abe, Hideaki Matsumoto, Yoichi Muraoka: VirStA System: A Distributed CG Animation System based on Virtual Stage and Virtual Actors --- I. System Overview, Proceedings of the 53th Annual Convention IPS Japan, 1P-5, September 1996. (in Japanese)
    Tetsuya Abe, Masataka Goto, Hideaki Matsumoto, Yoichi Muraoka: VirStA System: A Distributed CG Animation System based on Virtual Stage and Virtual Actors --- II. Real-time Implementation in Distributed Environment, Proceedings of the 53th Annual Convention IPS Japan, 1P-6, September 1996. (in Japanese)
    Hideaki Matsumoto, Masataka Goto, Tetsuya Abe, Yoichi Muraoka: VirStA System: A Distributed CG Animation System based on Virtual Stage and Virtual Actors --- III. CG Animation of Jazz Session Players, Proceedings of the 53th Annual Convention IPS Japan, 1P-7, September 1996. (in Japanese)
  2. Interactive Dog (Tetsuya Abe, Masataka Goto, Yosuke Kuroda, Yoichi Muraoka) [1995]
    Tetsuya Abe, Masataka Goto, Yosuke Kuroda, and Yoichi Muraoka: An Interactive Dog --- Three Kinds of Interaction with a Virtual Dog ---, Proceedings of the 3rd Workshop on Interactive Systems and Software 1995 (WISS '95), pp.19-28, December 1995. (in Japanese)
  3. Generating Real-time Computer Graphics Animation of Virtual Players (Hideaki Matsumoto, Masataka Goto, Yoichi Muraoka) [1995-1998]
    Hideaki Matsumoto, Masataka Goto, and Yoichi Muraoka: Generating CG Animation of Virtual Players by Musical Performance, Information Processing Society of Japan SIG Notes (Technical Report), 98-CG-89-3, Vol.98, No.16, February 1998. (in Japanese)

Music Databases

  1. RWC Music Database (Masataka Goto, Hiroki Hashiguchi, Takuichi Nishimura, Ryuichi Oka) [2000-]
    Masataka Goto, Hiroki Hashiguchi, Takuichi Nishimura, and Ryuichi Oka: RWC Music Database: Popular, Classical, and Jazz Music Databases, Proceedings of the 3rd International Conference on Music Information Retrieval (ISMIR 2002), pp.287-288, October 2002.
    Masataka Goto, Hiroki Hashiguchi, Takuichi Nishimura, and Ryuichi Oka: RWC Music Database: Music Genre Database and Musical Instrument Sound Database, Proceedings of the 4th International Conference on Music Information Retrieval (ISMIR 2003), pp.229-230, October 2003.
    Masataka Goto: Development of the RWC Music Database, Proceedings of the 18th International Congress on Acoustics (ICA 2004), pp.I-553-556, April 2004. (Invited Paper)
    ( Award for the Best Presentation, JSMPC (The Japanese Society for Music Perception and Cognition), 2002. )
  2. AIST Annotation: Annotation for the RWC Music Database (Masataka Goto) [2001-]
    Masataka Goto: AIST Annotation for the RWC Music Database, Proceedings of the 7th International Conference on Music Information Retrieval (ISMIR 2006), pp.359-360, October 2006.
  3. AIST Humming Database (Masataka Goto, Takuichi Nishimura) [2004-]
    Masataka Goto and Takuichi Nishimura: AIST Humming Database: Music Database for Singing Research, Information Processing Society of Japan SIG Notes (Technical Report), 2005-MUS-61-2, Vol.2005, No.82, pp.7-12, August 2005. (in Japanese)

box Speech Information Processing:

Speech Interface

  1. Speech Completion: On-demand Completion Assistance Using Filled Pauses (Masataka Goto, Katunobu Itou, Tomoyosi Akiba, Satoru Hayamizu) [2000-]
    Masataka Goto, Katunobu Itou, Tomoyosi Akiba, and Satoru Hayamizu: Speech Completion: New Speech Interface with On-demand Completion Assistance, Proceedings of HCI International 2001, Vol.1, pp.198-202, August 2001.
    Masataka Goto, Katunobu Itou, and Satoru Hayamizu: Speech Completion: On-demand Completion Assistance Using Filled Pauses for Speech Input Interfaces, Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP-2002), pp.1489-1492, September 2002.
    ( WISS2000 Best Paper Award, The 8th Workshop on Interactive Systems and Software, 2000. )
    ( WISS2000 Best Presentation Award, The 8th Workshop on Interactive Systems and Software, 2000. )
    ( Awaya Prize (award for outstanding presentations given to promising young researchers, to commemorate Dr. Kiyoshi Awaya), ASJ (The Acoustical Society of Japan), 2001. )
    ( Award for Outstanding Poster Presentation, ASJ (The Acoustical Society of Japan), 2001. )
    ( IPSJ SIG Research Award (SIGSLP: The Special Interest Group on Spoken Language Processing), IPSJ (The Information Processing Society of Japan), 2002. )
  2. Speech Shift: Direct Speech-Input-Mode Switching through Intentional Control of Voice Pitch (Yukihiro Omoto, Masataka Goto, Katunobu Itou, Tetsunori Kobayashi) [2001-]
    Masataka Goto, Yukihiro Omoto, Katunobu Itou, and Tetsunori Kobayashi: Speech Shift: Direct Speech-Input-Mode Switching through Intentional Control of Voice Pitch, Proceedings of the 8th European Conference on Speech Communication and Technology (Eurospeech 2003), pp.1201-1204, September 2003.
  3. Speech Starter: Noise-Robust Endpoint Detection Using Filled Pauses (Koji Kitayama, Masataka Goto, Katunobu Itou, Tetsunori Kobayashi) [2002-]
    Koji Kitayama, Masataka Goto, Katunobu Itou, and Tetsunori Kobayashi: Speech Starter: Noise-Robust Endpoint Detection by Using Filled Pauses, Proceedings of the 8th European Conference on Speech Communication and Technology (Eurospeech 2003), pp.1237-1240, September 2003.
  4. Speech Spotter: On-demand Speech Recognition in Human-to-Human Conversation (Koji Kitayama, Masataka Goto, Katunobu Itou, Tetsunori Kobayashi) [2003-]
    Masataka Goto, Koji Kitayama, Katunobu Itou, and Tetsunori Kobayashi: Speech Spotter: On-demand Speech Recognition in Human-Human Conversation on the Telephone or in Face-to-Face Situations, Proceedings of the 8th International Conference on Spoken Language Processing (ICSLP-2004), pp.1533-1536, October 2004.
  5. Speech Repair: Quick Error Correction Just by Using Selection Operation (Jun Ogata, Masataka Goto) [2004-]
    Jun Ogata and Masataka Goto: Speech Repair: Quick Error Correction Just by Using Selection Operation for Speech Input Interfaces, Proceedings of the 9th European Conference on Speech Communication and Technology (Eurospeech 2005), pp.133-136, September 2005.
    ( WISS2004 Best Paper Award, The 12th Workshop on Interactive Systems and Software, 2004. )
  6. Speech Pen: Pen Input Interface Capable of Utilizing Speech Recognition for Digital Writing (Kazutaka Kurihara, Masataka Goto, Jun Ogata, Takeo Igarashi) [2005-]
    Kazutaka Kurihara, Masataka Goto, Jun Ogata, and Takeo Igarashi: Speech Pen: Predictive Handwriting based on Ambient Multimodal Recognition, Proceedings of the ACM SIGCHI Conference on Human Factors in Computing Systems (CHI 2006), pp.851-860, April 2006.
  7. PodCastle (Masataka Goto, Jun Ogata, Kouichirou Eto) [2006-]
    Masataka Goto, Jun Ogata, and Kouichirou Eto: PodCastle: A Web 2.0 Approach to Speech Recognition Research, Proceedings of Interspeech 2007, September 2007.
    Jun Ogata, Masataka Goto, and Kouichirou Eto: Automatic Transcription for a Web 2.0 Service to Search Podcasts, Proceedings of Interspeech 2007, September 2007.
  8. Presentation Sensei (Kazutaka Kurihara, Masataka Goto, Jun Ogata, Yosuke Matsusaka, Takeo Igarashi) [2006-]

Understanding Speech Audio Signals

  1. Real-time Hesitation (Filled Pause) Detection for Spontaneous Speech (Masataka Goto, Katunobu Itou, Satoru Hayamizu) [1998-]
    Masataka Goto, Katunobu Itou, and Satoru Hayamizu: A Real-time Filled Pause Detection System for Spontaneous Speech Recognition, Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech '99), pp.227-230, September 1999.
  2. Dynamic Pronunciation Modeling for Spontaneous Speech Recognition (Jun Ogata, Masataka Goto, Futoshi Asano) [2003-]
    Jun Ogata, Masataka Goto, and Futoshi Asano: Dynamic Pronunciation Modeling for Spontaneous Speech Recognition, the 2004 Spring Meeting of the Acoustical Society of Japan, pp.203-204, March 2004. (in Japanese)
  3. N-best Search methods for Minimum Word Error Rate Speech Recognition (Jun Ogata, Masataka Goto) [2004-]
    Jun Ogata and Masataka Goto: N-best Search methods for Minimum Word Error Rate Speech Recognition, the 2004 Autumn Meeting of the Acoustical Society of Japan, pp.195-196, September 2004. (in Japanese)
  4. Speaker Identification under Noisy Environments (Hiromasa Fujihara, Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno) [2005-]
    Hiromasa Fujihara, Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, and Hiroshi G. Okuno: Speaker Identification under Noisy Environments by Using Harmonic Structure Extraction and Reliable Frame Weighting, Proceedings of the 9th International Conference on Spoken Language Processing (Interspeech 2006 - ICSLP), pp.1459-1462, September 2006.

Sound Source Segregation

  1. Sound Stream Segregation based on Residue-Driven Architecture (Tomohiro Nakatani, Masataka Goto, Takeshi Kawabata, Hiroshi G. Okuno) [1995-1998]
    Tomohiro Nakatani, Masataka Goto, and Hiroshi G. Okuno: Localization by Harmonic Structure and Its Application to Harmonic Sound Stream Segregation, Proceedings of the 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1996), pp.653-656, May 1996.
  2. Speech Event Tracking and Separation based on the Audio and Video Information Fusion (Futoshi Asano, Hideki Asoh, Isao Hara, Takashi Yoshimura, Jun Ogata, Naoyuki Ichimura, Yoichi Motomura, Masataka Goto, Kiyoshi Yamamoto) [2001-]
    Futoshi Asano, Masataka Goto, Katunobu Itou, and Hideki Asoh: Real-time Sound Source Localization and Separation System and Its Application to Automatic Speech Recognition, Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), pp.1013-1016, September 2001.

Publications:


box Back to:


Masataka GOTO <m.goto [at] aist.go.jp>
All pages are copyrighted by the author. Unauthorized reproduction is strictly prohibited.

last update: March 9, 2007