Anssi Rämö

Senior Researcher, Voice Applications
Media Content Representation

Background

I joined NRC Tampere in 1999. I did my master's thesis on speech coding at the Tampere University of Technology, and started on the same things in NRC. Since then I have been working on various speech coding related projects. Quite a lot of  my work has gone to ITU-T related standardization efforts such as ITU-4k, G.718, G.729.1E and G.718B. My current standardization effort is the ongoing 3GPP EVS.

Research Interests

  • Speech and audio coding
  • Spatial audio capture
  • Video and image coding
  • Data compression

Conferences

Conference Presentations

[poster] Anssi Rämö and Henri Toukomaa "Voice Quality Evaluation of Recent Open Source Codecs", Interspeech 2010, Tokyo, Japan, September 2010

[poster] Anssi Rämö and Henri Toukomaa "Voice Quality Characterization of IETF Opus Codec", Interspeech 2011, Florence, Italy, August 2011

Personal Information

I'm a radioamateur. I'm active member of both Tampere University of Technology radio amateur club OH3TR as well Nokia Tampere Radio Amateur Club OH3AV.

Education

  • M.Sc. Computer Science, Tampere University of Technology, Finland, 1999.
  • PhD on Signal processing, Tampere University of Technology, still on going..

Publications

[pdf] Alan McCree et al., "A 4 kb/s Hybrid MELP/CELP Speech Coding Candidate for ITU Standardization", ICASSP-2002, Orlando, FL, USA, May 2002

[pdf] Jacek Stachurski et al., "Hybrid MELP/CELP Coding at Bit Rates from 6.4 to 2.4 kb/s", ICASSP-2003, Hong Kong, China, 2003 (presented in Montreal, Canada, May 2004)

[pdf] Anssi Rämö, "Improving LSF quantization Performance with Sorting", ICSP-2004 Beijing, China, September 2004

[pdf] Anssi Rämö, Jani Nurminen, Sakari Himanen, and Ari Heikkinen, "Segmental Speech Coding for Storage Applications" ICSLP-2004 , Jeju, Korea, October 2004

[pdf] Anssi Rämö and Henri Toukomaa, "On Comparing Speech Quality of Various Narrow- And Wideband Speech Codecs" ISSPA-2005 , Sydney, Australia, August 2005

[pdf] Jani Nurminen, Sakari Himanen and Anssi Rämö, "Efficient Technique for Quantization of Pitch Contours" Speech Prosody 2006 , Dresden, Germany, May 2006

[pdf] Anssi Rämö at al., "Quality Evaluation of the G.EV-VBR Speech Codec", ICASSP-2008 Las Vegas, NV, U.S.A., April 2008

[pdf] Milan Jelinek et al., "ITU-T G.EV-VBR Baseline Codec", ICASSP-2008 Las Vegas, NV, U.S.A., April 2008

[pdf] Tommy Vaillancourt at al., "ITU-T EV-VBR: a Robust 8-32 kbit/s Scalable Coder for Error Prone Telecommunications Channels", Eusipco-2008 Lausanne, Switzerland, August 2008

[pdf] Lasse Laaksonen and Anssi Rämö "Using Noise Reduction in Mode Selection and Pitch Search", ICSPCS-2008 Gold Coast, Australia, December 2008

[pdf] Mikko Tammi, Lasse Laaksonen, Anssi Rämö, and Henri Toukomaa "Scalable Superwideband Extension for Wideband Coding"ICASSP-2009, Taipei, Taiwan, April 2009

[pdf][pdf-results] Anssi Rämö "Voice Quality Evaluation of Various Codecs", ICASSP-2010, Dallas, U.S.A., March 2010

[pdf] Anssi Rämö and Henri Toukomaa "Voice Quality Evaluation of Recent Open Source Codecs", Interspeech 2010, Tokyo, Japan, September 2010

[pdf] Christina Dicke, Viljakaisa Aaltonen, Anssi Rämö, and Miikka Vilermo "Talk to me: The Influence of Audio Quality on the Perception of Social Presence", HCI-2010, Dundee, U.K., September 2010

[pdf] Kari Järvinen, Imed Bouazizi, Lasse Laaksonen, Pasi Ojala, and Anssi Rämö "Media coding for the next generation mobile system LTE", Elsevier Computer Communications, pp. 1916-1927, October 2010

[pdf] Anssi Rämö and Henri Toukomaa "Voice Quality Characterization of IETF Opus Codec", Interspeech 2011, Florence, Italy, August 2011

Anssi Rämö's M.Sc. Thesis "Pitch Modification and Quantization for Offline Speech Coding", Tampere University of Technology, 1999.

Patents

Five accepted patents and twenty patents pending

US 7,003,454 Method and System for Line Spectral Frequency Vector Quantization in Speech Codec

US 7,523,032 Speech coding method, device, coding module, system and software program product for pre-processing the phase structure of a to be encoded speech signal to match the phase structure of the decoded signal

US 7,587,314 Single-codebook vector quantization fot multiple rate applications

US 7,752,038 Pitch lag estimation

US 7,813,922 Audio quantization