menu
{ "item_title" : "Invariant Features and Enhanced Speaker Normalization for Automatic Speech Recognition", "item_author" : [" Florian Muller "], "item_description" : "Automatic speech recognition systems have to handle various kinds of variabilities sufficiently well in order to achieve high recognition rates in practice. One of the variabilities that has a major impact on the performance is the vocal tract length of the speakers. Normalization of the features and adaptation of the acoustic models are commonly used methods in speech recognition systems. In contrast to that, a third approach follows the idea of extracting features with transforms that are invariant to vocal tract lengths changes. This work presents several approaches for extracting invariant features for automatic speech recognition systems. The robustness of these features under various training-test conditions is evaluated and it is described how the robustness of the features to noise can be increased. Furthermore, it is shown how the spectral effects due to different vocal tract lengths can be estimated with a registration method and how this can be used for speaker normalization.", "item_img_path" : "https://covers4.booksamillion.com/covers/bam/3/83/253/319/3832533192_b.jpg", "price_data" : { "retail_price" : "57.00", "online_price" : "57.00", "our_price" : "57.00", "club_price" : "57.00", "savings_pct" : "0", "savings_amt" : "0.00", "club_savings_pct" : "0", "club_savings_amt" : "0.00", "discount_pct" : "10", "store_price" : "" } }
Invariant Features and Enhanced Speaker Normalization for Automatic Speech Recognition|Florian Muller

Invariant Features and Enhanced Speaker Normalization for Automatic Speech Recognition

local_shippingShip to Me
On Order. Usually ships in 2-4 weeks
FREE Shipping for Club Members help

Overview

Automatic speech recognition systems have to handle various kinds of variabilities sufficiently well in order to achieve high recognition rates in practice. One of the variabilities that has a major impact on the performance is the vocal tract length of the speakers. Normalization of the features and adaptation of the acoustic models are commonly used methods in speech recognition systems. In contrast to that, a third approach follows the idea of extracting features with transforms that are invariant to vocal tract lengths changes. This work presents several approaches for extracting invariant features for automatic speech recognition systems. The robustness of these features under various training-test conditions is evaluated and it is described how the robustness of the features to noise can be increased. Furthermore, it is shown how the spectral effects due to different vocal tract lengths can be estimated with a registration method and how this can be used for speaker normalization.

This item is Non-Returnable

Details

  • ISBN-13: 9783832533199
  • ISBN-10: 3832533192
  • Publisher: Logos Verlag Berlin
  • Publish Date: January 2013
  • Dimensions: 8.04 x 5.68 x 0.56 inches
  • Shipping Weight: 0.65 pounds
  • Page Count: 247

Related Categories

You May Also Like...

    1

BAM Customer Reviews