Bøger af Yiteng Huang - plingboeger.dk

Speech Enhancement in the Karhunen-Loeve Expansion Domain

af Jacob Benesty

322,95 kr.

This book is devoted to the study of the problem of speech enhancement whose objective is the recovery of a signal of interest (i.e., speech) from noisy observations. Typically, the recovery process is accomplished by passing the noisy observations through a linear filter (or a linear transformation). Since both the desired speech and undesired noise are filtered at the same time, the most critical issue of speech enhancement resides in how to design a proper optimal filter that can fully take advantage of the difference between the speech and noise statistics to mitigate the noise effect as much as possible while maintaining the speech perception identical to its original form. The optimal filters can be designed either in the time domain or in a transform space. As the title indicates, this book will focus on developing and analyzing optimal filters in the Karhunen-Loeve expansion (KLE) domain. We begin by describing the basic problem of speech enhancement and the fundamental principles to solve it in the time domain. We then explain how the problem can be equivalently formulated in the KLE domain. Next, we divide the general problem in the KLE domain into four groups, depending on whether interframe and interband information is accounted for, leading to four linear models for speech enhancement in the KLE domain. For each model, we introduce signal processing measures to quantify the performance of speech enhancement, discuss the formation of different cost functions, and address the optimization of these cost functions for the derivation of different optimal filters. Both theoretical analysis and experiments will be provided to study the performance of these filters and the links between the KLE-domain and time-domain optimal filters will be examined. Table of Contents: Introduction / Problem Formulation / Optimal Filters in the Time Domain / Linear Models for Signal Enhancement in the KLE Domain / Optimal Filters in the KLE Domain with Model 1 / Optimal Filters in the KLE Domain with Model 2 / Optimal Filters in the KLE Domain with Model 3 / Optimal Filters in the KLE Domain with Model 4 / Experimental Study

Bog
322,95 kr.

Se mere

A Perspective on Single-Channel Frequency-Domain Speech Enhancement

af Jacob Benesty & Yiteng Huang

404,95 kr.

This book focuses on a class of single-channel noise reduction methods that are performed in the frequency domain via the short-time Fourier transform (STFT). The simplicity and relative effectiveness of this class of approaches make them the dominant choice in practical systems. Even though many popular algorithms have been proposed through more than four decades of continuous research, there are a number of critical areas where our understanding and capabilities still remain quite rudimentary, especially with respect to the relationship between noise reduction and speech distortion. All existing frequency-domain algorithms, no matter how they are developed, have one feature in common: the solution is eventually expressed as a gain function applied to the STFT of the noisy signal only in the current frame. As a result, the narrowband signal-to-noise ratio (SNR) cannot be improved, and any gains achieved in noise reduction on the fullband basis come with a price to pay, which is speechdistortion. In this book, we present a new perspective on the problem by exploiting the difference between speech and typical noise in circularity and interframe self-correlation, which were ignored in the past. By gathering the STFT of the microphone signal of the current frame, its complex conjugate, and the STFTs in the previous frames, we construct several new, multiple-observation signal models similar to a microphone array system: there are multiple noisy speech observations, and their speech components are correlated but not completely coherent while their noise components are presumably uncorrelated. Therefore, the multichannel Wiener filter and the minimum variance distortionless response (MVDR) filter that were usually associated with microphone arrays will be developed for single-channel noise reduction in this book. This might instigate a paradigm shift geared toward speech distortionless noise reduction techniques. Table of Contents: Introduction / Problem Formulation / Performance Measures / Linear and Widely Linear Models / Optimal Filters with Model 1 / Optimal Filters with Model 2 / Optimal Filters with Model 3 / Optimal Filters with Model 4 / Experimental Study

Bog
404,95 kr.

Se mere

Microphone Array Signal Processing

af Jacob Benesty, Jingdong Chen & Yiteng Huang

2.010,95 - 2.020,95 kr.

The main objective of this concise book is to derive and explain the most fundamental algorithms from a strictly broadband (signals and/or processing) viewpoint. Thanks to the approach taken here, new concepts come to light that have great potential.

Bog
2.010,95 kr.

Se mere

Noise Reduction in Speech Processing

af Jacob Benesty, Jingdong Chen, Israel Cohen & mfl.

1.110,95 - 1.333,95 kr.

This text offers an overview of the state-of-the-art of noise reduction algorithms, focusing on the most useful techniques. In addition, it rigorously derives these techniques and proves many fundamental and intuitive results often taken for granted.

Bog
1.110,95 kr.

Se mere

Acoustic MIMO Signal Processing

af Jacob Benesty, Jingdong Chen & Yiteng Huang

921,95 - 1.153,95 kr.

This book reviews acoustic MIMO scenarios for improving human-machine interaction. The book establishes an acoustic MIMO paradigm, linking MIMO signal processing with classical signal processing. Also presented is a novel analysis of acoustic applications to reinforce the fundamentals of acoustic MIMO signal processing.

Bog
921,95 kr.

Se mere