Thesis
Biomechanically informed nonlinear speech signal processing
- Abstract:
- Linear digital signal processing based around linear, time-invariant systems theory finds substantial application in speech processing. The linear acoustic source-filter theory of speech production provides ready biomechanical justification for using linear techniques. Nonetheless, biomechanical studies surveyed in this thesis display significant nonlinearity and non-Gaussinity, casting doubt on the linear model of speech production. In order therefore to test the appropriateness of linear systems assumptions for speech production, surrogate data techniques can be used. This study uncovers systematic flaws in the design and use of exiting surrogate data techniques, and, by making novel improvements, develops a more reliable technique. Collating the largest set of speech signals to-date compatible with this new technique, this study next demonstrates that the linear assumptions are not appropriate for all speech signals. Detailed analysis shows that while vowel production from healthy subjects cannot be explained within the linear assumptions, consonants can. Linear assumptions also fail for most vowel production by pathological subjects with voice disorders. Combining this new empirical evidence with information from biomechanical studies concludes that the most parsimonious model for speech production, explaining all these findings in one unified set of mathematical assumptions, is a stochastic nonlinear, non-Gaussian model, which subsumes both Gaussian linear and deterministic nonlinear models. As a case study, to demonstrate the engineering value of nonlinear signal processing techniques based upon the proposed biomechanically-informed, unified model, the study investigates the biomedical engineering application of disordered voice measurement. A new state space recurrence measure is devised and combined with an existing measure of the fractal scaling properties of stochastic signals. Using a simple pattern classifier these two measures outperform all combinations of linear methods for the detection of voice disorders on a large database of pathological and healthy vowels, making explicit the effectiveness of such biomechanically-informed, nonlinear signal processing techniques.
Actions
Authors
Contributors
+ Roberts, S
- Division:
- MPLS
- Department:
- Mathematical Institute
- Role:
- Supervisor
+ McSharry, P
- Division:
- MPLS
- Department:
- Mathematical Institute
- Role:
- Supervisor
+ Moroz, I
- Division:
- MPLS
- Department:
- Mathematical Institute
- Role:
- Supervisor
+ Engineering and Physical Sciences Research Council
More from this funder
- Funding agency for:
- Little, M
- Publication date:
- 2007
- Type of award:
- DPhil
- Level of award:
- Doctoral
- Awarding institution:
- Oxford University, UK
- Language:
-
English
- Keywords:
- Subjects:
- UUID:
-
uuid:6f5b84fb-ab0b-42e1-9ac2-5f6acc9c5b80
- Local pid:
-
ora:4828
- Deposit date:
-
2011-01-19
Terms of use
- Copyright holder:
- Max A Little
- Copyright date:
- 2007
If you are the owner of this record, you can report an update to it here: Report update to this record