Document Type Master's Dissertation Author Combrinck, Hendrik Petrus URN etd-12212006-141335 Document Title A cost, complexity and performance comparison of two automatic language identification architectures Degree M Eng (Computer Engineering) Department Electrical, Electronic and Computer Engineering Supervisor
Advisor Name Title Prof E C Botha Committee Chair Keywords
- speech processing systems
- pattern recognition systems
Date 1999-11-01 Availability unrestricted AbstractThis dissertation investigates the cost-complexity-performance relationship between two automatic language identification systems. The first is a state-of-the-art archi¬tecture, trained on about three hours of phonetically hand-labelled telephone speech obtained from the recognised OGLTS corpus. The second system, introduced by our¬selves, is a simpler design with a smaller, less complex parameter space. It is a vector quantisation-based approach which bears some resemblance to a system suggested by Sugiyama. Though trained on the same data, it has no need for any labels and is therefore less costly. A number of experiments are performed to find quasi-optimal parameters for the two systems. In further experiments the systems are evaluated and compared on a set of ten two-language tasks, spanning five languages. The more com¬plex system is shown to have a substantial performance advantage over the simpler design - 81% versus 65% on 40 seconds of speech. However, both results are well under reported state-of-the-art performance of 94% and would suggest that our systems can benefit from additional attention to implementation detail and optimisation of various parameters. Given the above, our suggested architecture may potentially provide an adequate solution where the high development cost associated with state-of-the-art technology and the necessary training corpora are prohibitive.
© 1999, University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
Please cite as follows:
Combrinck, HP 1999, A cost, complexity and performance comparison of two automatic language identification architectures, MEng dissertation, University of Pretoria, Pretoria, viewed yymmdd < http://upetd.up.ac.za/thesis/available/etd-12212006-141335/ >
Filename Size Approximate Download Time (Hours:Minutes:Seconds)
28.8 Modem 56K Modem ISDN (64 Kb) ISDN (128 Kb) Higher-speed Access dissertation.pdf 2.40 Mb 00:11:07 00:05:43 00:05:00 00:02:30 00:00:12