Title page for ETD etd-01122011-033543


Document Type Master's Dissertation
Author Dean, Eileen J
Email edean@saol.com
URN etd-01122011-033543
Document Title Computer aided identification of biological specimens using self-organizing maps
Degree MSc
Department Computer Science
Supervisor
Advisor Name Title
Prof A P Engelbrecht Committee Chair
Prof A Nicholas Committee Co-Chair
Keywords
  • tree identification
  • biological keys
  • biological identification
  • clustering and visualization
  • ANN
  • artificial neural network
  • AI
  • artificial intelligence
  • botanical Identification
  • Acacia species
  • self-organizing map
  • unsupervised learning algorithm
  • SOM
Date 2010-04-25
Availability unrestricted
Abstract
For scientific or socio-economic reasons it is often necessary or desirable that biological material be identified. Given that there are an estimated 10 million living organisms on Earth, the identification of biological material can be problematic. Consequently the services of taxonomist specialists are often required. However, if such expertise is not readily available it is necessary to attempt an identification using an alternative method. Some of these alternative methods are unsatisfactory or can lead to a wrong identification. One of the most common problems encountered when identifying specimens is that important diagnostic features are often not easily observed, or may even be completely absent. A number of techniques can be used to try to overcome this problem, one of which, the Self Organizing Map (or SOM), is a particularly appealing technique because of its ability to handle missing data. This thesis explores the use of SOMs as a technique for the identification of indigenous trees of the Acacia species in KwaZulu-Natal, South Africa. The ability of the SOM technique to perform exploratory data analysis through data clustering is utilized and assessed, as is its usefulness for visualizing the results of the analysis of numerical, multivariate botanical data sets. The SOM’s ability to investigate, discover and interpret relationships within these data sets is examined, and the technique’s ability to identify tree species successfully is tested. These data sets are also tested using the C5 and CN2 classification techniques. Results from both these techniques are compared with the results obtained by using a SOM commercial package. These results indicate that the application of the SOM to the problem of biological identification could provide the start of the long-awaited breakthrough in computerized identification that biologists have eagerly been seeking.

© 2010 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.

Please cite as follows:

Dean, EJ 2010, Computer aided identification of biological specimens using self-organizing maps, MSc dissertation, University of Pretoria, Pretoria, viewed yymmdd < http://upetd.up.ac.za/thesis/available/etd-01122011-033543/ >

C11/44/ag

Files
  Filename       Size       Approximate Download Time (Hours:Minutes:Seconds) 
 
 28.8 Modem   56K Modem   ISDN (64 Kb)   ISDN (128 Kb)   Higher-speed Access 
  dissertation.pdf 1.45 Mb 00:06:43 00:03:27 00:03:01 00:01:30 00:00:07

Browse All Available ETDs by ( Author | Department )

If you have more questions or technical problems, please Contact UPeTD.