QUEST Central Library Banner

Performance Analysis of NBMU & Samo Classification Techniques used for Textual Information (Record no. 60377)

MARC details
000 -LEADER
fixed length control field 02442nam a22001457a 4500
100 ## - MAIN ENTRY--AUTHOR NAME
Personal name Haseena Baloch
-- 12MCSE07
-- Supervisor Dr. Akhtar Hussain Jalbani
100 ## - MAIN ENTRY--AUTHOR NAME
Personal name 12-MCSE-07
245 ## - TITLE STATEMENT
Title Performance Analysis of NBMU & Samo Classification Techniques used for Textual Information
260 ## - PUBLICATION, DISTRIBUTION, ETC. (IMPRINT)
Place of publication Nawabshah:
Name of publisher QUEST,
Year of publication 2016.
300 ## - PHYSICAL DESCRIPTION
Number of Pages 60p.
500 ## - GENERAL NOTE
General note ABSTRACT<br/><br/>here is a huge amount of data in various formats that is present over internet. It is difficult to classify millions of text documents manually because it requires more time and resources. Therefore, text classification is widely used for organizing text automatically. In this research, two classification techniques Naive Bayes Multinomial Updateable (NBMU) and Sequential Minimal Optimization (SMO) were applied on dataset. According to results, it was observed that, the combination of Rainbow Stopword (R) and Snowball Stemmer (SS) in NMBU classifier yielded the maximum accuracy (83%) while taking nominal time (0.07 sec) compared to other combinations of stemmers and stopwords removal. Whereas the SMO classifier yielded high accuracy (80%) by three different combinations of stemmers and stopwords removers, (Wordsfromfile) Stopword and Lovins Stemmer (WFF_LS), Regexpfromfile Stopword and Lovins Stemmer (REFF_LS),Regexpfromfile Stopword and Snowball Stemmer (REFF_SS)). However, the time taken for building the model was significantly high (500 - 1000 times higher). Based on the results of this research, it is suggested that the R & SS combination of stopwords remover and stemmer, respectively, in NMBU classifier perform best across the other selected combinations in terms of accuracy. By analysing the results, it is observed that the overall performance of SMO classifier in terms of accuracy is quite high on average compared to NBMU classifier. It was noted that the time that was taken by NBMU classifier was significantly low, compared to SMO classifier. Despite that, SMO is suggested to be utilised for the text classification due to the fact that the overall performance of this classifier is significantly higher in term of accuracy and that the text classification is a difficult task to perform, therefore, the difference in the time taken by NBMU and SMO become negligible and compensated by better accuracy.<br/><br/><br/><br/><br/><br/><br/><br/>
700 ## - ADDED ENTRY--PERSONAL NAME
Personal name Department of Computer System Engineering
856 ## - ELECTRONIC LOCATION AND ACCESS
Uniform Resource Identifier https://tinyurl.com/ycc3c6cv
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Koha item type Thesis and Dissertation
Holdings
Withdrawn status Lost status Home library Current library Date acquired Accession Number Koha item type
    Research Section Research Section 03/10/2018 MP/26-276 Thesis and Dissertation

Copyright © 2018,The QUEST, Nawabshah, Shaheed Benazirabad. All rights reserved
Mr. G. Farooq Channar (Librarian) QUEST, Nawabshah, Sindh, Pakistan 67480.
 Ph#: |   0244-9370381-4 Ext. 2308   Email| lib@quest.edu.pk   Web|  http://www.quest.edu.pk