Articles via Databases
Articles via Journals
Online Catalog
E-books
Research & Information Literacy
Interlibrary loan
Theses & Dissertations
Collections
Policies
Services
About / Contact Us
Administration
Littman Architecture Library
This site will be removed in January 2019, please change your bookmarks.
This page will redirect to https://digitalcommons.njit.edu/theses/405 in 5 seconds

The New Jersey Institute of Technology's
Electronic Theses & Dissertations Project

Title: Online clustering with single-pass topology based fuzzy clustering algorithm
Author: Jaiantilal, Abhishek
View Online: njit-etd2006-006
(xvi, 155 pages ~ 11.7 MB pdf)
Department: Department of Electrical and Computer Engineering
Degree: Master of Science
Program: Computer Engineering
Document Type: Thesis
Advisory Committee: Dhawan, Atam P. (Committee chair)
Manikopoulos, Constantine N. (Committee member)
Hou, Edwin (Committee member)
Date: 2006-01
Keywords: Data analysis
Data topology
Availability: Unrestricted
Abstract:

Online clustering is of significant interest for real-time data analysis. Generic offline clustering methods such as K-Means, C-Means and others are computationally expensive. The computational burden of these methods increases non-linearly with the size of the data set. In addition these methods usually require a good amount of supervised knowledge yielding a non-unique solution. For real-time data analysis, there is an important tradeoff between accuracy and computational efficiency. An unsupervised one-pass clustering method that efficiently adapts to data distribution and evaluation is proposed. This method, Topology-Based Fuzzy Clustering (TFC), uses the topology of data to discover clusters. TFC uses the method of Growing Neural Gas (GNG) method of creating linked sub-clusters and extends GNG by assigning a fuzzy membership to the sub-clusters, noting the link structure for creating clusters and influencing the learning nodes at each sub-clusters. This also gives a fuzzy estimation of data distribution within each cluster. The computational burden for TFC is proportional to the size of the initial data set and increases linearly with the addition of new data.

As TFC is based on GNG, it is an unsupervised algorithm. A supervised learning method is proposed that can be used in conjunction with TFC, to increases its accuracy with minimum computational burden. This adaptive algorithm is called the Adaptive Topology-Based Fuzzy Clustering (ATFC). In this study, the performance of ATFC and TFC is also evaluated against standard datasets.


If you have any questions please contact the ETD Team, libetd@njit.edu.

 
ETD Information
Digital Commons @ NJIT
Theses and DIssertations
ETD Policies & Procedures
ETD FAQ's
ETD home

Request a Scan
NDLTD

NJIT's ETD project was given an ACRL/NJ Technology Innovation Honorable Mention Award in spring 2003