下载此文档

《数据仓库与数据挖掘》第10章.ppt


文档分类:IT计算机 | 页数:约92页 举报非法文档有奖
1/92
下载提示
  • 1.该资料是网友上传的,本站提供全文预览,预览什么样,下载就什么样。
  • 2.下载该文档所得收入归上传者、原创者。
  • 3.下载的文档,不会出现我们的网址水印。
1/92 下载此文档
文档列表 文档介绍
第8章: 聚类分析
What is Cluster Analysis?
Types of Data in Cluster Analysis
A Categorization of Major Clustering Methods
Partitioning Methods
Hierarchical Methods
Density-Based Methods
Grid-Based Methods
Model-Based Clustering Methods
Outlier Analysis
Summary

Computational Intelligence Lab, Zhejiang University
Clustering Examples
Segment customer database based on similar buying patterns.
Group houses in a town into neighborhoods based on similar features.
Identify new plant species
Identify similar Web usage patterns
Spatial Data Analysis
create thematic maps in GIS by clustering feature spaces
detect spatial clusters and explain them in spatial data mining
Image Processing

Computational Intelligence Lab, Zhejiang University
Clustering Customers

Computational Intelligence Lab, Zhejiang University
Clustering Houses
Size Based

Computational Intelligence Lab, Zhejiang University
Clustering vs. Classification
No prior knowledge
Number of clusters
Meaning of clusters
Unsupervised learning

Computational Intelligence Lab, Zhejiang University
Clustering Problem
Given a database D={t1,t2,…,tn} of tuples and an integer value k, the Clustering Problem is to define a mapping f:D{1,..,k} where each ti is assigned to one cluster Kj, 1<=j<=k.
A Cluster, Kj, contains precisely those tuples mapped to it.
Unlike classification problem, clusters are not known a priori.
* Fuzzy Clustering

Computational Intelligence Lab, Zhejiang University
What Is Good Clustering?
A good clustering method will produce high quality clusters with
high intra-class similarity
low inter-class similarity
The quality of a clustering result depends on both the similarity measure used by the method and its implementation.
The quality of a clustering method is also measured by its ability to discover some or all of the hidden patterns.

Computational Intelligence Lab, Zhejiang University
Requirements of Clustering in D

《数据仓库与数据挖掘》第10章 来自淘豆网www.taodocs.com转载请标明出处.

非法内容举报中心
文档信息
  • 页数92
  • 收藏数0 收藏
  • 顶次数0
  • 上传人中国课件站
  • 文件大小0 KB
  • 时间2011-09-06