本文介绍了Matlab中的模糊c均值tcp转储聚类的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我有一些这样表示的数据:

Hi I have some data thats represented like this:

0,tcp,http,SF,239,486,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,8,8,0.00,0.00,0.00,0.00,1.00,0.00,0.00,19,19,1.00,0.00,0.05,0.00,0.00,0.00,0.00,0.00,normal.

来自于以darpa套装为基础的kdd杯1999.

Its from the kdd cup 1999 which was based on the darpa set.

我拥有的文本文件具有成排的数据,例如,在matlab中,您可以通过键入findcluster使用通用的聚类工具,但该工具仅接受.dat文件.

the text file I have has rows and rows of data like this, in matlab there is the generic clustering tool you can use by typing findcluster but it only accepts .dat files.

我也不太确定是否接受这样的格式.林也不确定为什么在转储文件中有这么多的尾随零.

Im also not very sure if it will accept the format like this. Im also not sure why there is so many trailing zeros in the dump files.

有人可以帮助我如何利用文本文档并通过Matlab中的fcm聚类方法运行它吗?确实需要代码帮助.

Can anyone help how I can utilise the text document and run it thru a fcm clustering method in matlab? Code help is really needed.

推荐答案

FINDCLUSTER 只是用于两种群集算法的GUI界面: FCM 订阅

FINDCLUSTER is simply a GUI interface for two clustering algorithms: FCM and SUBCLUST

您首先需要从文件中读取数据,然后查看 TEXTSCAN

You first need to read the data from file, look into the TEXTSCAN function for that.

然后,您需要处理非数字属性;删除它们或以某种方式转换它们.据我所知,提到的两种算法仅支持数字数据.

Then you need to deal with non-numeric attributes; either remove them or convert them somehow. As far as I can tell, the two algorithms mentioned only support numeric data.

访问 KDD cup 数据集的原始网站以查找每个属性的描述.

Visit the original website of the KDD cup dataset to find out the description of each attribute.

这篇关于Matlab中的模糊c均值tcp转储聚类的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持!

10-21 11:54