[EMAIL PROTECTED] wrote:
Hi there,
I am interested in applying machine learning algorithms to detecting network 
intrusions. I have read many papers and realized that KDD-99 is the most 
well-known dataset used in the field. However, this dataset was provided by MIT 
in 1999 and, obviously, it's pretty old. As we all know, defensive 
technologies evolve fast, and so do hacking techniques. Clearly, the KDD-99 
dataset would not provide a true representation of a network at the current 
time. So, could anyone please tell me which dataset is more up to date and 
specialized for machine learning research in IDS?
Thanks
Patrick


Hi, Patrick.

I am using the KDD-99 dataset in my research work. Though it is the most well-known dataset, it has several drawbacks that limit what you can do with it. For example, as you note, the distribution of normal data and attack data does not represent a real network.
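
To see the distribution issue for yourself, you can count how the records split across labels. Below is a minimal sketch, assuming the usual KDD-99 CSV layout in which the last field of each record is the label followed by a trailing period (for example "normal." or "smurf."). The function name label_distribution and the shortened sample rows are made up for illustration; real records carry many more feature columns.

```python
import csv
import io
from collections import Counter


def label_distribution(lines):
    """Return the fraction of records per label.

    Assumes each CSV record ends with the label and that labels
    carry a trailing period, which is stripped here.
    """
    counts = Counter()
    for row in csv.reader(lines):
        if not row:
            continue  # skip blank lines
        counts[row[-1].strip().rstrip(".")] += 1
    total = sum(counts.values())
    return {label: n / total for label, n in counts.items()}


# Hypothetical, shortened rows (illustration only, not real data).
sample = io.StringIO(
    "0,tcp,http,SF,normal.\n"
    "0,icmp,ecr_i,SF,smurf.\n"
    "0,icmp,ecr_i,SF,smurf.\n"
    "0,icmp,ecr_i,SF,smurf.\n"
)
dist = label_distribution(sample)
print(dist)
```

To run it on the real data, pass an open file handle instead of the sample, then compare the attack share it reports with what you would expect on your own network.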

I think that a better dataset is the original one used to generate the KDD-99 dataset. It can be obtained from www.ll.mit.edu.

Cheers


Juan A. Suárez-Romero
