This is an old question that I just dredged up in my email. There is still a question about your format here. When you say "IPs" do you mean that you have a list of IP addresses?
Or is this a server web-log? Does that mean that the destination IP is implicit?

If so, you might be able to see a weak signal due to time proximity of different IP addresses, but I can't see that you would see much else. Time proximity might give you a hint about widespread attacks.

On Wed, Feb 18, 2015 at 6:49 AM, Raghuveer <alwaysra...@yahoo.com> wrote:

> Hi,
>
> I was going through mahout ppts online and came accross your email ID. I
> have few issues when i want to analyse my dataset.
>
> i am trying to find how i can make use of my dataset to present some
> relations. I have a dataset of the sort
>
> IPs,timestamp,bytes_tranferred
>
> what are the different relationships i can derive from this set so that i
> can present some meaningful values using mahout. Currently am planning to
> use this set to represent which client (in IPs column) had more traffic for
> a given time. So i will have to group IPs together i guess. Are there any
> better ideas and how can i do it using JAVA code It would be really helpful
> if you can show me a sample for this issue. Kindly suggest.
>
> Thanks,
> Raghuveer
>
> On Tuesday, February 17, 2015 12:24 AM, Ted Dunning
> <ted.dunn...@gmail.com> wrote:
>
> > Please take questions like this to the Mahout mailing list.
> >
> > I really prefer to answer these questions in public.
> >
> > On Mon, Feb 16, 2015 at 3:51 AM, Raghuveer <alwaysra...@yahoo.com> wrote:
> >
> > > Hi,
> > >
> > > I was going through mahout ppts online and came accross your email ID. I
> > > have few issues when i want to analyse my dataset.
> > >
> > > i am trying to find how i can make use of my dataset to present some
> > > relations. I have a dataset of the sort
> > >
> > > IPs,timestamp,bytes_tranferred
> > >
> > > what are the different relationships i can derive from this set so that i
> > > can present some meaningful values using mahout. Currently am planning to
> > > use this set to represent which client (in IPs column) had more traffic for
> > > a given time. So i will have to group IPs together i guess. Are there any
> > > better ideas and how can i do it using JAVA code It would be really helpful
> > > if you can show me a sample for this issue. Kindly suggest.
> > >
> > > Thanks,
> > > Raghuveer
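
On the "which client had more traffic for a given time" idea in the quoted message: a minimal plain-Java sketch (not Mahout code) might look like the following. It assumes each row is ip,timestamp,bytes with the timestamp in epoch seconds, which the thread does not actually say. The class name, the method names, the one-hour bucket and the sample addresses are all made up for illustration.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

final class TrafficByClient {

    // Assumed bucket width (one hour); use whatever "a given time" means for your data.
    static final long BUCKET_SECONDS = 3600L;

    // Sums bytes per time bucket and, within each bucket, per IP.
    // Assumes rows of the form "ip,epochSeconds,bytes"; rows that do not
    // parse (such as a header row) are skipped.
    static Map<Long, Map<String, Long>> bytesPerBucket(List<String> rows) {
        Map<Long, Map<String, Long>> totals = new TreeMap<>();
        for (String row : rows) {
            String[] fields = row.trim().split(",");
            if (fields.length != 3) {
                continue;
            }
            try {
                String ip = fields[0].trim();
                long timestamp = Long.parseLong(fields[1].trim());
                long bytes = Long.parseLong(fields[2].trim());
                // Round the timestamp down to the start of its bucket.
                long bucket = Math.floorDiv(timestamp, BUCKET_SECONDS) * BUCKET_SECONDS;
                totals.computeIfAbsent(bucket, k -> new TreeMap<>())
                      .merge(ip, bytes, Long::sum);
            } catch (NumberFormatException e) {
                // Not a data row; ignore it.
            }
        }
        return totals;
    }

    // Returns the IP with the largest byte total, or null for an empty map.
    static String busiestClient(Map<String, Long> bytesByIp) {
        String best = null;
        long bestBytes = Long.MIN_VALUE;
        for (Map.Entry<String, Long> entry : bytesByIp.entrySet()) {
            if (best == null || entry.getValue() > bestBytes) {
                best = entry.getKey();
                bestBytes = entry.getValue();
            }
        }
        return best;
    }

    public static void main(String[] args) {
        // Made-up sample rows using documentation-only addresses.
        List<String> rows = Arrays.asList(
                "IPs,timestamp,bytes_tranferred",
                "192.0.2.1,1424242800,500",
                "192.0.2.2,1424242900,300",
                "192.0.2.1,1424243000,200",
                "192.0.2.2,1424246400,900");
        Map<Long, Map<String, Long>> totals = bytesPerBucket(rows);
        for (Map.Entry<Long, Map<String, Long>> e : totals.entrySet()) {
            System.out.println(e.getKey() + " " + e.getValue()
                    + " busiest=" + busiestClient(e.getValue()));
        }
    }
}
```

Buckets that contain many distinct IPs in a short window are also the place to look for the time-proximity signal mentioned above, though with only these three columns I would not expect much more than that.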