[R] Case study in forensic computing domain

2012-10-11 Thread Ambikesh Jayal
Hi,

I am looking for case studies, possibly real world, in forensic domain that
will entice forensic computing students and demonstrate the usefulness of
machine learning in forensics. Does anyone know of any such case studies?

Students should be able to replicate the case study, so it should have some
public corpus data and R code to implement the machine learning approach.

I think a case study to determine the authorship of document using machine
learning would be good. The other case study could a regression model to
detect fake currency based on size, weight and other attributes of a note.

Any pointers would be welcome.

Thanks,
Ambi.

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] Fwd: Using R for Multiple Regression

2010-08-01 Thread Ambikesh Jayal
-- Forwarded message --
From: Ambikesh Jayal ambi1...@gmail.com
Date: Sun, Aug 1, 2010 at 2:24 PM
Subject: Re: [R] Using R for Multiple Regression
To: ted.hard...@manchester.ac.uk


Hi Ted,

Thanks to all those who have replied. It was very helpful.

As there can be multiple solutions, is there a way in R to show all the
possible models for a dataset?

Also in R the value of coefficient of an independent variable being shown as
NA is same as being shown as 0 (implying that this variable does not
count).

However, in trying it out as you have, you have already found out something
very important about linear regression! (And about R).

The important point being that there can be multiple equations describing a
dataset? Or one way to simplify a model is to remove the independent
variables that depend on other independent variables?


Thanks again.

Kind regards

Ambikesh Jayal,
Department of Information Systems, Computing and Mathematics
Room 134 St John's Building
Brunel University
Uxbridge, Middlesex
UB8 3PH, UK
Website: http://sites.google.com/site/ambi1999/




On Fri, Jul 30, 2010 at 5:59 PM, Ted Harding
ted.hard...@manchester.ac.ukwrote:

 On 30-Jul-10 15:07:46, Ambikesh Jayal wrote:
  Hi,
  Subject: Using R for Multiple Regression
 
  I am new to statistic but am interested in applying mathematical
  models to solve biological problems. I have used a linear model
  to generate the test data. When using this data I expect R to
  correctly identify the model but that does not seem to be the case.
  I am certain that I am doing something wrong but not able to figure
  it out.
 
  Model:
  Y = m1x1 + m2x2+ m3X3 + c
 

 
  Model Identified by R using lm(formula = y ~ x1 + x2 + x3)
  (Intercept) 8.000e+01
  x1  1.100e+01
  x2 NA
  x3 NA
 
 
  The data I am using is as follows:
 
  y x1 x2 x3
  91 1 14 2
  102 2 15 5
  113 3 16 8
  124 4 17 11
  135 5 18 14
  146 6 19 17
  157 7 20 20
  168 8 21 23
  179 9 22 26
  190 10 23 29
 
  Kind regards
  Dr. Ambikesh Jayal,

 You should look again at your data!

 You have x2 = 13 + x1, x3 = 3*x1 - 1 in these data.
 Hence your model

  Y = m1*x1 + m2*x2+ m3*X3 + c

 with m1=5, m2=6, m3=0, c=2 is the same as

  Y = 5*x1 + 6*(x1+13) + 0*(3*x1 - 1) + 2
= 11*x1 + 6*13 + 2
= 11*x1 + 80

 and R has found that the coefficient of x1 is 1.100e+01 = 11,
 and that the intercept is 8.000e+01 = 80, and has also identified
 that, after allowing for x1, x2 and x3 are irrelevant.

 So, to try out how R behaves in linear regression, you should
 use data which do not have this property that some of the independent
 variables (x1,x2,x3) are linear functions of the others.

 However, in trying it out as you have, you have already found out
 something very important about linear regression! (And about R).

 Hoping this helps,
 Ted.

 
 E-Mail: (Ted Harding) ted.hard...@manchester.ac.uk
 Fax-to-email: +44 (0)870 094 0861
 Date: 30-Jul-10   Time: 17:59:51
 -- XFMail --


[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] Using R for Multiple Regression

2010-07-30 Thread Ambikesh Jayal
Hi,

Subject: Using R for Multiple Regression

I am new to statistic but am interested in applying mathematical models to
solve biological problems. I have used a linear model to generate the test
data. When using this data I expect R to correctly identify the model but
that does not seem to be the case. I am certain that I am doing something
wrong but not able to figure it out.

Model:
Y = m1x1 + m2x2+ m3X3 + c

m1=5
m2=6
m3=0
c=2

Model Identified by R using lm(formula = y ~ x1 + x2 + x3)
(Intercept) 8.000e+01
x1  1.100e+01
x2 NA
x3 NA


The data I am using is as follows:

y x1 x2 x3
91 1 14 2
102 2 15 5
113 3 16 8
124 4 17 11
135 5 18 14
146 6 19 17
157 7 20 20
168 8 21 23
179 9 22 26
190 10 23 29

Kind regards

Dr. Ambikesh Jayal,
Department of Information Systems, Computing and Mathematics
Room 134 St John's Building
Brunel University
Uxbridge, Middlesex
UB8 3PH, UK
Website: http://sites.google.com/site/ambi1999/

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] R to solve search problems

2010-06-18 Thread Ambikesh Jayal
Hi All,

I am new to R and want to use R to solve search problems (like Travelling
salesman problem, finding nearest neighbour, hill climbing). Is this
possible in R?

To start with I want as follows.

Find five float numbers whose sum is is equal to their product which should
be 8000.

x + y + z + p + r = 8000
x * y * z * p * r = 8000


Regards,
Ambikesh Jayal.
School of IS, Computing  Maths,
Brunel University,
Uxbridge, UB8 3PH,
United Kingdom.
Email: ambi1...@gmail.com

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] Using R to solve search problems

2010-06-18 Thread Ambikesh Jayal
Hi All,

I am new to R and want to use R to solve search problems (like Traveling
salesman problem, finding nearest neighbor, hill climbing). Is this possible
in R?

To start with I want as follows.

Find five float numbers whose sum is is equal to their product which should
be 8000.

x + y + z + p + r = 8000
x * y * z * p * r = 8000


Regards,
Ambikesh Jayal.
School of IS, Computing  Maths,
Brunel University,
Uxbridge, UB8 3PH,
United Kingdom.
Email: ambi1...@gmail.com

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.