Hello,
    I am quite new to maximum entropy and unfortunately, I'm fighting with input
to TADM (tadm.sf.net) and to OpenNLP. Maybe my question is off-topic, however
 I would like to know the answer. However, I would like to have an equivalent
inputs for OpenNLP GIS and TADM.

For example to open nlp the input are is the set of observations with the 
format:
(outcome_i, events_i) where events_i is a set of events.

For example, let have a database of binary vectors:

1 0 0 1 1 1 1 1 1 1
0 1 0 0 1 1 1 0 0 0
0 0 0 1 1 1 1 1 1 1
1 0 1 1 1 0 1 1 1 1
0 0 0 0 0 0 0 1 1 1
1 0 0 0 1 0 1 1 1 1
0 0 0 1 0 1 0 1 1 0
1 0 1 1 1 1 1 1 0 0
0 1 0 1 1 0 1 1 1 0

let model the column zero using all other columns. The input can be:

outcome | strings representing the observations/events
class=1 | 1=0 2=0 3=1 4=1 5=1 6=1 7=1 8=1 9=1
class=0 | 1=1 2=0 3=0 4=1 5=1 6=1 7=0 8=0 9=0
class=0 | 1=0 2=0 3=1 4=1 5=1 6=1 7=1 8=1 9=1
class=1 | 1=0 2=1 3=1 4=1 5=0 6=1 7=1 8=1 9=1
class=0 | 1=0 2=0 3=0 4=0 5=0 6=0 7=1 8=1 9=1
class=1 | 1=0 2=0 3=0 4=1 5=0 6=1 7=1 8=1 9=1
class=0 | 1=0 2=0 3=1 4=0 5=1 6=0 7=1 8=1 9=0
class=1 | 1=0 2=1 3=1 4=1 5=1 6=1 7=1 8=0 9=0
class=0 | 1=1 2=0 3=1 4=1 5=0 6=1 7=1 8=1 9=0
class=0 | 1=0 2=0 3=1 4=1 5=1 6=1 7=1 8=0 9=0
class=1 | 1=0 2=0 3=1 4=0 5=1 6=1 7=0 8=0 9=1
class=0 | 1=1 2=0 3=1 4=1 5=1 6=1 7=1 8=0 9=0
class=1 | 1=1 2=0 3=1 4=1 5=0 6=1 7=0 8=0 9=0
class=0 | 1=1 2=0 3=1 4=0 5=1 6=1 7=1 8=0 9=0

The string n=m means the bit n is set to the value m, counting n from 0. 

Now, what is the equivalent input for TADM ? I suspect that the input for TADM
consists of all observations. Is this correct ? However as I have only a very 
limited
number of observations I get a probability distribution that gives probability 
of
1/n for n bits.

I understand that this question is maybe a bit out of topic, but maybe there is 
someone who knows
answer to my question.

Thanks,
     Robert

Reply via email to