Re: co-occurrence paper and code

Ted Dunning Wed, 06 Aug 2014 17:07:26 -0700

We went around this may-pole a while ago.  It is the same if the matrix is
binary.  It isn't otherwise.  Whether this code might someday be used in a
context with non-binary inputs is an open question.  Likewise, whether it
is worth saving some time by omitting a thresholding operation to binarize
the matrix is similarly not clear.


My feeling is that assuming a binary matrix is fine.



On Wed, Aug 6, 2014 at 5:54 PM, Dmitriy Lyubimov <dlie...@gmail.com> wrote:

> is this
>
>     val bcastNumInteractions =
> drmBroadcast(drmI.numNonZeroElementsPerColumn)
>
> any different from just saying `drmI.colSums`?
>
>
> On Wed, Aug 6, 2014 at 4:49 PM, Dmitriy Lyubimov <dlie...@gmail.com>
> wrote:
>
> >
> >
> >
> > On Wed, Aug 6, 2014 at 4:21 PM, Dmitriy Lyubimov <dlie...@gmail.com>
> > wrote:
> >
> > I suppose in that context LLR is considered a distance (higher scores
> mean
> >> more `distant` items, co-occurring by chance only)?
> >>
> >
> > Self-correction on this one -- having given a quick look at llr paper
> > again, it looks like it is actually a similarity (higher scores meaning
> > more stable co-occurrences, i.e. it moves in the opposite direction of
> >  p-value if it had been a classic  test
> >
> >
> >> [1] http://ssc.io/wp-content/uploads/2012/06/rec11-schelter.pdf
> >>
> >> -d
> >>
> >
> >
>

Re: co-occurrence paper and code

Reply via email to