On Tue, Jun 4, 2013 at 12:14 AM, Andreas Mueller
<amuel...@ais.uni-bonn.de>wrote:

> On 06/03/2013 04:09 PM, Lars Buitinck wrote:
> > 2013/6/3 Andreas Mueller <amuel...@ais.uni-bonn.de>:
> >> I named the variable, I think, and it is a bad name :-(
> >> Should we rename it?
> >>
> >> I think giving a count makes more sense than giving a frequency: you
> want to
> >> exclude outliers that appear only once or twice for example.
> > I actually hadn't seen this reply. It's not a bad name: it's a minimum
> > for document frequency, df. And yes, absolute counts are more common
> > than relative frequencies; usually, you just set a cutoff to reduce
> > noisy features.
> Maybe I'm just not familiar enough with the NLP slang. Frequency sounds
> relative to me.


NLP folks pass the blame to IR folks :P
------------------------------------------------------------------------------
How ServiceNow helps IT people transform IT departments:
1. A cloud service to automate IT design, transition and operations
2. Dashboards that offer high-level views of enterprise services
3. A single system of record for all IT processes
http://p.sf.net/sfu/servicenow-d2d-j
_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to