What's the general consensus (if one exists) about what goes in core vs. util?
Over on MAHOUT-242 there is some discussion about where to put the n-gram / LLR collocation utilities. Since I'm relatively new here, I don't feel I can argue for one place or the other without understanding the purpose of the different modules.

In some ways I can see 242 being a utility, used for preparing language models or something similar, upon which core algorithms depend. On the other hand, I could see Mahout including a suite of NLP algorithms in core, where 242 is simply a starting point.

Drew