Re: [jira] Created: (NUTCH-103) Vivisimo like treeview and url redirect

Dawid Weiss Wed, 05 Oct 2005 00:48:21 -0700

I am planning to take a closer look to the carrot2 implementation and expose the other algorithms to the user,

That's actually quite simple -- I was planning to do it, but have no time at the moment. The current Carrot2 code in Nutch is a preconfigured process which uses the open source Lingo clustering algorithm to cluster documents. But in the Carrot2 codebase there is now a scriptable controller, so you could basically have external scripts configuring several different algorithms. It really isn't that difficult. If you need any help, let me know -- private e-mail or the newsgroup, whatever.

changes to the algorithm(s) so that speed wise be as good as vivisimo (not
only interface wise ;-)).

We don't know what the Vivisimo algorithm is really like in terms of speed. Its authors and co-founders are excellent researchers, so I guess it will be a tough beast to beat :) But of course we don't have any reason to be ashamed -- the open source version is quite decent. In the commercial version we refactored the codebase and added an optional native matrix computation library. The speedup is significant (which matters only if your servers are really under a lot of load).


Dawid
