Hi!

I'm looking for an efficient way to store a large document-term matrix.
The matrix I'm looking at has about 100,000 documents and probably
1,000 terms.

Which representation of this matrix would be the most efficient to work
with?
Loading the whole thing into memory at once is not feasible, so I
wonder if any of you has an idea, a link, or an algorithm you would
be willing to share...
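
For a sense of scale, here is a rough sizing sketch in Python (not the
VB.NET mentioned in the PS). The 4-byte cell width, the 4-byte term index,
and the 1% fill rate are illustrative assumptions, not figures from the
actual data. It compares a full dense array against storing only
(term index, count) pairs for the non-zero cells of each document.

```python
# Rough sizing sketch; cell width and fill rate are assumptions, not measured values.
N_DOCS = 100_000         # from the post
N_TERMS = 1_000          # from the post
BYTES_PER_COUNT = 4      # assumed: 32-bit integer counts
BYTES_PER_INDEX = 4      # assumed: 32-bit term index per stored entry
ASSUMED_DENSITY = 0.01   # hypothetical: 1% of cells are non-zero


def dense_bytes(n_docs, n_terms, cell_bytes):
    """Size of a full n_docs x n_terms array with one cell per (doc, term)."""
    return n_docs * n_terms * cell_bytes


def sparse_bytes(n_docs, n_terms, density, cell_bytes, index_bytes):
    """Size when only (term index, count) pairs of non-zero cells are stored."""
    nonzero = round(n_docs * n_terms * density)
    return nonzero * (cell_bytes + index_bytes)


dense = dense_bytes(N_DOCS, N_TERMS, BYTES_PER_COUNT)
sparse = sparse_bytes(N_DOCS, N_TERMS, ASSUMED_DENSITY,
                      BYTES_PER_COUNT, BYTES_PER_INDEX)

print(f"dense:  {dense / 1e6:.0f} MB")
print(f"sparse: {sparse / 1e6:.0f} MB")
```

Under these assumptions the dense layout needs about 400 MB, while the
pair-based layout needs about 8 MB; the real ratio depends entirely on
how many terms each document actually contains.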

Any help and/or idea is greatly appreciated!

Regards,
Dirk

PS: Don't know if this helps, but the language of choice for the
implementation would be VB.NET for now.

From: "Dirk" <[EMAIL PROTECTED]>
To: "Algorithm Geeks" <[email protected]>
Subject: Effictive storage of large document-term-matrix?
Date: Sun, 21 May 2006 04:23:39 -0700

