Nurullah Akkaya wrote:
I am using Derby in embedded mode.
I have a table of 100 million records; when I do a select, I get 600k to
1 million records back. The table structure is as follows:
Are you saying that a single select query may return 1 million records?
I am not surprised if that takes more than a minute.
stmt.executeUpdate("CREATE TABLE POSTINGLIST ("
+"WORDID INTEGER NOT NULL,"
+"DOCID INTEGER NOT NULL,"
+"ANCHORID INTEGER NOT NULL,"
+"DOCPOSITION SMALLINT NOT NULL,"
+"FLAG SMALLINT NOT NULL)");
stmt.executeUpdate("CREATE INDEX WORDID ON POSTINGLIST(WORDID)");
stmt.executeUpdate("CREATE INDEX DOCID ON POSTINGLIST(DOCID)");
stmt.executeUpdate("CREATE INDEX ANCHORID ON POSTINGLIST(ANCHORID)");
select docId , docPosition , anchorId, flag from postingList where
wordId = ?
99 percent of the time I select (the above query) based on the wordId. I
have implemented all the tuning tips in the manual, but it still takes a
long time (more than a minute). Disk I/O seems to be the bottleneck:
no swapping occurs, the CPU is idle during the select, and Derby uses the
index. What I want to do is keep the table sorted by wordId so that I can
avoid random reads and do a sequential read instead. Right now insert
performance is faster than I expected, so I can trade some write
performance for read performance. Is this possible? If this is not
possible out of the box, can you give me some tips on how I could
implement this in the source code?
Thanks for your time...
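One workaround worth considering (not mentioned in the thread itself): Derby does not support clustered tables, but a covering index that includes every column the query selects lets Derby answer the query from the index alone, and index entries are stored sorted by the leading column. A minimal sketch, assuming an embedded database named `postingDB` (the database name and class name here are illustrative):

```java
// Sketch: approximate a "table sorted by wordId" with a covering index.
// All columns of the frequent SELECT appear in the index, so Derby can
// perform an index-only scan -- entries for one WORDID are adjacent,
// avoiding random reads into the base table.
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.Statement;

public class CoveringIndexSketch {
    public static void main(String[] args) throws Exception {
        Connection conn = DriverManager.getConnection(
                "jdbc:derby:postingDB;create=true");
        Statement stmt = conn.createStatement();
        // Leading column WORDID keeps matching rows contiguous; the
        // remaining columns make the index cover the query entirely.
        stmt.executeUpdate("CREATE INDEX WORDID_COVERING ON POSTINGLIST"
                + "(WORDID, DOCID, DOCPOSITION, ANCHORID, FLAG)");
        stmt.close();
        conn.close();
    }
}
```

The trade-off matches what was asked for: inserts pay for maintaining a wider index, and reads avoid one random base-table probe per matching row.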
It is not quite clear to me what you are trying to achieve. Why do you
want a sequential read? Scanning the entire table of 100 million
records should take longer than looking up the records using an index
on wordid. Have you retrieved the query plan and made sure the index on
wordid is used? Or are you talking about doing a lookup of many
different wordids in sorted order?
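To check which plan Derby actually chose, runtime statistics can be enabled on the connection and dumped after running the query. A sketch, assuming the same embedded database (`postingDB` and the literal wordId value are placeholders):

```java
// Sketch: use Derby's runtime statistics to inspect the query plan.
// Look for "Index Scan ResultSet" on WORDID in the output, as opposed
// to "Table Scan ResultSet".
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.PreparedStatement;
import java.sql.ResultSet;
import java.sql.Statement;

public class QueryPlanSketch {
    public static void main(String[] args) throws Exception {
        Connection conn = DriverManager.getConnection("jdbc:derby:postingDB");
        Statement stmt = conn.createStatement();
        // Start collecting runtime statistics for this connection.
        stmt.execute("CALL SYSCS_UTIL.SYSCS_SET_RUNTIMESTATISTICS(1)");

        PreparedStatement ps = conn.prepareStatement(
                "SELECT docId, docPosition, anchorId, flag "
                + "FROM postingList WHERE wordId = ?");
        ps.setInt(1, 42); // example wordId
        ResultSet rs = ps.executeQuery();
        while (rs.next()) { /* drain the result set */ }
        rs.close();
        ps.close();

        // Retrieve the statistics (including the plan) for the
        // statement just executed.
        rs = stmt.executeQuery(
                "VALUES SYSCS_UTIL.SYSCS_GET_RUNTIMESTATISTICS()");
        while (rs.next()) {
            System.out.println(rs.getString(1));
        }
        conn.close();
    }
}
```

Alternatively, setting the property `derby.language.logQueryPlan=true` writes query plans to derby.log without code changes.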
--
Øystein