Joos,

You are right. I should use a local database instead of SD files...

Lochana

________________________________
 From: Joos Kiener <j...@sunrise.ch>
To: lochana menikarachchi <locha...@yahoo.com> 
Cc: "cdk-user@lists.sourceforge.net" <cdk-user@lists.sourceforge.net> 
Sent: Thursday, September 19, 2013 9:37 AM
Subject: Re: [Cdk-user] Reading large SD Files
 


I played a round a bit and came up with a crude solution as I mentioned in my 
initial response.

index all occurrences of "$$$$" -> takes 3-4 seconds for a file with 131'000 
records

use separate thread to index to increase performance but current implementation 
requires that index is fully built. This is an issue as you need to have 2 
access mechanisms, index based and not-index based.


Use BufferedReader to go to the indexed line, eg 



for (int i = 0; i < linesToRead; i++) {
                bufferedReader.readLine();
            }


yeah, not ideal but it actually is faster than I expected.


add caching to it. 



But a question remains:

What is your actual goal? Why can't you use Marvin, for commercial use?

1 million is a lot. Using a real database comes to mind. 





2013/9/19 lochana menikarachchi <locha...@yahoo.com>

Hi Nina,
>
>
>I did try the RandomAccessSDFReader. It took few minutes to build the index 
>for an SD file with 50,000 structures. What I am saying is what ever 
>MarvinView does to build index (if it is using an index) is much faster. I am 
>wondering how it does that.
>
>Lochana
>------------------------------------------------------------------------------
>LIMITED TIME SALE - Full Year of Microsoft Training For Just $49.99!
>1,500+ hours of tutorials including VisualStudio 2012, Windows 8, SharePoint
>2013, SQL 2012, MVC 4, more. BEST VALUE: New Multi-Library Power Pack includes
>Mobile, Cloud, Java, and UX Design. Lowest price ever! Ends 9/20/13.
>http://pubads.g.doubleclick.net/gampad/clk?id=58041151&iu=/4140/ostg.clktrk
>_______________________________________________
>Cdk-user mailing list
>Cdk-user@lists.sourceforge.net
>https://lists.sourceforge.net/lists/listinfo/cdk-user
>
>
------------------------------------------------------------------------------
LIMITED TIME SALE - Full Year of Microsoft Training For Just $49.99!
1,500+ hours of tutorials including VisualStudio 2012, Windows 8, SharePoint
2013, SQL 2012, MVC 4, more. BEST VALUE: New Multi-Library Power Pack includes
Mobile, Cloud, Java, and UX Design. Lowest price ever! Ends 9/20/13. 
http://pubads.g.doubleclick.net/gampad/clk?id=58041151&iu=/4140/ostg.clktrk
_______________________________________________
Cdk-user mailing list
Cdk-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/cdk-user

Reply via email to