On 2 Oct 2007, at 18:54, Fernan Aguero wrote: > Hi, > > I've installed TrEMBL in EMBOSS and it seems like I'm having some > problems ... > > I've run dbiflat as follows: [snip] > > Now, when using seqret, it seems like I'm not getting the > records I expect, for example if I search for the first ID > in the example above (A0B532), I get A0BDZ0 instead:
I suspect your problem is that your trembl file is >2Gb in size. Above this size dbiflat won't work properly and will give wacky results such as the ones you've shown. This won't be a problem with uniprot_sprot.dat as this is still only about 1.1Gb. Your choices are therefore: 1) You could split your trembl file into multiple files, each smaller than 2Gb. This ends up being a complete pain, and you probably don't want to do it this way. 2) Use the newer dbx* family of indexing programs which can cope with larger file sizes. In your case you'd use dbxflat instead of dbiflat. There are some configuration differences between the two so you should read 'tfm dbxflat' first, but they work pretty much the same as the old versions. We use the dbx programs for all of our databases and they work fine. Hope this helps Simon. _______________________________________________ EMBOSS mailing list [email protected] http://lists.open-bio.org/mailman/listinfo/emboss
