Hello List,
I am trying to split the nt database from NCBI using the mpiformatdb
program. As a test case, I first tried nt.00.fasta (the nt db is split
into 4 smaller dbs, nt00...nt03).
I got the following warning for several sequences, and the program
fails to create the index at the end.
[formatdb] WARNING: Sequence number 1053529
(gi|3077651|emb|Z95929.1|SCZ95929), 19 illegal characters were removed:
4 Es, 1 F, 4 Is, 1 J, 2 Os, 7 Ps
When I used nt.03.fasta, everything was fine.
How do I fix this? Do I have to preprocess nt.00.fasta before feeding it
to mpiformatdb?
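
A preprocessing check of the kind asked about could look like the sketch
below. It is only an illustration, not part of mpiBLAST or formatdb: it
assumes the legal alphabet is the IUPAC nucleotide code set (the letters
reported in the warning, E, F, I, J, O and P, are not in that set), and
the names LEGAL and scan_fasta are made up for this example.

```python
import sys
from collections import Counter

# Assumption: treat the IUPAC nucleotide codes as the only legal characters.
# formatdb's own rules may differ in detail.
LEGAL = set("ACGTURYSWKMBDHVN")


def scan_fasta(lines):
    """Yield (header, Counter) for each FASTA record with illegal characters."""
    header = None
    bad = Counter()
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith(">"):
            # A new record starts: report the previous one if it had problems.
            if header is not None and bad:
                yield header, bad
            header = line[1:]
            bad = Counter()
        elif header is not None:
            # Count every non-whitespace character outside the legal set.
            bad.update(
                c for c in line.upper() if c not in LEGAL and not c.isspace()
            )
    # Report the final record as well.
    if header is not None and bad:
        yield header, bad


if __name__ == "__main__" and len(sys.argv) > 1:
    with open(sys.argv[1]) as fh:
        for header, bad in scan_fasta(fh):
            counts = ", ".join(f"{n} {c}" for c, n in sorted(bad.items()))
            print(f"{header}: {counts}")
```

Saved under a hypothetical name such as scan_fasta.py, it could be run as
"python scan_fasta.py nt.00.fasta" to list the records that would trigger
the warning, so they can be inspected before running mpiformatdb.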
Thanks.