Re: [Biojava-l] RefSeq bioJava parser problem

2002-05-14 Thread Matthew Pocock
[EMAIL PROTECTED] wrote: > Hi, > > Appreciate the responses to the refSeq question. We've been able to put togther > a reliable parser using the example in TestRefSeqPrt. Great! Would you be interested in contributing this to BioJava once you are happy with it? > > Have an additional question

Re: [Biojava-l] RefSeq bioJava parser problem

2002-05-14 Thread Matthew Pocock
Hi. Anyone that write parsers should take a look at the org.biojava.bio.program.tagvalue package. It has cleaner support for this kind of parsing problem. We should be able to refactor sequenceIO to re-use a lot of this API, giving a much more modular framework for handeling nasty things like

RE: [Biojava-l] RefSeq bioJava parser problem

2002-05-14 Thread Cox, Greg
Unfortunately, not. This is probably the weakest point in BioJava's parsing right now. As you may have noticed, there's a more serious problem with the reference information. If a reference doesn't contain a field that others do, nothing is added under that key, causing them to get out of syn