Right. Unfortunately, all the other fields, although well structured and nicely formatted in the example I sent, may be absent in the general case, or present but poorly formatted because of legacy issues. The patent number may also be missing from records for patent applications that are still pending review.
Many thanks,
Daniel

-----Original Message-----
From: Peter Rice [mailto:ricepet...@yahoo.co.uk]
Sent: Wednesday, September 19, 2012 11:35 AM
To: Rozenbaum, Daniel (Biocceleration Inc)
Cc: emboss@lists.open-bio.org
Subject: Re: [EMBOSS] Support for multi-line annotation in ig format

On 19/09/2012 16:23, Rozenbaum, Daniel (Biocceleration Inc) wrote:
> Dear Peter,
>
> At least within the context of USPTO the sequence identifier is the only
> consistently present piece of information that uniquely identifies the
> sequence. Does the absence of an accession number field make the task of
> adding support for this in EMBOSS more complex?

No, it is not a problem. You only need to tell the database definition it has no accession (but perhaps the patent number could be used as an accession)

regards,

Peter Rice
EMBOSS Team
_______________________________________________
EMBOSS mailing list
EMBOSS@lists.open-bio.org
http://lists.open-bio.org/mailman/listinfo/emboss