Hi all,

I'm certain this can be done, but I figure it would be much easier asking the news group than spending hours trawling though random web pages:

Does anyone know how to search protein sequence databases using a consensus sequence which allows varying gap sizes, and homologous amino acids. For example, say I have a consensus sequence DXXDN - can I scan databases which will pull out sequences containing, for example, EGGGGDQ?

A quick comment on the structure validation thread... Are bioscientists as critical of mistakes in protein and genome databases as the crystallography community seems to be of the PDB? Are mistakes in these databases more or less common that the PDB? Also, do they require you to submit your raw data (i.e., sequencing chromatograms, etc)? I feel that a simple improvement in the current validation tools which would flag unusual data submissions based on various parameters, and require another expert to go over just these structures to confirm them, would be sufficient.

Thanks,

AGS

_________________________________________________________________
Find a local pizza place, movie theater, and moreĀ….then map the best route! http://maps.live.com/default.aspx?v=2&ss=yp.bars~yp.pizza~yp.movie%20theater&cp=42.358996~-71.056691&style=r&lvl=13&tilt=-90&dir=0&alt=-1000&scene=950607&encType=1&FORM=MGAC01

Reply via email to