Hello Deepak - this is a great suggestion !

I've been meaning to introduce 'real' support for taxonomic names in
Jalview for some time, but there always seemed to be something else that
took higher priority. However - we will definitely have support for this
in the next major Jalview release, and I'll do what I can to introduce
basic support for this even sooner.

For the moment, you could consider running this groovy script (via the
groovy console under the tools menu) - which will do a search and
replace to put anything in parenthesis at the beginning of the sequence
name:

/// groovy script for inserting taxon name at beginning of sequence ID
import jalview.datamodel.*;
import jalview.gui.AlignFrame;
import jalview.gui.AlignViewport;

// matches the species name in [parentheses]

def descpattern = ~/.*\[([^]]+)/;

def af = Jalview.getAlignframes();

// walk through  all alignments, processing all sequences

for (ala in af)
{
     def al = ala.viewport.alignment;
     if (al!=null)
     {
         SequenceI[] seqs = al.getSequencesArray();
         for (sq in seqs)
         {
             if (sq!=null) {
                if (sq.getDatasetSequence()!=null)
                {
                    // extract species from dataset sequence description
and place at beginning of ID string in alignment
                   def mtch =
sq.getDatasetSequence().getDescription().replaceAll(descpattern,'$1')
                    +"|"+sq.getDatasetSequence().getName();
                   sq.setName(mtch);
                }
             }
          }
      }
      ala.repaint()
}

/// end of script - open Tools->Groovy console, copy and paste into
panel and hit the run button!

It's a bit rough and ready, but should do what you want for the moment.

hope that helps!
Jim.


On 18/07/2014 18:17, Deepak Barnabas wrote:
> Dear Jim,
>
> Jalview is indispensable to the vast majority of structural biologists 
> including myself. Thanks for your good work with Jalview! We use it on an 
> everyday basis. I have a small suggestion. You probably already have this 
> option somewhere, and I am not looking hard enough. When visualising multiple 
> alignments,  it would be very helpful if the organism name is displayed 
> instead of sequence name. Considering that a FASTA format download of 
> multiple sequences following a BLAST from NCBI always results in the organism 
> name being present in the sequence description within parentheses, would it 
> not be easy (as an option) if Jalview extracted the string within these 
> parentheses and displayed them instead of sequence name. Currently, one has 
> to individually hover over each sequence name to display its description to 
> look for the organism. Let me know, if I'm missing out on something.
>
> Thanks!
> Deepak


The University of Dundee is a registered Scottish Charity, No: SC015096
_______________________________________________
Jalview-discuss mailing list
[email protected]
http://www.compbio.dundee.ac.uk/mailman/listinfo/jalview-discuss

Reply via email to