Hello Andigoni,
To confirm, the data is being extracted using "Get Data -> UCSC Main"
using the Table browser for the hg16 reference genome?
If so, I can let you know that the reference genome itself remains the
same through time. However, annotation tracks may or may not be updated.
Genbank track update frequencies are incremental daily additions and
add/remove/change at full releases.
From your numbers, it seem like the data is coming from RefSeq Genes.
This track was last updated on "2012-04-07". This can be seem at UCSC
when browsing the track's description page, the last line of the top
section: "Data last updated:". The dates for the corresponding files on
their downloads server will also reflect updates (server updated weekly).
Unfortunately, there isn't a straightforward way to get back to your
prior data unless you have it saved. If you have the exact date of your
original data pull, you may be able to reconstruct part of it by mining
GenBank directly, but this would be non-trivial and involve your own
methods/tools outside of Galaxy.
Hopefully this helps to explain the data content. If you have comments
or questions about how tracks are processed at UCSC (including this
question, they may have a creative solution), their team can be reached
at: http://genome.ucsc.edu/contacts.html
Going forward, please use our mailing list for questions.
Thank you!
Jen
Galaxy team
> ---------- Forwarded message ----------
> From: *andigoni* <andig...@med.auth.gr <mailto:andig...@med.auth.gr>>
> Date: Sun, Apr 8, 2012 at 7:06 PM
> Subject: Re: [galaxy-user] inconsistency with query results at
different time periods
>
> Dear Galaxy Gury,
>
> I would like to ask you a simple question about the human genome
> assembly updates. Few months ago I extracted all the introns in bed
> format of chromosomes 6,20 and 22 using the hg16 assembly. The total
> number of records was 32.050.
>
> Could you please explain why today the same query results in 32.837
> introns? hg16 isn't a stable release? And how can I go back to the
> 32.050 introns in order to process them further?
>
>
> Thank you very much in advance,
> Andigoni
>
>
> Andigoni Malousi
> Post-doc in Bioinformatics
> Aristotle University of Thessaloniki
___________________________________________________________
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org. Please keep all replies on the list by
using "reply all" in your mail client. For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:
http://lists.bx.psu.edu/listinfo/galaxy-dev
To manage your subscriptions to this and other Galaxy lists,
please use the interface at:
http://lists.bx.psu.edu/