Please note, this is a very large computation job.

1. Break the human genome up into about 400 pieces (10M bases/piece)
2. Break the 100,000 sequences up into about 1000 files (100 sequences/file)
3. Run each sequence file against each human genome piece:
     400*1000 = 400,000 individual blat operations at several minutes each
4. Several 10's of Gb of psl data output, pslReps filter down to several Gb of 
results.

You will need a super computer to perform this amount of computation.

--Hiram

Mitja Mitrovic wrote:
> Dear UCSC team,
> 
> I have a fasta file of approx. 100.000 sequences (up to 50 bp in length). I 
> would like to create a custom track with those sequences to see where they 
> fall 
> in the genome. Therefore I would need a ped file with bp position ect. So, my 
> question is how can i make a custom track from this fasta file? Thanks in 
> advance!
> 
> Mitja
_______________________________________________
Genome maillist  -  [email protected]
https://lists.soe.ucsc.edu/mailman/listinfo/genome

Reply via email to