If you run the Sequest search through the TPP GUI interface or via the
runsearch.exe wrapper on the command line, it would pack up and
compress the search results folder and delete the corresponding
directory (of dta/outs).  You obviously don't need to keep the folders
around if you have no use for those files.  My suggestion would be to
compress the .out files into a zip or tgz archive and then delete each
directory only because the searches are expensive and there is
esoteric bits of info in those files that aren't replicated in a
pep.xml.  If you do archive those files in a .tgz in a specific way,
the link to the out files in the PepXML Viewer can still display the
search results pulling the info directly from the compressed archive.
You don't need to include the .dta files in the compressed archive
because those spectra are easily recreated from the mzXML and the
spectral display tool will grab corresponding spectrum from the mzXML
file if present.

On Wed, Sep 16, 2009 at 1:32 PM, Kris <ktrunc...@gmail.com> wrote:
>
> Anyone know what I need the intermediate folders for?
>
> -Kris
>
> On Sep 10, 4:49 pm, Kris <ktrunc...@gmail.com> wrote:
>> Hi,
>>
>> I'm running a lot of database searches on a large number of mzXML
>> files. Because of this, I need to conserve space on my computer. My
>> understanding of the software is that "Database Search" generates afolderof 
>> out/dta files, which is converted to a PepXML file. The
>> PepXML file is then used in "Analyze Peptides" to generate probability
>> scores for matches.
>>
>> My question is: Is there any reason why I would want to save the
>> folders of out/dta files? It seems like the only thing these folders
>> are used for is generating PepXML files, and having the PepXML files
>> is all I need.
>>
>> I would like to delete the folders of out/dtas to save space on my
>> computer. But if there is some reason I should save them (something
>> they could be used for later on), please let me know.
>>
>> Also, I noticed that if you run a large number of mzXML files in
>> "Database search", the search usually freezes after it gets to about
>> 2.5 million lines of output (I believe). So, I've been running the
>> commands in command line. Please let me know, if you see any problem
>> with this.
>>
>> Thanks,
>> Kris
> >
>

--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"spctools-discuss" group.
To post to this group, send email to spctools-discuss@googlegroups.com
To unsubscribe from this group, send email to 
spctools-discuss+unsubscr...@googlegroups.com
For more options, visit this group at 
http://groups.google.com/group/spctools-discuss?hl=en
-~----------~----~----~----~------~----~------~--~---

Reply via email to