[spctools-discuss] Warning message during the peptideprophet analysis

Ankit Balhara Thu, 14 Feb 2019 10:28:54 -0800

Hi,

I am using peptideprophet for the statistical validation of X!Tandem 
results. I am using the options: Use accurate mass binning, using ppm, Use 
Hydrophobicity / RT information, Use decoy hits to pin down the negative 
distribution. Decoy protein names begin with'rev_', Use Non-parametric 
model (can only be used with decoy option) and Report decoy hits with a 
computed probability (based on the model learned). I have acquired the data 
on AB Sciex system but I used msconvert for conversion of .wiff files to 
mzXML format.
But I am getting the following error message:


Found 676 Decoys, and 834 Non-Decoys
Iterations: .........10.........20.....Estimating Retention Time Model ... 
please wait ... WARNING: Not enough IDs in run index c:/TPP/data/liver_DDA_1 to 
generate RT Gradient Correction.WARNING: Not enough high probability IDs in run 
index c:/TPP/data/liver_DDA_1 to generate RT model. RT Model has been disabled.
Run Index: c:/TPP/data/liver_DDA_1, slope=nan, intercept=nan, r_sq=nanWARNING: 
Not enough IDs in run index c:/TPP/data/liver_DDA_2 to generate RT Gradient 
Correction.WARNING: Not enough high probability IDs in run index 
c:/TPP/data/liver_DDA_2 to generate RT model. RT Model has been disabled.
Run Index: c:/TPP/data/liver_DDA_2, slope=nan, intercept=nan, r_sq=nanWARNING: 
Not enough IDs in run index c:/TPP/data/liver_DDA_3 to generate RT Gradient 
Correction.WARNING: Not enough high probability IDs in run index 
c:/TPP/data/liver_DDA_3 to generate RT model. RT Model has been disabled.
Run Index: c:/TPP/data/liver_DDA_3, slope=nan, intercept=nan, r_sq=nanWARNING: 
Not enough high probability IDs in run index c:/TPP/data/liver_DDA_1 to 
generate RT model. RT Model has been disabled.
Run Index: c:/TPP/data/liver_DDA_1, slope=nan, intercept=nan, r_sq=nanWARNING: 
Not enough high probability IDs in run index c:/TPP/data/liver_DDA_2 to 
generate RT model. RT Model has been disabled.
Run Index: c:/TPP/data/liver_DDA_2, slope=nan, intercept=nan, r_sq=nanWARNING: 
Not enough high probability IDs in run index c:/TPP/data/liver_DDA_3 to 
generate RT model. RT Model has been disabled.
Run Index: c:/TPP/data/liver_DDA_3, slope=nan, intercept=nan, r_sq=nan

WARNING: Mixture model quality test failed for charge (1+).WARNING: Mixture 
model quality test failed for charge (2+).WARNING: Mixture model quality test 
failed for charge (3+).WARNING: Mixture model quality test failed for charge 
(4+).WARNING: Mixture model quality test failed for charge (5+).WARNING: 
Mixture model quality test failed for charge (6+).WARNING: Mixture model 
quality test failed for charge (7+).
model complete after 26 iterations
command completed in 4 sec 


Could anyone help me out to resolve this problem?

I am also attaching the tandem parameter file also. Also the X!Tandem 
results showed that the valid models for first data = 0, while 2 and 4 
models for two other data.

-- 
You received this message because you are subscribed to the Google Groups 
"spctools-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/spctools-discuss.
For more options, visit https://groups.google.com/d/optout.

<?xml version="1.0" encoding="UTF-8"?>

<bioml>
 
<note> DEFAULT PARAMETERS. The value of "isb_default_input_kscore.xml" is 
recommended. Change to "isb_default_input_native.xml" for native X!Tandem 
scoring.</note> 
        <note type="input" label="list path, default 
parameters">C:/TPP/data/params/isb_default_input_kscore.xml</note>

<note> FILE LOCATIONS. Replace them with your input (.mzXML) file and output 
file -- these are REQUIRED. Optionally a log file and a sequence output file of 
all protein sequences identified in the first-pass can be specified. Use of 
FULL path (not relative) paths is recommended. </note>
        <note type="input" label="spectrum, path">full_mzXML_filepath</note>
        <note type="input" label="output, path">full_tandem_output_path</note>
        <note type="input" label="output, log path"></note>
        <note type="input" label="output, sequence path"></note>

<note> TAXONOMY FILE. This is a file containing references to the sequence 
databases. Point it to your own taxonomy.xml if needed.</note>
        <note type="input" label="list path, taxonomy 
information">C:/TPP/data/params/taxonomy.xml</note>

<note> PROTEIN SEQUENCE DATABASE. This refers to identifiers in the 
taxomony.xml, not the .fasta files themselves! Make sure the database you want 
is present as an entry in the taxonomy.xml referenced above. This is REQUIRED. 
</note>
        <note type="input" label="protein, taxon">Human</note>
 
<note> PRECURSOR MASS TOLERANCES. In the example below, a 0 Da to 20.0 ppm 
(monoisotopic mass) window is searched for peptide candidates. Since this is 
monoisotopic mass, so for non-accurate-mass instruments, for which the 
precursor is often taken nearer to the isotopically averaged mass, an 
asymmetric tolerance (-2.0 Da to 4.0 Da) is preferable. This somewhat imitates 
a (-3.0 Da to 3.0 Da) window for averaged mass (but not exactly)</note>
        <note type="input" label="spectrum, parent monoisotopic mass 
error">0.0</note>
        <note type="input" label="spectrum, parent monoisotopic mass error 
plus">20.0</note>
        <note type="input" label="spectrum, parent monoisotopic mass error 
units">ppm</note>
                <note>The value for this parameter may be 'Daltons' or 'ppm': 
all other values are ignored</note>
        <note type="input" label="spectrum, parent monoisotopic mass isotope 
error">no</note>
                <note>This allows peptide candidates in windows around -1 Da 
and -2 Da from the acquired mass to be considered. Only applicable when the 
minus/plus window above is set to less than 0.5 Da. Good for accurate-mass 
instruments for which the reported precursor mass is not corrected to the 
monoisotopic mass. </note>


<note> MODIFICATIONS. In the example below, there is a static (carbamidomethyl) 
modification on C, and variable modifications on M (oxidation). Multiple 
modifications can be separated by commas, as in "80.0@S,80.0@T". Peptide 
terminal modifications can be specified with the symbol '[' for N-terminus and 
']' for C-terminus, such as 42.0@[ .  </note>
        
        <note type="input" label="residue, modification mass">57.021464@C</note>
        <note type="input" label="residue, potential modification 
mass">15.994915@M</note>
        <note type="input" label="residue, potential modification motif"></note>
                <note> You can specify a variable modification only when 
present in a motif. For instance, 0.998@N!{P}[ST] is a deamidation modification 
on N only if it is present in an N[any but P][S or T] motif (N-glycosite). 
</note>
        <note type="input" label="protein, N-terminal residue modification 
mass"></note>
        <note type="input" label="protein, C-terminal residue modification 
mass"></note>
                <note> These are *static* modifications on the PROTEINS' N or 
C-termini. </note>

<note> SEMI-TRYPTICS AND MISSED CLEAVAGES. In the example below, semitryptic 
peptides are allowed, and up to 2 missed cleavages are allowed. </note>
        <note type="input" label="protein, cleavage semi">no</note>
        <note type="input" label="scoring, maximum missed cleavage 
sites">2</note>

<note> REFINEMENT. Do not use unless you know what you are doing. Set "refine" 
to "yes" and specify what you want to search in the refinement. For 
non-confusing results, repeat the same modifications you set above for the 
first-pass here.</note>
        <note type="input" label="refine">no</note>
        <note type="input" label="refine, maximum valid expectation 
value">0.1</note>
        <note type="input" label="refine, modification mass">57.012@C</note>
        <note type="input" label="refine, potential modification 
mass">15.994915@M</note>
        <note type="input" label="refine, potential modification motif"></note>
        <note type="input" label="refine, cleavage semi">yes</note>
        <note type="input" label="refine, unanticipated cleavage">no</note>
        <note type="input" label="refine, potential N-terminus 
modifications"></note>
        <note type="input" label="refine, potential C-terminus 
modifications"></note>
        <note type="input" label="refine, point mutations">no</note>
        <note type="input" label="refine, use potential modifications for full 
refinement">no</note>

        


</bioml>

[spctools-discuss] Warning message during the peptideprophet analysis

Reply via email to