Definition: | The upper-most hierarchy level of mzIdentML with sub-containers for example describing software, protocols and search results (spectrum identifications or protein detection results). | ||||||||||||||||||||||||||||||||||||||||||||
Type: | MzIdentMLType | ||||||||||||||||||||||||||||||||||||||||||||
Attributes: |
|
||||||||||||||||||||||||||||||||||||||||||||
Subelements: |
|
||||||||||||||||||||||||||||||||||||||||||||
Graphical Context: | ![]() |
||||||||||||||||||||||||||||||||||||||||||||
Example Context: | <MzIdentML id="1234" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://psidev.info/psi/pi/mzIdentML/1.2 mzIdentML1.2.0.xsd" xmlns="http://psidev.info/psi/pi/mzIdentML/1.2" version="1.2.0" creationDate="2016-06-22T19:35:50"> <cvList> <cv id="PSI-MS" uri="https://raw.githubusercontent.com/HUPO-PSI/psi-ms-CV/master/psi-ms.obo" fullName="PSI-MS"/> <cv id="UNIMOD" uri="http://www.unimod.org/obo/unimod.obo" fullName="UNIMOD"/> <cv id="UO" uri="https://raw.githubusercontent.com/bio-ontology-research-group/unit-ontology/master/unit.obo" fullName="UNIT-ONTOLOGY"/> <cv id="PRIDE" uri="https://github.com/PRIDE-Utilities/pride-ontology/blob/master/pride_cv.obo" fullName="PRIDE"/> </cvList> ... </MzIdentML> |
Definition: | The search parameters other than the modifications searched. | ||||||||||||
Type: | ParamListType | ||||||||||||
Attributes: | none |
||||||||||||
Subelements: |
|
||||||||||||
Example Context: | <AdditionalSearchParams> <cvParam cvRef="PSI-MS" accession="MS:1001211" name="parent mass type mono"></cvParam> <cvParam cvRef="PSI-MS" accession="MS:1001256" name="fragment mass type mono"></cvParam> <cvParam cvRef="PSI-MS" accession="MS:1002490" name="peptide-level scoring"></cvParam> <cvParam cvRef="PSI-MS" accession="MS:1002496" name="group PSMs by sequence"></cvParam> </AdditionalSearchParams> |
||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/AdditionalSearchParams MAY supply a *child* term of MS:1001302 (search engine specific input parameter) one or more times e.g.: MS:1001005 (SEQUEST:CleavesAt) WARNING: Term has no definition! e.g.: MS:1001007 (SEQUEST:OutputLines) e.g.: MS:1001009 (SEQUEST:DescriptionLines) e.g.: MS:1001026 (SEQUEST:NormalizeXCorrValues) WARNING: Term has no definition! e.g.: MS:1001028 (SEQUEST:SequenceHeaderFilter) e.g.: MS:1001032 (SEQUEST:SequencePartialFilter) WARNING: Term has no definition! e.g.: MS:1001037 (SEQUEST:ShowFragmentIons) e.g.: MS:1001038 (SEQUEST:Consensus) e.g.: MS:1001042 (SEQUEST:LimitTo) e.g.: MS:1001046 (SEQUEST:sort by dCn) et al. MAY supply term MS:1002635 (proteogenomics search) only once MAY supply a *child* term of MS:1000044 (dissociation method) one or more times e.g.: MS:1000134 (plasma desorption) e.g.: MS:1000135 (post-source decay) e.g.: MS:1000136 (surface-induced dissociation) e.g.: MS:1000242 (blackbody infrared radiative dissociation) e.g.: MS:1000250 (electron capture dissociation) e.g.: MS:1000262 (infrared multiphoton dissociation) e.g.: MS:1000282 (sustained off-resonance irradiation) e.g.: MS:1000433 (low-energy collision-induced dissociation) e.g.: MS:1000435 (photodissociation) e.g.: MS:1000598 (electron transfer dissociation) et al. MAY supply a *child* term of MS:1002473 (ion series considered in search) one or more times e.g.: MS:1001108 (param: a ion) e.g.: MS:1001118 (param: b ion) e.g.: MS:1001119 (param: c ion) e.g.: MS:1001257 (param: v ion) e.g.: MS:1001258 (param: d ion) e.g.: MS:1001259 (param: immonium ion) e.g.: MS:1001260 (param: w ion) e.g.: MS:1001261 (param: x ion) e.g.: MS:1001262 (param: y ion) e.g.: MS:1001263 (param: z ion) et al. MAY supply a *child* term of MS:1002489 (special processing) one or more times e.g.: MS:1002490 (peptide-level scoring) e.g.: MS:1002491 (modification localization scoring) e.g.: MS:1002492 (consensus scoring) e.g.: MS:1002493 (sample pre-fractionation) e.g.: MS:1002494 (cross-linking search) e.g.: MS:1002495 (no special processing) e.g.: MS:1002635 (proteogenomics search) MAY supply term MS:1002490 (peptide-level scoring) only once MAY supply a *child* term of MS:1001066 (ions series considered in search) one or more times MAY supply term MS:1002494 (cross-linking search) only once MAY supply term MS:1002491 (modification localization scoring) only once MAY supply a *child* term of MS:1002658 (identification parameter) only once e.g.: MS:1002496 (group PSMs by sequence) e.g.: MS:1002497 (group PSMs by sequence with modifications) e.g.: MS:1002498 (group PSMs by sequence with modifications and charge) MAY supply a *child* term of MS:1001210 (mass type settings) one or more times e.g.: MS:1001211 (parent mass type mono) e.g.: MS:1001212 (parent mass type average) e.g.: MS:1001255 (fragment mass type average) e.g.: MS:1001256 (fragment mass type mono) |
||||||||||||
Example cvParams: | <cvParam cvRef="PSI-MS" accession="MS:1001211" name="parent mass type mono"></cvParam> <cvParam cvRef="PSI-MS" accession="MS:1001256" name="fragment mass type mono"></cvParam> <cvParam cvRef="PSI-MS" accession="MS:1002490" name="peptide-level scoring"></cvParam> <cvParam cvRef="PSI-MS" accession="MS:1002496" name="group PSMs by sequence"></cvParam> |
Definition: | The organization a person belongs to. | ||||||||
Type: | AffiliationType | ||||||||
Attributes: |
|
||||||||
Subelements: | none |
||||||||
Example Context: | <Affiliation organization_ref="ORG_DOC_OWNER"></Affiliation> |
Definition: | Ambiguous residues e.g. X can be specified by the Code attribute and a set of parameters for example giving the different masses that will be used in the search. | ||||||||||||
Type: | AmbiguousResidueType | ||||||||||||
Attributes: |
|
||||||||||||
Subelements: |
|
||||||||||||
Example Context: | |||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/MassTable/AmbiguousResidue MAY supply a *child* term of MS:1001359 (ambiguous residues) one or more times e.g.: MS:1001360 (alternate single letter codes) e.g.: MS:1001361 (alternate mass) |
Definition: | The analyses performed to get the results, which map the input and output data sets. Analyses are for example: SpectrumIdentification (resulting in peptides) or ProteinDetection (assemble proteins from peptides). | ||||||||||||
Type: | AnalysisCollectionType | ||||||||||||
Attributes: | none |
||||||||||||
Subelements: |
|
||||||||||||
Example Context: | <AnalysisCollection xmlns="http://psidev.info/psi/pi/mzIdentML/1.2"> <SpectrumIdentification spectrumIdentificationProtocol_ref="SearchProtocol_1" spectrumIdentificationList_ref="SII_LIST_1" id="SpecIdent_1"> <InputSpectra spectraData_ref="SD_COMBINED_SE_0"></InputSpectra> <SearchDatabaseRef searchDatabase_ref="SearchDB_1"></SearchDatabaseRef> </SpectrumIdentification> </AnalysisCollection> |
Definition: | Data sets generated by the analyses, including peptide and protein lists. | ||||||||||||
Type: | AnalysisDataType | ||||||||||||
Attributes: | none |
||||||||||||
Subelements: |
|
||||||||||||
Example Context: | <AnalysisData> <SpectrumIdentificationList xmlns="http://psidev.info/psi/pi/mzIdentML/1.2" id="SII_LIST_1"> <SpectrumIdentificationResult spectrumID="index=6451" spectraData_ref="SD_COMBINED_SE_0" id="SIR_8947"> <SpectrumIdentificationItem chargeState="2" experimentalMassToCharge="679.817322" calculatedMassToCharge="679.818488" peptide_ref="AVMDDFAAFVEK_##Oxidation(M):3" rank="1" passThreshold="false" id="SIR_8947_SII_1"> <PeptideEvidenceRef peptideEvidence_ref="AVMDDFAAFVEK_generic|A_ENSP00000401820|p:known_378_389"></PeptideEvidenceRef> <PeptideEvidenceRef peptideEvidence_ref="AVMDDFAAFVEK_generic|A_ENSP00000421027|p:putative_420_431"></PeptideEvidenceRef> <PeptideEvidenceRef peptideEvidence_ref="AVMDDFAAFVEK_generic|A_ENSP00000483421|p:known_357_368"></PeptideEvidenceRef> ... </AnalysisData> |
Definition: | The parameters and settings for the protein detection given as CV terms. | ||||||||||||
Type: | ParamListType | ||||||||||||
Attributes: | none |
||||||||||||
Subelements: |
|
||||||||||||
Example Context: | |||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/AnalysisProtocolCollection/ProteinDetectionProtocol/AnalysisParams MAY supply a *child* term of MS:1001302 (search engine specific input parameter) one or more times e.g.: MS:1001005 (SEQUEST:CleavesAt) WARNING: Term has no definition! e.g.: MS:1001007 (SEQUEST:OutputLines) e.g.: MS:1001009 (SEQUEST:DescriptionLines) e.g.: MS:1001026 (SEQUEST:NormalizeXCorrValues) WARNING: Term has no definition! e.g.: MS:1001028 (SEQUEST:SequenceHeaderFilter) e.g.: MS:1001032 (SEQUEST:SequencePartialFilter) WARNING: Term has no definition! e.g.: MS:1001037 (SEQUEST:ShowFragmentIons) e.g.: MS:1001038 (SEQUEST:Consensus) e.g.: MS:1001042 (SEQUEST:LimitTo) e.g.: MS:1001046 (SEQUEST:sort by dCn) et al. |
Definition: | The collection of protocols which include the parameters and settings of the performed analyses. | ||||||||||||
Type: | AnalysisProtocolCollectionType | ||||||||||||
Attributes: | none |
||||||||||||
Subelements: |
|
||||||||||||
Example Context: | <AnalysisProtocolCollection xmlns="http://psidev.info/psi/pi/mzIdentML/1.2"> <SpectrumIdentificationProtocol analysisSoftware_ref="ID_software" id="SearchProtocol_1"> <SearchType> <cvParam cvRef="PSI-MS" accession="MS:1001083" name="ms-ms search"></cvParam> </SearchType> <AdditionalSearchParams> <cvParam cvRef="PSI-MS" accession="MS:1001211" name="parent mass type mono"></cvParam> ... </AnalysisProtocolCollection> |
Definition: | The samples analysed can optionally be recorded using CV terms for descriptions. If a composite sample has been analysed, the subsample association can be used to build a hierarchical description. | ||||||||
Type: | AnalysisSampleCollectionType | ||||||||
Attributes: | none |
||||||||
Subelements: |
|
||||||||
Example Context: |
Definition: | The software used for performing the analyses. | ||||||||||||||||||||
Type: | AnalysisSoftwareType | ||||||||||||||||||||
Attributes: |
|
||||||||||||||||||||
Subelements: |
|
||||||||||||||||||||
Example Context: | <AnalysisSoftware id="FalseDiscoveryRateGlobal_2015-02-05 15-38-39" name="FalseDiscoveryRateGlobal_2015-02-05 15-38-39"> <SoftwareName> <cvParam cvRef="PSI-MS" accession="MS:1002237" name="mzidLib"></cvParam> </SoftwareName> </AnalysisSoftware> |
Definition: | The software packages used to perform the analyses. | ||||||||
Type: | AnalysisSoftwareListType | ||||||||
Attributes: | none |
||||||||
Subelements: |
|
||||||||
Example Context: | <AnalysisSoftwareList xmlns="http://psidev.info/psi/pi/mzIdentML/1.2"> <AnalysisSoftware version="" id="ID_software" name="OMSSA"> <SoftwareName> <cvParam cvRef="PSI-MS" accession="MS:1001475" name="OMSSA"></cvParam> </SoftwareName> </AnalysisSoftware> <AnalysisSoftware id="Omssa2mzid_2014-12-08 16-58-29" name="Omssa2mzid_2014-12-08 16-58-29"> ... </AnalysisSoftwareList> |
Definition: | The complete set of Contacts (people and organisations) for this file. | ||||||||||||
Type: | AuditCollectionType | ||||||||||||
Attributes: | none |
||||||||||||
Subelements: |
|
||||||||||||
Example Context: | <AuditCollection xmlns="http://psidev.info/psi/pi/mzIdentML/1.2"> <Person lastName="secondName" firstName="firstname" id="PERSON_DOC_OWNER"> <cvParam cvRef="PSI-MS" accession="MS:1000587" name="contact address" value="address"></cvParam> <Affiliation organization_ref="ORG_DOC_OWNER"></Affiliation> </Person> <Organization id="ORG_DOC_OWNER" name="myworkplace"> <cvParam cvRef="PSI-MS" accession="MS:1000586" name="contact name" value="address"></cvParam> ... </AuditCollection> |
Definition: | Any bibliographic references associated with the file | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Type: | BibliographicReferenceType | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Attributes: |
|
||||||||||||||||||||||||||||||||||||||||||||||||||||
Subelements: | none |
||||||||||||||||||||||||||||||||||||||||||||||||||||
Example Context: |
Definition: | Depending on context: 1: The contact details of the organisation or person that produced the software 2: Contact details for the Material. The association to ContactRole could specify, for example, the creator or provider of the Material. 3: The Contact that provided the document instance. |
||||||||
Type: | ContactRoleType | ||||||||
Attributes: |
|
||||||||
Subelements: |
|
||||||||
Example Context: | <ContactRole contact_ref="PERSON_DOC_OWNER"> <Role> <cvParam cvRef="PSI-MS" accession="MS:1001271" name="researcher"></cvParam> </Role> </ContactRole> |
Definition: | Any customizations to the software, such as alternative scoring mechanisms implemented, should be documented here as free text. |
Type: | xsd:string |
Attributes: | none |
Subelements: | none |
Example Context: |
Definition: | A source controlled vocabulary from which cvParams will be obtained. | ||||||||||||||||||||
Type: | cvType | ||||||||||||||||||||
Attributes: |
|
||||||||||||||||||||
Subelements: | none |
||||||||||||||||||||
Example Context: | <cv id="UO" uri="https://raw.githubusercontent.com/bio-ontology-research-group/unit-ontology/master/unit.obo" fullName="UNIT-ONTOLOGY"/> |
Definition: | The list of controlled vocabularies used in the file. | ||||||||
Type: | CVListType | ||||||||
Attributes: | none |
||||||||
Subelements: |
|
||||||||
Example Context: | <cvList> <cv id="PSI-MS" uri="https://raw.githubusercontent.com/HUPO-PSI/psi-ms-CV/master/psi-ms.obo" fullName="PSI-MS"/> <cv id="UNIMOD" uri="http://www.unimod.org/obo/unimod.obo" fullName="UNIMOD"/> <cv id="UO" uri="https://raw.githubusercontent.com/bio-ontology-research-group/unit-ontology/master/unit.obo" fullName="UNIT-ONTOLOGY"/> <cv id="PRIDE" uri="https://github.com/PRIDE-Utilities/pride-ontology/blob/master/pride_cv.obo" fullName="PRIDE"/> </cvList> |
Definition: | A single entry from an ontology or a controlled vocabulary. | ||||||||||||||||||||||||||||||||
Type: | CVParamType | ||||||||||||||||||||||||||||||||
Attributes: |
|
||||||||||||||||||||||||||||||||
Subelements: | none |
||||||||||||||||||||||||||||||||
Example Context: | <cvParam cvRef="PSI-MS" accession="MS:1002520" name="peptide group ID" value="CCPQCCSSGCSQNLCGPLCVTTPYYCTR_##Carbamidomethyl(C):1##Carbamidomethyl(C):2##Carbamidomethyl(C):5##Carbamidomethyl(C):6##Carbamidomethyl(C):10##Carbamidomethyl(C):15##Carbamidomethyl(C):19##Carbamidomethyl(C):26"></cvParam> |
Definition: | The specification of filters applied to the database searched. | ||||||||
Type: | DatabaseFiltersType | ||||||||
Attributes: | none |
||||||||
Subelements: |
|
||||||||
Example Context: |
Definition: | The database name may be given as a cvParam if it maps exactly to one of the release databases listed in the CV, otherwise a userParam should be used. | ||||||||||||
Type: | ParamType | ||||||||||||
Attributes: | none |
||||||||||||
Subelements: |
|
||||||||||||
Example Context: | <DatabaseName> <userParam name="fawaz_PXD000652_combined_concatenated_target_decoy.fasta"></userParam> </DatabaseName> |
||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/DataCollection/Inputs/SearchDatabase/DatabaseName MAY supply a *child* term of MS:1001013 (database name) one or more times e.g.: MS:1001084 (database nr) e.g.: MS:1001104 (database UniProtKB/Swiss-Prot) e.g.: MS:1001142 (database IPI_human) e.g.: MS:1001285 (database IPI_mouse) e.g.: MS:1001286 (database IPI_rat) e.g.: MS:1001287 (database IPI_zebrafish) e.g.: MS:1001288 (database IPI_chicken) e.g.: MS:1001289 (database IPI_cow) e.g.: MS:1001290 (database IPI_arabidopsis) e.g.: MS:1002060 (database UniProtKB/TrEMBL) |
||||||||||||
Example userParams: | <userParam name="fawaz_PXD000652_combined_concatenated_target_decoy.fasta"></userParam> |
Definition: | A specification of how a nucleic acid sequence database was translated for searching. | ||||||||
Type: | DatabaseTranslationType | ||||||||
Attributes: |
|
||||||||
Subelements: |
|
||||||||
Example Context: |
Definition: | The collection of input and output data sets of the analyses. | ||||||||||||
Type: | DataCollectionType | ||||||||||||
Attributes: | none |
||||||||||||
Subelements: |
|
||||||||||||
Graphical Context: | ![]() |
||||||||||||
Example Context: | <DataCollection> <Inputs xmlns="http://psidev.info/psi/pi/mzIdentML/1.2"> <SourceFile location="example_files/55merge_omssa.omx" id="SourceFile_1"> <FileFormat> <cvParam cvRef="PSI-MS" accession="MS:1001400" name="OMSSA xml format"></cvParam> </FileFormat> </SourceFile> ... </DataCollection> |
Definition: | A database sequence from the specified SearchDatabase (nucleic acid or amino acid). If the sequence is nucleic acid, the source nucleic acid sequence should be given in the seq attribute rather than a translated sequence. | ||||||||||||||||||||||||
Type: | DBSequenceType | ||||||||||||||||||||||||
Attributes: |
|
||||||||||||||||||||||||
Subelements: |
|
||||||||||||||||||||||||
Example Context: | <DBSequence searchDatabase_ref="SearchDB_1" accession="generic|B_GENSCAN00000043743_REVERSED|p:genscan" id="dbseq_generic|B_GENSCAN00000043743_REVERSED|p:genscan"></DBSequence> |
||||||||||||||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/SequenceCollection/DBSequence MUST supply term MS:1002638 (chromosome strand) only once MAY supply a *child* term of MS:1001342 (database sequence details) one or more times e.g.: MS:1001088 (protein description) e.g.: MS:1001090 (taxonomy nomenclature) e.g.: MS:1001343 (NA sequence) e.g.: MS:1001344 (AA sequence) e.g.: MS:1001467 (taxonomy: NCBI TaxID) e.g.: MS:1001468 (taxonomy: common name) e.g.: MS:1001469 (taxonomy: scientific name) e.g.: MS:1001470 (taxonomy: Swiss-Prot ID) MAY supply a *child* term of MS:1002636 (proteogenomics attribute) one or more times e.g.: MS:1002637 (chromosome name) e.g.: MS:1002638 (chromosome strand) e.g.: MS:1002639 (peptide start on chromosome) e.g.: MS:1002640 (peptide end on chromosome) e.g.: MS:1002641 (peptide exon count) e.g.: MS:1002642 (peptide exon nucleotide sizes) e.g.: MS:1002643 (peptide start positions on chromosome) e.g.: MS:1002644 (genome reference version) MAY supply a *child* term of MS:1001089 (molecule taxonomy) one or more times e.g.: MS:1001090 (taxonomy nomenclature) e.g.: MS:1001467 (taxonomy: NCBI TaxID) e.g.: MS:1001468 (taxonomy: common name) e.g.: MS:1001469 (taxonomy: scientific name) e.g.: MS:1001470 (taxonomy: Swiss-Prot ID) MUST supply term MS:1002637 (chromosome name) only once MAY supply term MS:1001088 (protein description) only once MUST supply term MS:1002644 (genome reference version) only once |
Definition: | The details of an individual cleavage enzyme should be provided by giving a regular expression or a CV term if a "standard" enzyme cleavage has been performed. | ||||||||||||||||||||||||||||||||
Type: | EnzymeType | ||||||||||||||||||||||||||||||||
Attributes: |
|
||||||||||||||||||||||||||||||||
Subelements: |
|
||||||||||||||||||||||||||||||||
Example Context: | <Enzyme nTermGain="H" cTermGain="OH" semiSpecific="false" missedCleavages="2" id="Enz1"> <EnzymeName> <cvParam cvRef="PSI-MS" accession="MS:1001251" name="Trypsin"></cvParam> </EnzymeName> </Enzyme> |
Definition: | The name of the enzyme from a CV. | ||||||||||||
Type: | ParamListType | ||||||||||||
Attributes: | none |
||||||||||||
Subelements: |
|
||||||||||||
Example Context: | <EnzymeName> <cvParam cvRef="PSI-MS" accession="MS:1001251" name="Trypsin"></cvParam> </EnzymeName> |
||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/Enzymes/Enzyme/EnzymeName MAY supply a *child* term of MS:1001045 (cleavage agent name) only once e.g.: MS:1001091 (NoEnzyme) WARNING: Term has no definition! e.g.: MS:1001251 (Trypsin) e.g.: MS:1001303 (Arg-C) e.g.: MS:1001304 (Asp-N) e.g.: MS:1001305 (Asp-N_ambic) e.g.: MS:1001306 (Chymotrypsin) e.g.: MS:1001307 (CNBr) e.g.: MS:1001308 (Formic_acid) e.g.: MS:1001309 (Lys-C) e.g.: MS:1001310 (Lys-C/P) et al. |
||||||||||||
Example cvParams: | <cvParam cvRef="PSI-MS" accession="MS:1001251" name="Trypsin"></cvParam> |
Definition: | The list of enzymes used in experiment | ||||||||
Type: | EnzymesType | ||||||||
Attributes: |
|
||||||||
Subelements: |
|
||||||||
Example Context: | <Enzymes independent="false"> <Enzyme nTermGain="H" cTermGain="OH" semiSpecific="false" missedCleavages="2" id="Enz1"> <EnzymeName> <cvParam cvRef="PSI-MS" accession="MS:1001251" name="Trypsin"></cvParam> </EnzymeName> </Enzyme> </Enzymes> |
Definition: | All sequences fulfilling the specifed criteria are excluded. | ||||||||||||
Type: | ParamListType | ||||||||||||
Attributes: | none |
||||||||||||
Subelements: |
|
||||||||||||
Example Context: | |||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/DatabaseFilters/Filter/Exclude MAY supply a *child* term of MS:1001512 (Sequence database filters) one or more times e.g.: MS:1001090 (taxonomy nomenclature) e.g.: MS:1001201 (DB MW filter maximum) e.g.: MS:1001202 (DB MW filter minimum) e.g.: MS:1001203 (DB PI filter maximum) e.g.: MS:1001204 (DB PI filter minimum) e.g.: MS:1001467 (taxonomy: NCBI TaxID) e.g.: MS:1001468 (taxonomy: common name) e.g.: MS:1001469 (taxonomy: scientific name) e.g.: MS:1001470 (taxonomy: Swiss-Prot ID) e.g.: MS:1001513 (DB sequence filter pattern) et al. |
Definition: | A URI to access documentation and tools to interpret the external format of the ExternalData instance. For example, XML Schema or static libraries (APIs) to access binary formats. |
Type: | xsd:anyURI |
Attributes: | none |
Subelements: | none |
Example Context: |
Definition: | The format of the ExternalData file, for example "tiff" for image files. | ||||||||
Type: | FileFormatType | ||||||||
Attributes: | none |
||||||||
Subelements: |
|
||||||||
Example Context: | <FileFormat> <cvParam cvRef="PSI-MS" accession="MS:1001062" name="Mascot MGF format"></cvParam> </FileFormat> |
||||||||
cvParam Mapping Rules: | Path /MzIdentML/DataCollection/Inputs/SearchDatabase/FileFormat MUST supply a *child* term of MS:1001347 (database file formats) one or more times e.g.: MS:1001348 (FASTA format) e.g.: MS:1001349 (ASN.1) e.g.: MS:1001350 (NCBI *.p*) e.g.: MS:1001351 (clustal aln) e.g.: MS:1001352 (embl em) e.g.: MS:1001353 (NBRF PIR) e.g.: MS:1001462 (PEFF format) e.g.: MS:1002659 (UniProtKB text sequence format) e.g.: MS:1002660 (UniProtKB XML sequence format) Path /MzIdentML/DataCollection/Inputs/SourceFile/FileFormat MUST supply a *child* term of MS:1001040 (intermediate analysis format) only once e.g.: MS:1000742 (Bioworks SRF format) e.g.: MS:1001107 (data stored in database) e.g.: MS:1001199 (Mascot DAT format) e.g.: MS:1001200 (SEQUEST out file format) e.g.: MS:1001242 (SEQUEST out folder) e.g.: MS:1001243 (SEQUEST summary) e.g.: MS:1001275 (ProteinScape SearchEvent) e.g.: MS:1001276 (ProteinScape Gel) e.g.: MS:1001399 (OMSSA csv format) e.g.: MS:1001400 (OMSSA xml format) et al. Path /MzIdentML/DataCollection/Inputs/SpectraData/FileFormat MUST supply a *child* term of MS:1000560 (mass spectrometer file format) one or more times e.g.: MS:1000526 (Waters raw format) e.g.: MS:1000562 (ABI WIFF format) e.g.: MS:1000563 (Thermo RAW format) e.g.: MS:1000564 (PSI mzData format) e.g.: MS:1000565 (Micromass PKL format) e.g.: MS:1000566 (ISB mzXML format) e.g.: MS:1000567 (Bruker/Agilent YEP format) e.g.: MS:1000584 (mzML format) e.g.: MS:1000613 (DTA format) e.g.: MS:1000614 (ProteinLynx Global Server mass spectrum XML format) et al. |
||||||||
Example cvParams: | <cvParam cvRef="PSI-MS" accession="MS:1001400" name="OMSSA xml format"></cvParam> <cvParam cvRef="PSI-MS" accession="MS:1001348" name="FASTA format"></cvParam> <cvParam cvRef="PSI-MS" accession="MS:1001062" name="Mascot MGF format"></cvParam> |
Definition: | Filters applied to the search database. The filter MUST include at least one of Include and Exclude. If both are used, it is assumed that inclusion is performed first. | ||||||||||||||||
Type: | FilterType | ||||||||||||||||
Attributes: | none |
||||||||||||||||
Subelements: |
|
||||||||||||||||
Example Context: |
Definition: | The type of filter e.g. database taxonomy filter, pi filter, mw filter | ||||||||||||
Type: | ParamType | ||||||||||||
Attributes: | none |
||||||||||||
Subelements: |
|
||||||||||||
Example Context: | |||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/DatabaseFilters/Filter/FilterType MUST supply a *child* term of MS:1001511 (Sequence database filter types) one or more times e.g.: MS:1001020 (DB filter taxonomy) e.g.: MS:1001021 (DB filter on accession numbers) e.g.: MS:1001022 (DB MW filter) e.g.: MS:1001023 (DB PI filter) e.g.: MS:1001027 (DB filter on sequence pattern) |
Definition: | An array of values for a given type of measure and for a particular ion type, in parallel to the index of ions identified. | ||||||||||||
Type: | FragmentArrayType | ||||||||||||
Attributes: |
|
||||||||||||
Subelements: | none |
||||||||||||
Example Context: |
Definition: | The product ions identified in this result. | ||||||||
Type: | FragmentationType | ||||||||
Attributes: | none |
||||||||
Subelements: |
|
||||||||
Example Context: |
Definition: | Contains the types of measures that will be reported in generic arrays for each SpectrumIdentificationItem e.g. product ion m/z, product ion intensity, product ion m/z error | ||||||||
Type: | FragmentationTableType | ||||||||
Attributes: | none |
||||||||
Subelements: |
|
||||||||
Example Context: |
Definition: | The tolerance of the search given as a plus and minus value with units. | ||||||||
Type: | ToleranceType | ||||||||
Attributes: | none |
||||||||
Subelements: |
|
||||||||
Example Context: | <FragmentTolerance> <cvParam cvRef="PSI-MS" accession="MS:1001412" name="search tolerance plus value" value="0.7" unitAccession="UO:0000221" unitName="dalton" unitCvRef="UO"></cvParam> <cvParam cvRef="PSI-MS" accession="MS:1001413" name="search tolerance minus value" value="0.7" unitAccession="UO:0000221" unitName="dalton" unitCvRef="UO"></cvParam> </FragmentTolerance> |
||||||||
cvParam Mapping Rules: | Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/FragmentTolerance MUST supply term MS:1001412 (search tolerance plus value) only once MUST supply term MS:1001413 (search tolerance minus value) only once |
||||||||
Example cvParams: | <cvParam cvRef="PSI-MS" accession="MS:1001412" name="search tolerance plus value" value="0.7" unitAccession="UO:0000221" unitName="dalton" unitCvRef="UO"></cvParam> <cvParam cvRef="PSI-MS" accession="MS:1001413" name="search tolerance minus value" value="0.7" unitAccession="UO:0000221" unitName="dalton" unitCvRef="UO"></cvParam> |
Definition: | All sequences fulfilling the specifed criteria are included. | ||||||||||||
Type: | ParamListType | ||||||||||||
Attributes: | none |
||||||||||||
Subelements: |
|
||||||||||||
Example Context: | |||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/DatabaseFilters/Filter/Include MAY supply a *child* term of MS:1001512 (Sequence database filters) one or more times e.g.: MS:1001090 (taxonomy nomenclature) e.g.: MS:1001201 (DB MW filter maximum) e.g.: MS:1001202 (DB MW filter minimum) e.g.: MS:1001203 (DB PI filter maximum) e.g.: MS:1001204 (DB PI filter minimum) e.g.: MS:1001467 (taxonomy: NCBI TaxID) e.g.: MS:1001468 (taxonomy: common name) e.g.: MS:1001469 (taxonomy: scientific name) e.g.: MS:1001470 (taxonomy: Swiss-Prot ID) e.g.: MS:1001513 (DB sequence filter pattern) et al. |
Definition: | The inputs to the analyses including the databases searched, the spectral data and the source file converted to mzIdentML. | ||||||||||||||||
Type: | InputsType | ||||||||||||||||
Attributes: | none |
||||||||||||||||
Subelements: |
|
||||||||||||||||
Example Context: | <Inputs xmlns="http://psidev.info/psi/pi/mzIdentML/1.2"> <SourceFile location="example_files/55merge_omssa.omx" id="SourceFile_1"> <FileFormat> <cvParam cvRef="PSI-MS" accession="MS:1001400" name="OMSSA xml format"></cvParam> </FileFormat> </SourceFile> <SearchDatabase numDatabaseSequences="299106" location="fawaz_PXD000652_combined_concatenated_target_decoy.fasta" id="SearchDB_1"> ... </Inputs> |
Definition: | One of the spectra data sets used. | ||||||||
Type: | InputSpectraType | ||||||||
Attributes: |
|
||||||||
Subelements: | none |
||||||||
Example Context: | <InputSpectra spectraData_ref="SD_COMBINED_SE_0"></InputSpectra> |
Definition: | The lists of spectrum identifications that are input to the protein detection process. | ||||||||
Type: | InputSpectrumIdentificationsType | ||||||||
Attributes: |
|
||||||||
Subelements: | none |
||||||||
Example Context: |
Definition: | IonType defines the index of fragmentation ions being reported, importing a CV term for the type of ion e.g. b ion. Example: if b3 b7 b8 and b10 have been identified, the index attribute will contain 3 7 8 10, and the corresponding values will be reported in parallel arrays below | ||||||||||||||||
Type: | IonTypeType | ||||||||||||||||
Attributes: |
|
||||||||||||||||
Subelements: |
|
||||||||||||||||
Example Context: | |||||||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/DataCollection/AnalysisData/SpectrumIdentificationList/SpectrumIdentificationResult/SpectrumIdentificationItem/Fragmentation/IonType MAY supply a *child* term of MS:1001221 (product ion attribute) one or more times e.g.: MS:1000903 (product ion series ordinal) e.g.: MS:1000904 (product ion m/z delta) e.g.: MS:1000926 (product interpretation rank) e.g.: MS:1001220 (frag: y ion) e.g.: MS:1001222 (frag: b ion - H2O) e.g.: MS:1001223 (frag: y ion - H2O) e.g.: MS:1001224 (frag: b ion) e.g.: MS:1001225 (product ion m/z) e.g.: MS:1001227 (product ion m/z error) e.g.: MS:1001228 (frag: x ion) et al. MAY supply term MS:1000336 (neutral loss) one or more times |
Definition: | The masses of residues used in the search. | ||||||||||||||||||||
Type: | MassTableType | ||||||||||||||||||||
Attributes: |
|
||||||||||||||||||||
Subelements: |
|
||||||||||||||||||||
Example Context: | |||||||||||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/MassTable MAY supply a *child* term of MS:1001354 (mass table options) one or more times e.g.: MS:1001346 (AAIndex mass table) |
Definition: | References to CV terms defining the measures about product ions to be reported in SpectrumIdentificationItem | ||||||||||||
Type: | MeasureType | ||||||||||||
Attributes: |
|
||||||||||||
Subelements: |
|
||||||||||||
Example Context: | |||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/DataCollection/AnalysisData/SpectrumIdentificationList/FragmentationTable/Measure MUST supply term MS:1001226 (product ion intensity) only once MUST supply term MS:1001225 (product ion m/z) only once MUST supply term MS:1001227 (product ion m/z error) only once |
Definition: | A molecule modification specification. If n modifications have been found on a peptide, there should be n instances of Modification. If multiple modifications are provided as cvParams, it is assumed that the modification is ambiguous i.e. one modification or another. A cvParam MUST be provided with the identification of the modification sourced from a suitable CV e.g. UNIMOD. If the modification is not present in the CV (and this will be checked by the semantic validator within a given tolerance window), there is a “unknown modification†CV term that MUST be used instead. A neutral loss should be defined as an additional CVParam within Modification. If more complex information should be given about neutral losses (such as presence/absence on particular product ions), this can additionally be encoded within the FragmentationArray. | ||||||||||||||||||||
Type: | ModificationType | ||||||||||||||||||||
Attributes: |
|
||||||||||||||||||||
Subelements: |
|
||||||||||||||||||||
Example Context: | <Modification location="1" residues="E" monoisotopicMassDelta="-18.01056"> <cvParam cvRef="UNIMOD" accession="UNIMOD:27" name="Glu->pyro-Glu"></cvParam> </Modification> |
||||||||||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/SequenceCollection/Peptide/Modification MAY supply term MS:1002509 (cross-link donor) only once MAY supply term MS:1002510 (cross-link acceptor) only once MAY supply a *child* term of UNIMOD:0 (unimod root node) only once MAY supply a *child* term of XLMOD:00004 (cross-linker) only once MAY supply a *child* term of MS:1001471 (peptide modification details) only once e.g.: MS:1001460 (unknown modification) e.g.: MS:1001524 (fragment neutral loss) e.g.: MS:1001525 (precursor neutral loss) e.g.: MS:1001972 (PTM scoring algorithm version) e.g.: MS:1002028 (nucleic acid base modification) e.g.: MS:1002029 (original nucleic acid sequence) e.g.: MS:1002030 (modified nucleic acid sequence) MAY supply a *child* term of XLMOD:00002 (cross-linker related PTM) only once MAY supply a *child* term of MOD:00000 (protein modification) only once |
||||||||||||||||||||
Example cvParams: | <cvParam cvRef="UNIMOD" accession="UNIMOD:35" name="Oxidation"></cvParam> <cvParam cvRef="UNIMOD" accession="UNIMOD:37" name="Trimethyl"></cvParam> <cvParam cvRef="UNIMOD" accession="UNIMOD:4" name="Carbamidomethyl"></cvParam> <cvParam cvRef="UNIMOD" accession="UNIMOD:27" name="Glu->pyro-Glu"></cvParam> <cvParam cvRef="UNIMOD" accession="UNIMOD:1" name="Acetyl"></cvParam> <cvParam cvRef="UNIMOD" accession="UNIMOD:385" name="Ammonia-loss"></cvParam> <cvParam cvRef="UNIMOD" accession="UNIMOD:28" name="Gln->pyro-Glu"></cvParam> <cvParam cvRef="UNIMOD" accession="UNIMOD:575" name="Gly->Val"></cvParam> |
Definition: | The specification of static/variable modifications (e.g. Oxidation of Methionine) that are to be considered in the spectra search. | ||||||||
Type: | ModificationParamsType | ||||||||
Attributes: | none |
||||||||
Subelements: |
|
||||||||
Example Context: | <ModificationParams> <SearchModification fixedMod="true" massDelta="57.021465" residues="C"> <cvParam cvRef="UNIMOD" accession="UNIMOD:4" name="Carbamidomethyl"></cvParam> </SearchModification> <SearchModification fixedMod="false" massDelta="15.994915" residues="M"> <cvParam cvRef="UNIMOD" accession="UNIMOD:35" name="Oxidation"></cvParam> </SearchModification> ... </ModificationParams> |
Definition: | Organizations are entities like companies, universities, government agencies. Any additional information such as the address, email etc. should be supplied either as CV parameters or as user parameters. | ||||||||||||||||
Type: | OrganizationType | ||||||||||||||||
Attributes: |
|
||||||||||||||||
Subelements: |
|
||||||||||||||||
Example Context: | <Organization id="ORG_DOC_OWNER" name="myworkplace"> <cvParam cvRef="PSI-MS" accession="MS:1000586" name="contact name" value="address"></cvParam> </Organization> |
||||||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/AuditCollection/Organization SHOULD supply term MS:1000588 (contact URL) one or more times SHOULD supply term MS:1000587 (contact address) one or more times SHOULD supply term MS:1000589 (contact email) one or more times SHOULD supply term MS:1000586 (contact name) one or more times |
||||||||||||||||
Example cvParams: | <cvParam cvRef="PSI-MS" accession="MS:1000586" name="contact name" value="address"></cvParam> |
Definition: | The containing organization (the university or business which a lab belongs to, etc.) | ||||||||
Type: | ParentOrganizationType | ||||||||
Attributes: |
|
||||||||
Subelements: | none |
||||||||
Example Context: |
Definition: | The tolerance of the search given as a plus and minus value with units. | ||||||||
Type: | ToleranceType | ||||||||
Attributes: | none |
||||||||
Subelements: |
|
||||||||
Example Context: | <ParentTolerance> <cvParam cvRef="PSI-MS" accession="MS:1001412" name="search tolerance plus value" value="10.0" unitAccession="UO:0000221" unitName="dalton" unitCvRef="UO"></cvParam> <cvParam cvRef="PSI-MS" accession="MS:1001413" name="search tolerance minus value" value="10.0" unitAccession="UO:0000221" unitName="dalton" unitCvRef="UO"></cvParam> </ParentTolerance> |
||||||||
cvParam Mapping Rules: | Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/ParentTolerance MUST supply term MS:1001412 (search tolerance plus value) only once MUST supply term MS:1001413 (search tolerance minus value) only once |
||||||||
Example cvParams: | <cvParam cvRef="PSI-MS" accession="MS:1001412" name="search tolerance plus value" value="10.0" unitAccession="UO:0000221" unitName="dalton" unitCvRef="UO"></cvParam> <cvParam cvRef="PSI-MS" accession="MS:1001413" name="search tolerance minus value" value="10.0" unitAccession="UO:0000221" unitName="dalton" unitCvRef="UO"></cvParam> |
Definition: | One (poly)peptide (a sequence with modifications). The combination of Peptide sequence and modifications MUST be unique in the file. | ||||||||||||||||||||||||
Type: | PeptideType | ||||||||||||||||||||||||
Attributes: |
|
||||||||||||||||||||||||
Subelements: |
|
||||||||||||||||||||||||
Example Context: | <Peptide id="CCPQCCSSGCSQNLCGPLCVTTPYYCTR_##Carbamidomethyl(C):1##Carbamidomethyl(C):2##Carbamidomethyl(C):5##Carbamidomethyl(C):6##Carbamidomethyl(C):10##Carbamidomethyl(C):15##Carbamidomethyl(C):19##Carbamidomethyl(C):26"> <PeptideSequence>CCPQCCSSGCSQNLCGPLCVTTPYYCTR</PeptideSequence> <Modification location="1" residues="C" monoisotopicMassDelta="57.02147"> <cvParam cvRef="UNIMOD" accession="UNIMOD:4" name="Carbamidomethyl"></cvParam> </Modification> <Modification location="2" residues="C" monoisotopicMassDelta="57.02147"> <cvParam cvRef="UNIMOD" accession="UNIMOD:4" name="Carbamidomethyl"></cvParam> ... </Peptide> |
||||||||||||||||||||||||
cvParam Mapping Rules: | WARNING: There are no cvParam mapping rules for this element even though cvParam is an allowed subelement. |
Definition: | PeptideEvidence links a specific Peptide element to a specific position in a DBSequence. There MUST only be one PeptideEvidence item per Peptide-to-DBSequence-position. | ||||||||||||||||||||||||||||||||||||||||||||||||
Type: | PeptideEvidenceType | ||||||||||||||||||||||||||||||||||||||||||||||||
Attributes: |
|
||||||||||||||||||||||||||||||||||||||||||||||||
Subelements: |
|
||||||||||||||||||||||||||||||||||||||||||||||||
Example Context: | <PeptideEvidence dBSequence_ref="dbseq_generic|B_GENSCAN00000036974_REVERSED|p:genscan" peptide_ref="CRCQYSGVNNLCHTSSHCPNQGSTCENVDTCLKPDEPK_##Carbamidomethyl(C):1##Carbamidomethyl(C):3##Carbamidomethyl(C):12##Carbamidomethyl(C):18##Carbamidomethyl(C):25##Carbamidomethyl(C):31##Ammonia-loss(C):1" start="494" end="531" pre="R" post="L" isDecoy="true" id="CRCQYSGVNNLCHTSSHCPNQGSTCENVDTCLKPDEPK_generic|B_GENSCAN00000036974_REVERSED|p:genscan_494_531"></PeptideEvidence> |
||||||||||||||||||||||||||||||||||||||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/SequenceCollection/PeptideEvidence MUST supply term MS:1002642 (peptide exon nucleotide sizes) only once MUST supply term MS:1002643 (peptide start positions on chromosome) only once MUST supply term MS:1002640 (peptide end on chromosome) only once MUST supply term MS:1002641 (peptide exon count) only once |
Definition: | Reference to the PeptideEvidence element identified. If a specific sequence can be assigned to multiple proteins and or positions in a protein all possible PeptideEvidence elements should be referenced here. | ||||||||
Type: | PeptideEvidenceRefType | ||||||||
Attributes: |
|
||||||||
Subelements: | none |
||||||||
Example Context: | <PeptideEvidenceRef peptideEvidence_ref="GEGGAQDGSGTEGVGATGAAGGRGAQGAPGGTGGAGSGSGLHHQQDSGYQGASGSGGAQSGGR_generic|A_ENSP00000352272_REVERSED|p:known_125_187"></PeptideEvidenceRef> |
||||||||
Example cvParams: | <cvParam cvRef="PSI-MS" accession="MS:1002356" name="PSM-level combined FDRScore" value="3.9523759266648643E-7"></cvParam> <cvParam cvRef="PSI-MS" accession="MS:1002359" name="peptide sequence-level local FDR" value="0.0"></cvParam> <cvParam cvRef="PSI-MS" accession="MS:1001868" name="distinct peptide-level q-value" value="0.0"></cvParam> <cvParam cvRef="PSI-MS" accession="MS:1002360" name="distinct peptide-level FDRScore" value="3.0117913560694526E-7"></cvParam> <cvParam cvRef="PSI-MS" accession="MS:1002500" name="peptide passes threshold" value="true"></cvParam> <cvParam cvRef="PSI-MS" accession="MS:1002520" name="peptide group ID" value="AVMDDFAAFVEK_##Oxidation(M):3"></cvParam> <cvParam cvRef="PSI-MS" accession="MS:1002439" name="final PSM list"></cvParam> |
||||||||
Example userParams: | <userParam name="search engines identifying PSM" value="12"></userParam> |
Definition: | Peptide evidence on which this ProteinHypothesis is based by reference to a PeptideEvidence element. | ||||||||
Type: | PeptideHypothesisType | ||||||||
Attributes: |
|
||||||||
Subelements: |
|
||||||||
Example Context: |
Definition: | The amino acid sequence of the (poly)peptide. If a substitution modification has been found, the original sequence should be reported. |
Type: | sequence |
Attributes: | none |
Subelements: | none |
Example Context: | <PeptideSequence>GEGGAQDGSGTEGVGATGAAGGRGAQGAPGGTGGAGSGSGLHHQQDSGYQGASGSGGAQSGGR</PeptideSequence> |
Definition: | A person's name and contact details. Any additional information such as the address, contact email etc. should be supplied using CV parameters or user parameters. | ||||||||||||||||||||||||
Type: | PersonType | ||||||||||||||||||||||||
Attributes: |
|
||||||||||||||||||||||||
Subelements: |
|
||||||||||||||||||||||||
Example Context: | <Person lastName="secondName" firstName="firstname" id="PERSON_DOC_OWNER"> <cvParam cvRef="PSI-MS" accession="MS:1000587" name="contact address" value="address"></cvParam> <Affiliation organization_ref="ORG_DOC_OWNER"></Affiliation> </Person> |
||||||||||||||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/AuditCollection/Person SHOULD supply term MS:1000588 (contact URL) one or more times SHOULD supply term MS:1000587 (contact address) one or more times SHOULD supply term MS:1000589 (contact email) one or more times |
||||||||||||||||||||||||
Example cvParams: | <cvParam cvRef="PSI-MS" accession="MS:1000587" name="contact address" value="address"></cvParam> |
Definition: | A set of logically related results from a protein detection, for example to represent conflicting assignments of peptides to proteins. | ||||||||||||||||
Type: | ProteinAmbiguityGroupType | ||||||||||||||||
Attributes: |
|
||||||||||||||||
Subelements: |
|
||||||||||||||||
Example Context: | |||||||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/DataCollection/AnalysisData/ProteinDetectionList/ProteinAmbiguityGroup MAY supply a *child* term of MS:1002698 (protein cluster identification attribute) only once e.g.: MS:1002407 (cluster identifier) MAY supply a *child* term of MS:1002346 (protein group-level identification attribute) one or more times e.g.: MS:1001164 (Paragon:unused protscore) e.g.: MS:1001165 (Paragon:total protscore) e.g.: MS:1002236 (ProteoGrouper:PAG score) e.g.: MS:1002370 (protein group-level local FDR) e.g.: MS:1002371 (protein group-level p-value) e.g.: MS:1002372 (protein group-level e-value) e.g.: MS:1002373 (protein group-level q-value) e.g.: MS:1002374 (protein group-level FDRScore) e.g.: MS:1002375 (protein group-level combined FDRScore) e.g.: MS:1002376 (protein group-level probability) et al. MAY supply term MS:1002542 (PeptideShaker protein confidence type) only once MAY supply term MS:1002471 (PeptideShaker protein group confidence) only once MUST supply term MS:1002415 (protein group passes threshold) only once MAY supply term MS:1002470 (PeptideShaker protein group score) only once |
Definition: | An Analysis which assembles a set of peptides (e.g. from a spectra search analysis) to proteins. | ||||||||||||||||||||||||
Type: | ProteinDetectionType | ||||||||||||||||||||||||
Attributes: |
|
||||||||||||||||||||||||
Subelements: |
|
||||||||||||||||||||||||
Example Context: |
Definition: | A single result of the ProteinDetection analysis (i.e. a protein). | ||||||||||||||||||||
Type: | ProteinDetectionHypothesisType | ||||||||||||||||||||
Attributes: |
|
||||||||||||||||||||
Subelements: |
|
||||||||||||||||||||
Example Context: | |||||||||||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/DataCollection/AnalysisData/ProteinDetectionList/ProteinAmbiguityGroup/ProteinDetectionHypothesis MAY supply a *child* term of MS:1001085 (protein-level identification attribute) one or more times e.g.: MS:1001088 (protein description) e.g.: MS:1001090 (taxonomy nomenclature) e.g.: MS:1001093 (sequence coverage) e.g.: MS:1001097 (distinct peptide sequences) e.g.: MS:1001098 (confident distinct peptide sequences) e.g.: MS:1001099 (confident peptide qualification) e.g.: MS:1001100 (confident peptide sequence number) e.g.: MS:1001125 (manual validation) e.g.: MS:1001147 (protein ambiguity group result details) WARNING: Term has no definition! e.g.: MS:1001157 (SEQUEST:sp) et al. MAY supply a *child* term of MS:1001116 (single protein identification statistic) one or more times e.g.: MS:1001125 (manual validation) e.g.: MS:1001157 (SEQUEST:sp) e.g.: MS:1001158 (SEQUEST:Uniq) WARNING: Term has no definition! e.g.: MS:1001164 (Paragon:unused protscore) e.g.: MS:1001165 (Paragon:total protscore) e.g.: MS:1001166 (Paragon:score) e.g.: MS:1001167 (Paragon:confidence) e.g.: MS:1001168 (Paragon:expression error factor) e.g.: MS:1001169 (Paragon:expression change p-value) e.g.: MS:1001170 (Paragon:contrib) et al. MAY supply a *child* term of MS:1001153 (search engine specific score) one or more times e.g.: MS:1001147 (protein ambiguity group result details) WARNING: Term has no definition! e.g.: MS:1001155 (SEQUEST:xcorr) e.g.: MS:1001159 (SEQUEST:expectation value) e.g.: MS:1001163 (SEQUEST:consensus score) e.g.: MS:1001874 (FDRScore) e.g.: MS:1002052 (MS-GF:SpecEValue) e.g.: MS:1002053 (MS-GF:EValue) e.g.: MS:1002125 (combined FDRScore) e.g.: MS:1002252 (Comet:xcorr) e.g.: MS:1002257 (Comet:expectation value) et al. MAY supply a *child* term of MS:1001805 (quantification datatype) one or more times e.g.: MS:1001130 (peptide raw area) e.g.: MS:1001131 (error on peptide area) e.g.: MS:1001132 (peptide ratio) e.g.: MS:1001133 (error on peptide ratio) e.g.: MS:1001134 (protein ratio) e.g.: MS:1001135 (error on protein ratio) e.g.: MS:1001136 (p-value (protein diff from 1 randomly)) e.g.: MS:1001137 (absolute quantity) e.g.: MS:1001138 (error on absolute quantity) e.g.: MS:1001141 (intensity of precursor ion) et al. MAY supply a *child* term of MS:1001060 (quality estimation method details) one or more times e.g.: MS:1001058 (quality estimation by manual validation) e.g.: MS:1001194 (quality estimation with decoy database) e.g.: MS:1001447 (prot:FDR threshold) e.g.: MS:1001448 (pep:FDR threshold) e.g.: MS:1001454 (quality estimation with implicite decoy sequences) e.g.: MS:1001494 (no threshold) e.g.: MS:1001574 (report only spectra assigned to identified proteins) e.g.: MS:1001868 (distinct peptide-level q-value) e.g.: MS:1002260 (PSM:FDR threshold) e.g.: MS:1002486 (protein group-level statistical threshold) MUST supply a *child* term of MS:1001101 (protein group or subset relationship) one or more times e.g.: MS:1001591 (anchor protein) e.g.: MS:1001592 (family member protein) e.g.: MS:1001593 (group member with undefined relationship OR ortholog protein) e.g.: MS:1001594 (sequence same-set protein) e.g.: MS:1001595 (spectrum same-set protein) e.g.: MS:1001596 (sequence sub-set protein) e.g.: MS:1001597 (spectrum sub-set protein) e.g.: MS:1001598 (sequence subsumable protein) e.g.: MS:1001599 (spectrum subsumable protein) e.g.: MS:1002213 (PAnalyzer:conclusive protein) et al. MAY supply a *child* term of MS:1002664 (interaction score derived from cross-linking) one or more times e.g.: MS:1002676 (protein-pair-level global FDR) e.g.: MS:1002677 (residue-pair-level global FDR) MAY supply a *child* term of MS:1001143 (PSM-level search engine specific statistic) one or more times e.g.: MS:1001154 (SEQUEST:probability) e.g.: MS:1001155 (SEQUEST:xcorr) e.g.: MS:1001156 (SEQUEST:deltacn) e.g.: MS:1001160 (SEQUEST:sf) e.g.: MS:1001161 (SEQUEST:matched ions) e.g.: MS:1001162 (SEQUEST:total ions) e.g.: MS:1001171 (Mascot:score) e.g.: MS:1001172 (Mascot:expectation value) e.g.: MS:1001173 (Mascot:matched ions) e.g.: MS:1001174 (Mascot:total ions) et al. |
Definition: | The protein list resulting from a protein detection process. | ||||||||||||||||
Type: | ProteinDetectionListType | ||||||||||||||||
Attributes: |
|
||||||||||||||||
Subelements: |
|
||||||||||||||||
Example Context: | <ProteinDetectionList id="PDL_1" > </ProteinDetectionList> |
||||||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/DataCollection/AnalysisData/ProteinDetectionList MAY supply a *child* term of MS:1001184 (search statistics) one or more times e.g.: MS:1001035 (date / time search performed) e.g.: MS:1001036 (search time taken) e.g.: MS:1001177 (number of molecular hypothesis considered) MUST supply term MS:1002404 (count of identified proteins) only once MAY supply term MS:1002406 (count of identified clusters) only once |
Definition: | The parameters and settings of a ProteinDetection process. | ||||||||||||||||
Type: | ProteinDetectionProtocolType | ||||||||||||||||
Attributes: |
|
||||||||||||||||
Subelements: |
|
||||||||||||||||
Graphical Context: | ![]() |
||||||||||||||||
Example Context: |
Definition: | The Provider of the mzIdentML record in terms of the contact and software. | ||||||||||||||||
Type: | ProviderType | ||||||||||||||||
Attributes: |
|
||||||||||||||||
Subelements: |
|
||||||||||||||||
Example Context: | <Provider xmlns="http://psidev.info/psi/pi/mzIdentML/1.2" id="PROVIDER"> <ContactRole contact_ref="PERSON_DOC_OWNER"> <Role> <cvParam cvRef="PSI-MS" accession="MS:1001271" name="researcher"></cvParam> </Role> </ContactRole> </Provider> |
Definition: | The specification of a single residue within the mass table. | ||||||||||||
Type: | ResidueType | ||||||||||||
Attributes: |
|
||||||||||||
Subelements: | none |
||||||||||||
Example Context: |
Definition: | The roles (lab equipment sales, contractor, etc.) the Contact fills. | ||||||||
Type: | RoleType | ||||||||
Attributes: | none |
||||||||
Subelements: |
|
||||||||
Example Context: | <Role> <cvParam cvRef="PSI-MS" accession="MS:1001271" name="researcher"></cvParam> </Role> |
||||||||
cvParam Mapping Rules: | Path /MzIdentML/Provider/ContactRole/Role MUST supply a *child* term of MS:1001266 (role type) one or more times e.g.: MS:1001267 (software vendor) e.g.: MS:1001268 (programmer) e.g.: MS:1001269 (instrument vendor) e.g.: MS:1001270 (lab personnel) e.g.: MS:1001271 (researcher) Path /MzIdentML/AnalysisSoftwareList/AnalysisSoftware/ContactRole/Role MUST supply a *child* term of MS:1001266 (role type) one or more times e.g.: MS:1001267 (software vendor) e.g.: MS:1001268 (programmer) e.g.: MS:1001269 (instrument vendor) e.g.: MS:1001270 (lab personnel) e.g.: MS:1001271 (researcher) Path /MzIdentML/AnalysisSampleCollection/Sample/ContactRole/Role MUST supply a *child* term of MS:1001266 (role type) one or more times e.g.: MS:1001267 (software vendor) e.g.: MS:1001268 (programmer) e.g.: MS:1001269 (instrument vendor) e.g.: MS:1001270 (lab personnel) e.g.: MS:1001271 (researcher) |
||||||||
Example cvParams: | <cvParam cvRef="PSI-MS" accession="MS:1001271" name="researcher"></cvParam> |
Definition: | A description of the sample analysed by mass spectrometry using CVParams or UserParams. If a composite sample has been analysed, a parent sample should be defined, which references subsamples. This represents any kind of substance used in an experimental workflow, such as whole organisms, cells, DNA, solutions, compounds and experimental substances (gels, arrays etc.). | ||||||||||||||||||||
Type: | SampleType | ||||||||||||||||||||
Attributes: |
|
||||||||||||||||||||
Subelements: |
|
||||||||||||||||||||
Example Context: | |||||||||||||||||||||
cvParam Mapping Rules: | WARNING: There are no cvParam mapping rules for this element even though cvParam is an allowed subelement. |
Definition: | A database for searching mass spectra. Examples include a set of amino acid sequence entries, nucleotide databases (e.g. 6 frame translated) or annotated spectra libraries. | ||||||||||||||||||||||||||||||||
Type: | SearchDatabaseType | ||||||||||||||||||||||||||||||||
Attributes: |
|
||||||||||||||||||||||||||||||||
Subelements: |
|
||||||||||||||||||||||||||||||||
Example Context: | <SearchDatabase numDatabaseSequences="299106" location="fawaz_PXD000652_combined_concatenated_target_decoy.fasta" id="SearchDB_1"> <FileFormat> <cvParam cvRef="PSI-MS" accession="MS:1001348" name="FASTA format"></cvParam> </FileFormat> <DatabaseName> <userParam name="fawaz_PXD000652_combined_concatenated_target_decoy.fasta"></userParam> </DatabaseName> ... </SearchDatabase> |
||||||||||||||||||||||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/DataCollection/Inputs/SearchDatabase MAY supply a *child* term of MS:1000561 (data file checksum type) one or more times e.g.: MS:1000568 (MD5) e.g.: MS:1000569 (SHA-1) MAY supply a *child* term of MS:1001011 (search database details) one or more times e.g.: MS:1001014 (database local file path) e.g.: MS:1001015 (database original uri) e.g.: MS:1001016 (database version) e.g.: MS:1001017 (database release date) e.g.: MS:1001020 (DB filter taxonomy) e.g.: MS:1001021 (DB filter on accession numbers) e.g.: MS:1001022 (DB MW filter) e.g.: MS:1001023 (DB PI filter) e.g.: MS:1001024 (translation frame) e.g.: MS:1001025 (translation table) et al. |
Definition: | One of the search databases used. | ||||||||
Type: | SearchDatabaseRefType | ||||||||
Attributes: |
|
||||||||
Subelements: | none |
||||||||
Example Context: | <SearchDatabaseRef searchDatabase_ref="SearchDB_1"></SearchDatabaseRef> |
Definition: | Specification of a search modification as parameter for a spectra search. Contains the name of the modification, the mass, the specificity and whether it is a static modification. | ||||||||||||||||
Type: | SearchModificationType | ||||||||||||||||
Attributes: |
|
||||||||||||||||
Subelements: |
|
||||||||||||||||
Example Context: | <SearchModification fixedMod="true" massDelta="57.021465" residues="C"> <cvParam cvRef="UNIMOD" accession="UNIMOD:4" name="Carbamidomethyl"></cvParam> </SearchModification> |
||||||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/ModificationParams/SearchModification MAY supply term MS:1002509 (cross-link donor) only once MAY supply term MS:1002510 (cross-link acceptor) only once MAY supply a *child* term of UNIMOD:0 (unimod root node) only once MAY supply a *child* term of MS:1001471 (peptide modification details) only once e.g.: MS:1001460 (unknown modification) e.g.: MS:1001524 (fragment neutral loss) e.g.: MS:1001525 (precursor neutral loss) e.g.: MS:1001972 (PTM scoring algorithm version) e.g.: MS:1002028 (nucleic acid base modification) e.g.: MS:1002029 (original nucleic acid sequence) e.g.: MS:1002030 (modified nucleic acid sequence) MUST supply term MS:1001460 (unknown modification) only once MAY supply a *child* term of XLMOD:00002 (cross-linker related PTM) only once MAY supply term MS:1002504 (modification index) only once MAY supply a *child* term of XLMOD:00004 (cross-linker) only once MAY supply a *child* term of MOD:00000 (protein modification) only once |
||||||||||||||||
Example cvParams: | <cvParam cvRef="UNIMOD" accession="UNIMOD:4" name="Carbamidomethyl"></cvParam> <cvParam cvRef="UNIMOD" accession="UNIMOD:35" name="Oxidation"></cvParam> |
Definition: | The type of search performed e.g. PMF, Tag searches, MS-MS | ||||||||||||
Type: | ParamType | ||||||||||||
Attributes: | none |
||||||||||||
Subelements: |
|
||||||||||||
Example Context: | <SearchType> <cvParam cvRef="PSI-MS" accession="MS:1001083" name="ms-ms search"></cvParam> </SearchType> |
||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/SearchType MUST supply term MS:1001083 (ms-ms search) only once MUST supply term MS:1001081 (pmf search) only once MAY supply term MS:1001010 (de novo search) only once MAY supply term MS:1001031 (spectral library search) only once MUST supply term MS:1001584 (combined pmf + ms-ms search) only once MUST supply term MS:1001082 (tag search) only once |
||||||||||||
Example cvParams: | <cvParam cvRef="PSI-MS" accession="MS:1001083" name="ms-ms search"></cvParam> |
Definition: | The actual sequence of amino acids or nucleic acid. |
Type: | sequence |
Attributes: | none |
Subelements: | none |
Example Context: |
Definition: | The collection of sequences (DBSequence or Peptide) identified and their relationship between each other (PeptideEvidence) to be referenced elsewhere in the results. | ||||||||||||||||
Type: | SequenceCollectionType | ||||||||||||||||
Attributes: | none |
||||||||||||||||
Subelements: |
|
||||||||||||||||
Graphical Context: | ![]() |
||||||||||||||||
Example Context: | <SequenceCollection xmlns="http://psidev.info/psi/pi/mzIdentML/1.2"> <DBSequence searchDatabase_ref="SearchDB_1" accession="generic|A_ENSP00000462078|p:known" id="dbseq_generic|A_ENSP00000462078|p:known"></DBSequence> <DBSequence searchDatabase_ref="SearchDB_1" accession="generic|A_ENSP00000290100|p:known" id="dbseq_generic|A_ENSP00000290100|p:known"></DBSequence> <DBSequence searchDatabase_ref="SearchDB_1" accession="generic|A_ENSP00000468798|p:known" id="dbseq_generic|A_ENSP00000468798|p:known"></DBSequence> <DBSequence searchDatabase_ref="SearchDB_1" accession="generic|A_ENSP00000481360|p:known" id="dbseq_generic|A_ENSP00000481360|p:known"></DBSequence> <DBSequence searchDatabase_ref="SearchDB_1" accession="generic|B_GENSCAN00000011570|p:genscan" id="dbseq_generic|B_GENSCAN00000011570|p:genscan"></DBSequence> <DBSequence searchDatabase_ref="SearchDB_1" accession="generic|A_ENSP00000397445|p:known" id="dbseq_generic|A_ENSP00000397445|p:known"></DBSequence> ... </SequenceCollection> |
Definition: | Regular expression for specifying the enzyme cleavage site. |
Type: | xsd:string |
Attributes: | none |
Subelements: | none |
Example Context: |
Definition: | The name of the analysis software package, sourced from a CV if available. | ||||||||||||
Type: | ParamType | ||||||||||||
Attributes: | none |
||||||||||||
Subelements: |
|
||||||||||||
Example Context: | <SoftwareName> <cvParam cvRef="PSI-MS" accession="MS:1002237" name="mzidLib"></cvParam> </SoftwareName> |
||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/AnalysisSoftwareList/AnalysisSoftware/SoftwareName MUST supply a *child* term of MS:1001456 (analysis software) one or more times e.g.: MS:1000532 (Xcalibur) e.g.: MS:1000533 (Bioworks) e.g.: MS:1000534 (MassLynx) e.g.: MS:1000535 (FlexAnalysis) e.g.: MS:1000536 (Data Explorer) e.g.: MS:1000537 (4700 Explorer) e.g.: MS:1000539 (Voyager Biospectrometry Workstation System) e.g.: MS:1000551 (Analyst) e.g.: MS:1000600 (Proteios) e.g.: MS:1000601 (ProteinLynx Global Server) et al. |
||||||||||||
Example cvParams: | <cvParam cvRef="PSI-MS" accession="MS:1001475" name="OMSSA"></cvParam> <cvParam cvRef="PSI-MS" accession="MS:1002237" name="mzidLib"></cvParam> |
Definition: | A file from which this mzIdentML instance was created. | ||||||||||||||||||||
Type: | SourceFileType | ||||||||||||||||||||
Attributes: |
|
||||||||||||||||||||
Subelements: |
|
||||||||||||||||||||
Example Context: | <SourceFile location="example_files/55merge_omssa.omx" id="SourceFile_1"> <FileFormat> <cvParam cvRef="PSI-MS" accession="MS:1001400" name="OMSSA xml format"></cvParam> </FileFormat> </SourceFile> |
||||||||||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/DataCollection/Inputs/SourceFile MAY supply a *child* term of MS:1000561 (data file checksum type) one or more times e.g.: MS:1000568 (MD5) e.g.: MS:1000569 (SHA-1) |
Definition: | The specificity rules of the searched modification including for example the probability of a modification's presence or peptide or protein termini. Standard fixed or variable status should be provided by the attribute fixedMod. | ||||||||
Type: | SpecificityRulesType | ||||||||
Attributes: | none |
||||||||
Subelements: |
|
||||||||
Example Context: | |||||||||
cvParam Mapping Rules: | Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/ModificationParams/SearchModification/SpecificityRules MUST supply a *child* term of MS:1001056 (modification specificity rule) only once e.g.: MS:1001189 (modification specificity peptide N-term) e.g.: MS:1001190 (modification specificity peptide C-term) e.g.: MS:1001875 (modification motif) e.g.: MS:1001876 (modification probability) e.g.: MS:1002057 (modification specificity protein N-term) e.g.: MS:1002058 (modification specificity protein C-term) |
Definition: | A data set containing spectra data (consisting of one or more spectra). | ||||||||||||||||
Type: | SpectraDataType | ||||||||||||||||
Attributes: |
|
||||||||||||||||
Subelements: |
|
||||||||||||||||
Example Context: | <SpectraData location="/home/fawaz/mgf/HV_map_B1.mgf" id="SD_COMBINED_SE_0"> <FileFormat> <cvParam cvRef="PSI-MS" accession="MS:1001062" name="Mascot MGF format"></cvParam> </FileFormat> <SpectrumIDFormat> <cvParam cvRef="PSI-MS" accession="MS:1000774" name="multiple peak list nativeID format"></cvParam> </SpectrumIDFormat> ... </SpectraData> |
Definition: | An Analysis which tries to identify peptides in input spectra, referencing the database searched, the input spectra, the output results and the protocol that is run. | ||||||||||||||||||||||||
Type: | SpectrumIdentificationType | ||||||||||||||||||||||||
Attributes: |
|
||||||||||||||||||||||||
Subelements: |
|
||||||||||||||||||||||||
Example Context: | <SpectrumIdentification spectrumIdentificationProtocol_ref="SearchProtocol_1" spectrumIdentificationList_ref="SII_LIST_1" id="SpecIdent_1"> <InputSpectra spectraData_ref="SD_COMBINED_SE_0"></InputSpectra> <SearchDatabaseRef searchDatabase_ref="SearchDB_1"></SearchDatabaseRef> </SpectrumIdentification> |
Definition: | An identification of a single (poly)peptide, resulting from querying an input spectra, along with the set of confidence values for that identification. PeptideEvidence elements should be given for all mappings of the corresponding Peptide sequence within protein sequences. | ||||||||||||||||||||||||||||||||||||||||||||||||
Type: | SpectrumIdentificationItemType | ||||||||||||||||||||||||||||||||||||||||||||||||
Attributes: |
|
||||||||||||||||||||||||||||||||||||||||||||||||
Subelements: |
|
||||||||||||||||||||||||||||||||||||||||||||||||
Graphical Context: | ![]() |
||||||||||||||||||||||||||||||||||||||||||||||||
Example Context: | <SpectrumIdentificationItem chargeState="4" experimentalMassToCharge="1123.974121" calculatedMassToCharge="1123.968707" peptide_ref="CRCQYSGVNNLCHTSSHCPNQGSTCENVDTCLKPDEPK_##Carbamidomethyl(C):1##Carbamidomethyl(C):3##Carbamidomethyl(C):12##Carbamidomethyl(C):18##Carbamidomethyl(C):25##Carbamidomethyl(C):31##Ammonia-loss(C):1" rank="1" passThreshold="false" id="SIR_7191_SII_1"> <PeptideEvidenceRef peptideEvidence_ref="CRCQYSGVNNLCHTSSHCPNQGSTCENVDTCLKPDEPK_generic|A_ENSP00000376692_REVERSED|p:novel_575_612"></PeptideEvidenceRef> <PeptideEvidenceRef peptideEvidence_ref="CRCQYSGVNNLCHTSSHCPNQGSTCENVDTCLKPDEPK_generic|B_GENSCAN00000036974_REVERSED|p:genscan_494_531"></PeptideEvidenceRef> <PeptideEvidenceRef peptideEvidence_ref="CRCQYSGVNNLCHTSSHCPNQGSTCENVDTCLKPDEPK_generic|A_ENSP00000471297_REVERSED|p:putative_641_678"></PeptideEvidenceRef> <PeptideEvidenceRef peptideEvidence_ref="CRCQYSGVNNLCHTSSHCPNQGSTCENVDTCLKPDEPK_generic|A_ENSP00000319883_REVERSED|p:known_633_670"></PeptideEvidenceRef> <PeptideEvidenceRef peptideEvidence_ref="CRCQYSGVNNLCHTSSHCPNQGSTCENVDTCLKPDEPK_generic|A_ENSP00000472280_REVERSED|p:putative_622_659"></PeptideEvidenceRef> <cvParam cvRef="PSI-MS" accession="MS:1002356" name="PSM-level combined FDRScore" value="0.4294078446305645"></cvParam> ... </SpectrumIdentificationItem> |
||||||||||||||||||||||||||||||||||||||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/DataCollection/AnalysisData/SpectrumIdentificationList/SpectrumIdentificationResult/SpectrumIdentificationItem MUST supply a *child* term of MS:1002549 (PTM localization distinct peptide-level statistic) one or more times e.g.: MS:1002550 (peptide:phosphoRS score) e.g.: MS:1002551 (peptide:Ascore) e.g.: MS:1002552 (peptide:H-Score) e.g.: MS:1002553 (peptide:D-Score) e.g.: MS:1002554 (peptide:MD-Score) MUST supply a *child* term of MS:1002538 (PTM localization confidence metric) one or more times e.g.: MS:1002539 (PeptideShaker PTM confidence type) MUST supply a *child* term of MS:1001968 (PTM localization PSM-level statistic) one or more times e.g.: MS:1001969 (phosphoRS score) e.g.: MS:1001970 (phosphoRS sequence probability) e.g.: MS:1001971 (phosphoRS site probability) e.g.: MS:1001974 (DeBunker:score) e.g.: MS:1001978 (MSQuant:PTM-score) e.g.: MS:1001979 (MaxQuant:PTM Score) e.g.: MS:1001980 (MaxQuant:Phospho (STY) Probabilities) e.g.: MS:1001981 (MaxQuant:Phospho (STY) Score Diffs) e.g.: MS:1001982 (MaxQuant:P-site localization probability) e.g.: MS:1001983 (MaxQuant:PTM Delta Score) et al. MUST supply a *child* term of MS:1002555 (PTM localization score threshold) one or more times e.g.: MS:1002556 (Ascore threshold) e.g.: MS:1002557 (D-Score threshold) e.g.: MS:1002558 (MD-Score threshold) e.g.: MS:1002559 (H-Score threshold) e.g.: MS:1002560 (DeBunker:score threshold) e.g.: MS:1002561 (Mascot:PTM site assignment confidence threshold) e.g.: MS:1002562 (MSQuant:PTM-score threshold) e.g.: MS:1002563 (MaxQuant:PTM Score threshold) e.g.: MS:1002564 (MaxQuant:P-site localization probability threshold) e.g.: MS:1002565 (MaxQuant:PTM Delta Score threshold) et al. MAY supply a *child* term of MS:1002484 (peptide-level statistical threshold) one or more times e.g.: MS:1001448 (pep:FDR threshold) e.g.: MS:1001868 (distinct peptide-level q-value) MUST supply a *child* term of MS:1001143 (PSM-level search engine specific statistic) one or more times e.g.: MS:1001154 (SEQUEST:probability) e.g.: MS:1001155 (SEQUEST:xcorr) e.g.: MS:1001156 (SEQUEST:deltacn) e.g.: MS:1001160 (SEQUEST:sf) e.g.: MS:1001161 (SEQUEST:matched ions) e.g.: MS:1001162 (SEQUEST:total ions) e.g.: MS:1001171 (Mascot:score) e.g.: MS:1001172 (Mascot:expectation value) e.g.: MS:1001173 (Mascot:matched ions) e.g.: MS:1001174 (Mascot:total ions) et al. MAY supply a *child* term of MS:1001405 (spectrum identification result details) one or more times e.g.: MS:1000016 (scan start time) e.g.: MS:1000796 (spectrum title) e.g.: MS:1000797 (peak list scans) e.g.: MS:1000798 (peak list raw scans) e.g.: MS:1000903 (product ion series ordinal) e.g.: MS:1000904 (product ion m/z delta) e.g.: MS:1000926 (product interpretation rank) e.g.: MS:1001030 (number of peptide seqs compared to each spectrum) e.g.: MS:1001035 (date / time search performed) e.g.: MS:1001036 (search time taken) et al. |
Definition: | Reference(s) to the SpectrumIdentificationItem element(s) that support the given PeptideEvidence element. Using these references it is possible to indicate which spectra were actually accepted as evidence for this peptide identification in the given protein. | ||||||||
Type: | SpectrumIdentificationItemRefType | ||||||||
Attributes: |
|
||||||||
Subelements: | none |
||||||||
Example Context: |
Definition: | Represents the set of all search results from SpectrumIdentification. | ||||||||||||||||||||
Type: | SpectrumIdentificationListType | ||||||||||||||||||||
Attributes: |
|
||||||||||||||||||||
Subelements: |
|
||||||||||||||||||||
Graphical Context: | ![]() |
||||||||||||||||||||
Example Context: | <SpectrumIdentificationList xmlns="http://psidev.info/psi/pi/mzIdentML/1.2" id="SII_LIST_1"> <SpectrumIdentificationResult spectrumID="index=6451" spectraData_ref="SD_COMBINED_SE_0" id="SIR_8947"> <SpectrumIdentificationItem chargeState="2" experimentalMassToCharge="679.817322" calculatedMassToCharge="679.818488" peptide_ref="AVMDDFAAFVEK_##Oxidation(M):3" rank="1" passThreshold="false" id="SIR_8947_SII_1"> <PeptideEvidenceRef peptideEvidence_ref="AVMDDFAAFVEK_generic|A_ENSP00000401820|p:known_378_389"></PeptideEvidenceRef> <PeptideEvidenceRef peptideEvidence_ref="AVMDDFAAFVEK_generic|A_ENSP00000421027|p:putative_420_431"></PeptideEvidenceRef> <PeptideEvidenceRef peptideEvidence_ref="AVMDDFAAFVEK_generic|A_ENSP00000483421|p:known_357_368"></PeptideEvidenceRef> <PeptideEvidenceRef peptideEvidence_ref="AVMDDFAAFVEK_generic|A_ENSP00000480485|p:known_357_368"></PeptideEvidenceRef> ... </SpectrumIdentificationList> |
||||||||||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/DataCollection/AnalysisData/SpectrumIdentificationList MAY supply a *child* term of MS:1001184 (search statistics) one or more times e.g.: MS:1001035 (date / time search performed) e.g.: MS:1001036 (search time taken) e.g.: MS:1001177 (number of molecular hypothesis considered) |
Definition: | The parameters and settings of a SpectrumIdentification analysis. | ||||||||||||||||||||||||||||||||||||||||||||
Type: | SpectrumIdentificationProtocolType | ||||||||||||||||||||||||||||||||||||||||||||
Attributes: |
|
||||||||||||||||||||||||||||||||||||||||||||
Subelements: |
|
||||||||||||||||||||||||||||||||||||||||||||
Graphical Context: | ![]() |
||||||||||||||||||||||||||||||||||||||||||||
Example Context: | <SpectrumIdentificationProtocol analysisSoftware_ref="ID_software" id="SearchProtocol_1"> <SearchType> <cvParam cvRef="PSI-MS" accession="MS:1001083" name="ms-ms search"></cvParam> </SearchType> <AdditionalSearchParams> <cvParam cvRef="PSI-MS" accession="MS:1001211" name="parent mass type mono"></cvParam> <cvParam cvRef="PSI-MS" accession="MS:1001256" name="fragment mass type mono"></cvParam> ... </SpectrumIdentificationProtocol> |
Definition: | All identifications made from searching one spectrum. For PMF data, all peptide identifications will be listed underneath as SpectrumIdentificationItems. For MS/MS data, there will be ranked SpectrumIdentificationItems corresponding to possible different peptide IDs. | ||||||||||||||||||||
Type: | SpectrumIdentificationResultType | ||||||||||||||||||||
Attributes: |
|
||||||||||||||||||||
Subelements: |
|
||||||||||||||||||||
Example Context: | <SpectrumIdentificationResult spectrumID="index=7665" spectraData_ref="SD_COMBINED_SE_0" id="SIR_7191"> <SpectrumIdentificationItem chargeState="4" experimentalMassToCharge="1123.974121" calculatedMassToCharge="1123.968707" peptide_ref="CRCQYSGVNNLCHTSSHCPNQGSTCENVDTCLKPDEPK_##Carbamidomethyl(C):1##Carbamidomethyl(C):3##Carbamidomethyl(C):12##Carbamidomethyl(C):18##Carbamidomethyl(C):25##Carbamidomethyl(C):31##Ammonia-loss(C):1" rank="1" passThreshold="false" id="SIR_7191_SII_1"> <PeptideEvidenceRef peptideEvidence_ref="CRCQYSGVNNLCHTSSHCPNQGSTCENVDTCLKPDEPK_generic|A_ENSP00000376692_REVERSED|p:novel_575_612"></PeptideEvidenceRef> <PeptideEvidenceRef peptideEvidence_ref="CRCQYSGVNNLCHTSSHCPNQGSTCENVDTCLKPDEPK_generic|B_GENSCAN00000036974_REVERSED|p:genscan_494_531"></PeptideEvidenceRef> <PeptideEvidenceRef peptideEvidence_ref="CRCQYSGVNNLCHTSSHCPNQGSTCENVDTCLKPDEPK_generic|A_ENSP00000471297_REVERSED|p:putative_641_678"></PeptideEvidenceRef> <PeptideEvidenceRef peptideEvidence_ref="CRCQYSGVNNLCHTSSHCPNQGSTCENVDTCLKPDEPK_generic|A_ENSP00000319883_REVERSED|p:known_633_670"></PeptideEvidenceRef> <PeptideEvidenceRef peptideEvidence_ref="CRCQYSGVNNLCHTSSHCPNQGSTCENVDTCLKPDEPK_generic|A_ENSP00000472280_REVERSED|p:putative_622_659"></PeptideEvidenceRef> ... </SpectrumIdentificationResult> |
||||||||||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/DataCollection/AnalysisData/SpectrumIdentificationList/SpectrumIdentificationResult MAY supply term MS:1000894 (retention time) only once MAY supply a *child* term of MS:1001405 (spectrum identification result details) one or more times e.g.: MS:1000016 (scan start time) e.g.: MS:1000796 (spectrum title) e.g.: MS:1000797 (peak list scans) e.g.: MS:1000798 (peak list raw scans) e.g.: MS:1000903 (product ion series ordinal) e.g.: MS:1000904 (product ion m/z delta) e.g.: MS:1000926 (product interpretation rank) e.g.: MS:1001030 (number of peptide seqs compared to each spectrum) e.g.: MS:1001035 (date / time search performed) e.g.: MS:1001036 (search time taken) et al. |
Definition: | The format of the spectrum identifier within the source file | ||||||||
Type: | SpectrumIDFormatType | ||||||||
Attributes: | none |
||||||||
Subelements: |
|
||||||||
Example Context: | <SpectrumIDFormat> <cvParam cvRef="PSI-MS" accession="MS:1000774" name="multiple peak list nativeID format"></cvParam> </SpectrumIDFormat> |
||||||||
cvParam Mapping Rules: | Path /MzIdentML/DataCollection/Inputs/SpectraData/SpectrumIDFormat MUST supply a *child* term of MS:1000767 (native spectrum identifier format) only once e.g.: MS:1000768 (Thermo nativeID format) e.g.: MS:1000769 (Waters nativeID format) e.g.: MS:1000770 (WIFF nativeID format) e.g.: MS:1000771 (Bruker/Agilent YEP nativeID format) e.g.: MS:1000772 (Bruker BAF nativeID format) e.g.: MS:1000773 (Bruker FID nativeID format) e.g.: MS:1000774 (multiple peak list nativeID format) e.g.: MS:1000775 (single peak list nativeID format) e.g.: MS:1000776 (scan number only nativeID format) e.g.: MS:1000777 (spectrum identifier nativeID format) et al. MUST supply a *child* term of MS:1001529 (spectra data details) only once e.g.: MS:1001530 (mzML unique identifier) e.g.: MS:1001531 (spectrum from ProteinScape database nativeID format) e.g.: MS:1001532 (spectrum from database string nativeID format) |
||||||||
Example cvParams: | <cvParam cvRef="PSI-MS" accession="MS:1000774" name="multiple peak list nativeID format"></cvParam> |
Definition: | References to the individual component samples within a mixed parent sample. | ||||||||
Type: | SubSampleType | ||||||||
Attributes: |
|
||||||||
Subelements: | none |
||||||||
Example Context: |
Definition: | A modification where one residue is substituted by another (amino acid change). | ||||||||||||||||||||||||
Type: | SubstitutionModificationType | ||||||||||||||||||||||||
Attributes: |
|
||||||||||||||||||||||||
Subelements: | none |
||||||||||||||||||||||||
Example Context: |
Definition: | Depending on context: 1: The threshold(s) applied to determine that a result is significant. If multiple terms are used it is assumed that all conditions are satisfied by the passing results. 2: The threshold(s) applied to determine that a result is significant. If multiple terms are used it is assumed that all conditions are satisfied by the passing results. |
||||||||||||
Type: | ParamListType | ||||||||||||
Attributes: | none |
||||||||||||
Subelements: |
|
||||||||||||
Example Context: | <Threshold> <cvParam cvRef="PSI-MS" accession="MS:1001494" name="no threshold"></cvParam> </Threshold> |
||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/Threshold MUST supply a *child* term of MS:1002703 (peptide sequence-level result list statistic) one or more times e.g.: MS:1001364 (peptide sequence-level global FDR) e.g.: MS:1002462 (peptide sequence-level global FNR) e.g.: MS:1002463 (peptide sequence-level global confidence) MAY supply a *child* term of MS:1001302 (search engine specific input parameter) one or more times e.g.: MS:1001005 (SEQUEST:CleavesAt) WARNING: Term has no definition! e.g.: MS:1001007 (SEQUEST:OutputLines) e.g.: MS:1001009 (SEQUEST:DescriptionLines) e.g.: MS:1001026 (SEQUEST:NormalizeXCorrValues) WARNING: Term has no definition! e.g.: MS:1001028 (SEQUEST:SequenceHeaderFilter) e.g.: MS:1001032 (SEQUEST:SequencePartialFilter) WARNING: Term has no definition! e.g.: MS:1001037 (SEQUEST:ShowFragmentIons) e.g.: MS:1001038 (SEQUEST:Consensus) e.g.: MS:1001042 (SEQUEST:LimitTo) e.g.: MS:1001046 (SEQUEST:sort by dCn) et al. MAY supply a *child* term of MS:1002347 (PSM-level identification statistic) one or more times e.g.: MS:1001154 (SEQUEST:probability) e.g.: MS:1001155 (SEQUEST:xcorr) e.g.: MS:1001156 (SEQUEST:deltacn) e.g.: MS:1001160 (SEQUEST:sf) e.g.: MS:1001161 (SEQUEST:matched ions) e.g.: MS:1001162 (SEQUEST:total ions) e.g.: MS:1001171 (Mascot:score) e.g.: MS:1001172 (Mascot:expectation value) e.g.: MS:1001173 (Mascot:matched ions) e.g.: MS:1001174 (Mascot:total ions) et al. MUST supply a *child* term of MS:1002363 (search engine specific score for proteins) one or more times e.g.: MS:1001157 (SEQUEST:sp) e.g.: MS:1001158 (SEQUEST:Uniq) WARNING: Term has no definition! e.g.: MS:1001164 (Paragon:unused protscore) e.g.: MS:1001165 (Paragon:total protscore) e.g.: MS:1001166 (Paragon:score) e.g.: MS:1001167 (Paragon:confidence) e.g.: MS:1001168 (Paragon:expression error factor) e.g.: MS:1001169 (Paragon:expression change p-value) e.g.: MS:1001170 (Paragon:contrib) e.g.: MS:1001372 (SEQUEST:Sequences) WARNING: Term has no definition! et al. MUST supply a *child* term of MS:1002573 (spectrum identification statistical threshold) one or more times e.g.: MS:1001448 (pep:FDR threshold) e.g.: MS:1001868 (distinct peptide-level q-value) e.g.: MS:1002260 (PSM:FDR threshold) MAY supply a *child* term of MS:1002358 (search engine specific peptide sequence-level identification statistic) one or more times e.g.: MS:1001396 (Phenyx:PepPvalue) e.g.: MS:1002360 (distinct peptide-level FDRScore) e.g.: MS:1002361 (distinct peptide-level combined FDRScore) e.g.: MS:1002468 (PeptideShaker peptide score) e.g.: MS:1002499 (peptide level score) e.g.: MS:1002541 (PeptideShaker peptide confidence type) MUST supply a *child* term of MS:1002701 (PSM-level result list statistic) one or more times e.g.: MS:1002350 (PSM-level global FDR) e.g.: MS:1002464 (PSM-level global FNR) e.g.: MS:1002465 (PSM-level global confidence) e.g.: MS:1002501 (no PSM threshold) MAY supply a *child* term of MS:1002555 (PTM localization score threshold) one or more times e.g.: MS:1002556 (Ascore threshold) e.g.: MS:1002557 (D-Score threshold) e.g.: MS:1002558 (MD-Score threshold) e.g.: MS:1002559 (H-Score threshold) e.g.: MS:1002560 (DeBunker:score threshold) e.g.: MS:1002561 (Mascot:PTM site assignment confidence threshold) e.g.: MS:1002562 (MSQuant:PTM-score threshold) e.g.: MS:1002563 (MaxQuant:PTM Score threshold) e.g.: MS:1002564 (MaxQuant:P-site localization probability threshold) e.g.: MS:1002565 (MaxQuant:PTM Delta Score threshold) et al. MUST supply a *child* term of MS:1001153 (search engine specific score) one or more times e.g.: MS:1001147 (protein ambiguity group result details) WARNING: Term has no definition! e.g.: MS:1001155 (SEQUEST:xcorr) e.g.: MS:1001159 (SEQUEST:expectation value) e.g.: MS:1001163 (SEQUEST:consensus score) e.g.: MS:1001874 (FDRScore) e.g.: MS:1002052 (MS-GF:SpecEValue) e.g.: MS:1002053 (MS-GF:EValue) e.g.: MS:1002125 (combined FDRScore) e.g.: MS:1002252 (Comet:xcorr) e.g.: MS:1002257 (Comet:expectation value) et al. MAY supply a *child* term of MS:1001060 (quality estimation method details) one or more times e.g.: MS:1001058 (quality estimation by manual validation) e.g.: MS:1001194 (quality estimation with decoy database) e.g.: MS:1001447 (prot:FDR threshold) e.g.: MS:1001448 (pep:FDR threshold) e.g.: MS:1001454 (quality estimation with implicite decoy sequences) e.g.: MS:1001494 (no threshold) e.g.: MS:1001574 (report only spectra assigned to identified proteins) e.g.: MS:1001868 (distinct peptide-level q-value) e.g.: MS:1002260 (PSM:FDR threshold) e.g.: MS:1002486 (protein group-level statistical threshold) MUST supply term MS:1001494 (no threshold) one or more times MAY supply a *child* term of MS:1002484 (peptide-level statistical threshold) one or more times e.g.: MS:1001448 (pep:FDR threshold) e.g.: MS:1001868 (distinct peptide-level q-value) Path /MzIdentML/AnalysisProtocolCollection/ProteinDetectionProtocol/Threshold MUST supply a *child* term of MS:1001302 (search engine specific input parameter) one or more times e.g.: MS:1001005 (SEQUEST:CleavesAt) WARNING: Term has no definition! e.g.: MS:1001007 (SEQUEST:OutputLines) e.g.: MS:1001009 (SEQUEST:DescriptionLines) e.g.: MS:1001026 (SEQUEST:NormalizeXCorrValues) WARNING: Term has no definition! e.g.: MS:1001028 (SEQUEST:SequenceHeaderFilter) e.g.: MS:1001032 (SEQUEST:SequencePartialFilter) WARNING: Term has no definition! e.g.: MS:1001037 (SEQUEST:ShowFragmentIons) e.g.: MS:1001038 (SEQUEST:Consensus) e.g.: MS:1001042 (SEQUEST:LimitTo) e.g.: MS:1001046 (SEQUEST:sort by dCn) et al. MAY supply a *child* term of MS:1002706 (protein group-level result list statistic) one or more times e.g.: MS:1002369 (protein group-level global FDR) e.g.: MS:1002460 (protein group-level global FNR) MUST supply a *child* term of MS:1002572 (protein detection statistical threshold) one or more times e.g.: MS:1001447 (prot:FDR threshold) e.g.: MS:1002486 (protein group-level statistical threshold) MAY supply a *child* term of MS:1002701 (PSM-level result list statistic) one or more times e.g.: MS:1002350 (PSM-level global FDR) e.g.: MS:1002464 (PSM-level global FNR) e.g.: MS:1002465 (PSM-level global confidence) e.g.: MS:1002501 (no PSM threshold) MUST supply a *child* term of MS:1001153 (search engine specific score) one or more times e.g.: MS:1001147 (protein ambiguity group result details) WARNING: Term has no definition! e.g.: MS:1001155 (SEQUEST:xcorr) e.g.: MS:1001159 (SEQUEST:expectation value) e.g.: MS:1001163 (SEQUEST:consensus score) e.g.: MS:1001874 (FDRScore) e.g.: MS:1002052 (MS-GF:SpecEValue) e.g.: MS:1002053 (MS-GF:EValue) e.g.: MS:1002125 (combined FDRScore) e.g.: MS:1002252 (Comet:xcorr) e.g.: MS:1002257 (Comet:expectation value) et al. MAY supply a *child* term of MS:1002555 (PTM localization score threshold) one or more times e.g.: MS:1002556 (Ascore threshold) e.g.: MS:1002557 (D-Score threshold) e.g.: MS:1002558 (MD-Score threshold) e.g.: MS:1002559 (H-Score threshold) e.g.: MS:1002560 (DeBunker:score threshold) e.g.: MS:1002561 (Mascot:PTM site assignment confidence threshold) e.g.: MS:1002562 (MSQuant:PTM-score threshold) e.g.: MS:1002563 (MaxQuant:PTM Score threshold) e.g.: MS:1002564 (MaxQuant:P-site localization probability threshold) e.g.: MS:1002565 (MaxQuant:PTM Delta Score threshold) et al. MAY supply term MS:1001494 (no threshold) only once MAY supply a *child* term of MS:1002485 (protein-level statistical threshold) one or more times e.g.: MS:1001447 (prot:FDR threshold) |
||||||||||||
Example cvParams: | <cvParam cvRef="PSI-MS" accession="MS:1001494" name="no threshold"></cvParam> |
Definition: | The table used to translate codons into nucleic acids e.g. by reference to the NCBI translation table. | ||||||||||||
Type: | TranslationTableType | ||||||||||||
Attributes: |
|
||||||||||||
Subelements: |
|
||||||||||||
Example Context: | |||||||||||||
cvParam Mapping Rules: | Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/DatabaseTranslation/TranslationTable MUST supply term MS:1001410 (translation start codons) only once MUST supply term MS:1001025 (translation table) only once MUST supply term MS:1001423 (translation table description) only once |
Definition: | In case more information about the ions annotation has to be conveyed, that has no fit in FragmentArray. Note: It is suggested that the value attribute takes the form of a list of the same size as FragmentArray values. However, there is no formal encoding and it cannot be expeceted that other software will process or impart that information properly. | ||||||||||||||||||||||||||||
Type: | UserParamType | ||||||||||||||||||||||||||||
Attributes: |
|
||||||||||||||||||||||||||||
Subelements: | none |
||||||||||||||||||||||||||||
Example Context: | <userParam name="fawaz_PXD000652_combined_concatenated_target_decoy.fasta"></userParam> |