Legend of the output
General info
- Chr: the reference sequence for the human mtDNA is the Revised Cambridge Reference Sequence; NCBI accession number: NC_012920.1
- Start: genomic start position for the variant
- End: genomic end position for the variant
- Ref: reference nucleotide
- Alt: alternative nucleotide
- MitImpact id: variant identifier in MitImpact
- Gene symbol: official gene symbol
- Respiratory Chain complex: Respiratory Chain Complex which the mitochondrial protein belongs to
- Ensembl gene id: identifier from Ensembl Gene (ref)
- Ensembl protein id: identifier from Ensembl Protein
- Ensembl transcript id: identifier from Ensembl transcript
- Uniprot name: gene symbol in Uniprot (ref)
- Uniprot id: protein accession number in Uniprot
- NCBI gene id: gene identifier in NCBI Gene (ref)
- NCBI protein id: protein identifier in NCBI RefSeq Protein (ref)
- Gene position: variant position relative to the coordinates of the gene
- AA pos: amino acid position
- AA ref: reference amino acid
- AA alt: alternative amino acid
- Codon substitution: Codon substitution relative to the genomic variant
Conservation
- PhyloP 100V: PhyloP conservation index from UCSC Table Browser 2015
(ref). [min -20, max 10]
Predicted accelerated evolution: score <= 0;
Conserved: score > 0
- PhastCons 100V: PhastCons conservation index from UCSC Table Browser 2015
(ref). [min 0, max 1]
Non-conserved: score <= 0.7;
Conserved: score > 0.7 (soft threshold)
Pathogenicity predictors
- PolyPhen2: categorical prediction for PolyPhen ver. 2.2.2, HumDiv dataset (ref)
- PolyPhen2 score: [0.0-1.0]. Neutral: score <= 0.15; Possibly damaging: 0.15 < score <= 0.85; Probably damaging: score > 0.85
- SIFT: categorical prediction for SIFT ver. 5.0.3 (ref)
- SIFT score: [0.0-1.0]. Neutral: score > 0.05; Deleterious: score <= 0.05
- SIFT4G: categorical prediction for SIFT4G (ref)
- SIFT4G score: [0.0-1.0]. Neutral: score > 0.05; Deleterious: score <= 0.05
- FatHmm: categorical prediction for FatHmm ver. 2.2 (unweighted version) (ref)
- FatHmm score: [min -15, max 10]. Neutral: score > -3; Deleterious: score <= -3
- FatHmmW: categorical prediction for FatHmm ver. 2.3 (weighted version)
- FatHmmW score: [min -3, max 6]. Neutral: score > -1.5; Deleterious: score <= -1.5
- PROVEAN: categorical prediction for PROVEAN ver. 1.3 (ref)
- PROVEAN score: [min -14, max 14]. Neutral: score > -2.5; Deleterious: score <= -2.5 (soft threshold)
- MutationAssessor: categorical prediction for MutationAssessor ver. 2.0 (ref)
- MutationAssessor score: [min -6, max 6]. Neutral: score <= 0.8;
Low impact: 0.8 < score <= 1.9
Medium impact: 1.9 < score <= 3.5
High impact: score > 3.5
- EFIN SP: score for EFIN, trained with the SwissProt dataset (ref)
- EFIN SP score: [min 0, max 1]. Neutral: score > 0.6; Damaging: score <= 0.6
- EFIN HD: score for EFIN, trained with the HumDiv dataset (ref)
- EFIN HD score: [min 0, max 1]. Neutral: score > 0.28; Damaging: score <= 0.28
- VEST: categorical prediction for VEST ver. 1.3 (ref)
- VEST score: [min 0, max 1]. Neutral: p-value > 0.05; Pathogenic: p-value <= 0.05. The score was computed using CRAVAT 3.2 (ref)
- PANTHER: categorical prediction for PANTHER (ref)
- PANTHER score: [min 0, max 1]. Neutral: score < 0.5; Disease: score >= 0.5. The score was computed using Meta-SNP (ref)
- PhD-SNP: categorical prediction for PhD-SNP
- PhD-SNP score: [min 0, max 1]. Neutral: score < 0.5; Disease: score >= 0.5. The score was computed using Meta-SNP (ref)
- MutationTaster: Mutation Taster categorical prediction (ref)
- MutationTaster score: [min 0, max 1]. Polymorphism: score < 0.5; Disease causing: score >= 0.5
- CADD: categorical prediction for CADD ver. 1.3 (ref)
- CADD phred score: [min 0, max 35]. Neutral: score < 20 (soft threshold); Deleterious: score >= 20
- SNAP: categorical prediction for SNAP
- SNAP score: [min 0, max 1]. Neutral: score < 0.5; Disease: score >= 0.5. The score was computed using Meta-SNP (ref)
- Mitoclass.1: categorical prediction only for MitoClass.1 (ref)
- SNPDryad: categorical prediction for SNPDryad (ref)
- SNPDryad score: [min 0, max 1]. No hard-thresholds were indicated by authors. Indicatively: Neutral: score < 0.9; Pathogenic: score >= 0.9
Pathogenicity meta-predictors
- APOGEE1: categorical prediction for APOGEE1 [Neutral, Pathogenic] - (December 2022)
- APOGEE1 score:
Neutral: score ≤ 0.5;
Pathogenic: score > 0.5 - [min 0, max 1]
- APOGEE2: categorical prediction for APOGEE2 [Benign, Likely-benign, VUS, Likely-pathogenic, Pathogenic] - (December 2022)
- APOGEE2 score:
Benign: score ≤ 0.06157;
Likely-benign: 0.06157 < score ≤ 0.26529;
VUS: 0.26529 < score < 0.71613;
Likely-pathogenic: 0.71613 ≤ score < 0.90719;
Pathogenic: score ≥ 0.90719 - [min 0, max 1]
- APOGEE2 probability:
Benign: probability ≤ 0.001;
Likely-benign: 0.001 < probability ≤ 0.1;
VUS: 0.1 < probability < 0.9;
Likely-pathogenic: 0.9 ≤ probability < 0.99;
Pathogenic: probability ≥ 0.99 - [min 0, max 1]
- CAROL: categorical prediction for CAROL (ref)
- CAROL score: [min 0, max 1]. Neutral: score < 0.98; Deleterious: score >= 0.98
- Condel: categorical prediction for Condel (ref)
- Condel score: [min 0, max 1]. Neutral: score < 0.5; Deleterious: score >= 0.5
- COVEC WMV: categorical prediction for the COVEC ver. 0.4 Weighted Majority Rule consensus method (ref)
- COVEC WMV score: [min -6, max 6]. Neutral: score < 0; Deleterious: score > 0; Inaccurate prediction: score = 0
- Meta-SNP: categorical prediction for Meta-SNP (ref)
- Meta-SNP score: [min 0, max 1]. Neutral: score < 0.5; Disease: score >= 0.5
- MtoolBox: categorical prediction for MtoolBox (ref)
- MtoolBox DS: [min 0, max 1]. Neutral: score < 0.43; Deleterious: score >= 0.43
- DEOGEN2: DEOGEN2 categorical prediction (ref)
- DEOGEN2 score: [min 0, max 1]. Neutral: score < 0.5; Deleterious: score >= 0.5
Cancer-specific predictors
- PolyPhen2 transf: transformed PolyPhen2 categorical prediction by TransFIC ver. 1.0 (ref)
- PolyPhen2 transf score: [min -5, max 5]. Low impact: score <= -1; Medium impact: -1 < score < 1.5 (soft threshold); High impact: score >= 1.5 (soft threshold)
- SIFT transf: transformed SIFT categorical prediction
- SIFT transf score: [min -5, max 5]. Low impact: score < -1; Medium impact: -1 < score < 2 (soft threshold); High impact: score >= 2 (soft threshold)
- MutationAssessor transf: Transformed MutationAssessor categorical prediction
- MutationAssessor transf score: [min -5, max 5]. Low impact: score < 2; Medium impact: -1 < score < 2 (soft threshold); High impact: score >= 2 (soft threshold)
- CHASM: CHASM categorical rpediction, as computed by CRAVAT 3.2 (ref)
- CHASM FDR: [min 0, max 1]. Neutral: FDR > 0.2; Driver: FDR <= 0.2
Databases Phenotypes
- ClinVar July2022 Variation id: variation accession number
- ClinVar July2022 CLNSIG: variant clinical significance
(dbSNP-ClinVar, updated in July 2022)
- ClinVar July2022 CLNDN: disease name associated with a variant
- ClinVar July2022 CLNDISDB: disease database name for a variant
- MITOMAP Allele: allele description for a variant, as reported in the
MITOMAP database (July 2022)
- MITOMAP Disease Het/Hom: + if a variant was found in heteroplasmy (left) and/or homoplasmy (right)
- MITOMAP Disease Clinical info: phenotypes associated with a variant
- MITOMAP Disease Status: validation status for a variant (refer to this for details)
- MITOMAP Disease GenBank Freq: GenBank frequency information of a variant of the MITOMAP Disease set
- MITOMAP Disease GenBank Seqs: GenBank sequences where a variant of the MITOMAP Disease set was found
- MITOMAP Disease GenBank Curated refs: GenBank curated sequences where a variant of the MITOMAP Disease set was found
- MITOMAP General GenBank Freq: GenBank frequency information of a variant of the MITOMAP General set
- MITOMAP General GenBank Seqs: GenBank sequences where a variant of the MITOMAP General set was found
- MITOMAP General GenBank Curated refs: GenBank curated sequences where a variant of the MITOMAP General set was found
- gnomAD 3.1 filter: Reliability flag
- gnomAD 3.1 AC hom: Allele count of homoplasmic variants
- gnomAD 3.1 AC het: Allele count of heteroplasmic variants
- gnomAD 3.1 AF hom: Allelic frequency of homoplasmic variants
- gnomAD 3.1 AF het: Allelic frequency of heteroplasmic variants
- gnomAD 3.1 AN: Total alleles number
(Info)
- HelixMTdb AC hom: Allele count of homoplasmic variants
- HelixMTdb AF hom: Allelic frequency of homoplasmic variants
- HelixMTdb AC het: Allele count of heteroplasmic variants
- HelixMTdb AF het: Allelic frequency of heteroplasmic variants/li>
- HelixMTdb mean ARF: Mean Alternate Read Fraction (for heteroplasmic variants)
- HelixMTdb max ARF: Max Alternate Read Fraction (for heteroplasmic variants)
- COSMIC 90 id: accession number for a cancer-associated variant in COSMIC ver. 90
- dbSNP 155 id: accession number for a variant in dbSNP ver. 155
Residue interaction
- EVmutation: EVmutation score of pairs of amino acid variants. Scores are reported as gene:
pos1aa1-pos2
aa2 → score
(ref)§
- Site A-B InterP: predicted co-varying amino acid sites of two interacting proteins by
I-COMS, using the
corrected Mutual Information and Direct Coupling Analysis methods.
Any pair is described as protein1:pos1-protein2:pos2
§
- Site A-B IntraP: predicted co-varying amino acid sites located in the same protein by
I-COMS.
Any pair is described as protein1:pos1-protein1:
pos2§
- ΔΔG intraP: Free energy change of folding (ΔΔG, Kcal/mol) of pairs of co-varying amino acid variants
(protein1:var1:var2).
Upon selection, the righmost canvas will draw the 3D model of protein1, which will highlight both variants and
will report: the ΔΔG value of both variants if considered together; the ΔΔG value for var1; the ΔΔG value for var2
(ref)#
- ΔΔG intraP interface: ΔΔGbind of interaction energy. Both variants of the pair belong to
one of the proteins and are located in their interaction interface
(protein1:protein2:PDB ID:
PDB chain protein1:
PDB chain protein2:
var1:
var2). As from above, the rightmost canvas will display:
the ΔΔGbind value of both variants if considered together; the ΔΔGbind value for var1 in protein1;
the ΔΔGbind value for var2 in protein1 #.
- ΔΔG interP: ΔΔGbind of interaction energy calculated as the energetic difference between wild-type and mutant proteins.
Variants are located in different proteins
(protein1:protein2:PDB ID:
PDB chain protein1:
PDB chain protein2:
var1:
var2). As from above, the rightmost canvas will display:
the ΔΔGbind value of both variants if considered together; the ΔΔGbind value for var1 in protein1;
the ΔΔGbind value for var2 in protein2 #.
§ Co-variation scores (i.e., EV score, mfDCA, and
corrected MI) are not meant to prioritize one pair over another, since all reported pairs
of co-varying variants were selected from the top scoring pairs obtained as explained in the
Description page.
# ΔΔG values >±0.61 Kcal/mol are
indicative of disrupting variants. ΔΔG values close to zero (<±0.1 Kcal/mol) are
indicative of possibly compensating double mutants, as explained in the
Description page.
Compensated Pathogenic Deviations (ref, ref)
- Frequency: frequency of a CPD variant; it is the proportion of aligned orthologous sequences that carry a specific human pathogenic variant
as wild-type amino acid on the total number of aligned sequences. In our analysis, the total number of aligned orthologous sequences is 673 (for CYB, ND6)
or 674 (ND1, ND2, CO1, CO3) or 675 (all remaining genes)
- AA ref: reference amino acid for the CPD, considering the human protein reference sequence, as indicated in the General Info section
- AA alt: alternate amino acid found in the position specified by the aln pos field, within non-human mammalian species
(see species name, RefSeq Protein ID, and NCBI Taxon ID)
- Aln pos: position of the CPD within the ortholog proteins alignment; the reference is the human sequence
- RefSeq protein id: protein sequence identifier within the NCBI RefSeq Protein Database
- Species name: scientific species name carrying the CPD variant as wild-type amino acid
- NCBI taxon id: species identifier from the NCBI Taxonomy database