Krzysztof Bochenek
1994
Born on 1994-02-05.

Genome, lost genes (pLoF mutations): SNP, INDEL

Genome, clinical annotations (only pathogenic and protective ClinVar mutations): SNP, INDEL

chr: chromosome; position: position on the chromosome (hg38 coordinates); dbsnpid: dbSNP ID of the variant (mutation); found: number of genomes in this database that carry this variant; who: up to 10 genomes from this database with this variant, sorted by similarity; freq: maximum frequency of this variant in public databases; homozygous: variant present in both copies of the genome; gene: gene name; effect and description: description from the ClinVar database; note: flags variants with a high frequency in public databases (freq) or in this database; comment: clinical annotations will change a lot in the future, whereas the annotation of lost genes may not change.

Circos


The outer ring shows chromosomal information.
The second ring shows read coverage as a histogram. Each bar is the average coverage of a 0.5 Mbp region.
The third ring shows indel density as a scatter plot. Each black dot is the number of indels in a 1 Mbp window.
The fourth ring shows SNP density as a scatter plot. Each green dot is the number of SNPs in a 1 Mbp window.
The fifth ring shows the proportion of homozygous SNPs (orange) and heterozygous SNPs (grey) as a histogram. Each bar is calculated from a 1 Mbp region.
The sixth ring shows the CNV inference. Red means gain and green means loss.
The innermost ring shows the SV inference in exonic and splicing regions. If SVs are called with BreakDancer or CREST, the colours are CTX (orange), INS (green), DEL (grey), ITX (pink) and INV (blue). If SVs are called with DELLY, the colours are TRA (orange), INS (green), DEL (grey), DUP (pink) and INV (blue).
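The density rings count variants in fixed 1 Mbp windows. A minimal sketch of that binning is shown here, assuming 1-based positions and windows that start at the beginning of each chromosome. The function name `density_per_window` and the example positions are hypothetical and are not taken from the report or from the plotting tool.

```python
from collections import Counter

WINDOW_BP = 1_000_000  # 1 Mbp window, as in the ring descriptions


def density_per_window(variants, window_bp=WINDOW_BP):
    """Count variants per (chromosome, window index).

    `variants` is an iterable of (chromosome, 1-based position) pairs.
    Window 0 covers positions 1..window_bp, window 1 the next window_bp, etc.
    """
    counts = Counter()
    for chrom, pos in variants:
        counts[(chrom, (pos - 1) // window_bp)] += 1
    return counts


# Hypothetical positions, for illustration only.
example = [
    ("chr1", 10_500),
    ("chr1", 999_999),
    ("chr1", 1_000_001),
    ("chr2", 2_500_000),
]
density = density_per_window(example)
for (chrom, window), n in sorted(density.items()):
    print(chrom, window, n)
```

Each resulting count corresponds to one dot in a scatter ring. The same grouping, applied separately to homozygous and heterozygous SNPs, would give the per-window proportions of the fifth ring.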

Similar genomes (max total of 3686353 used for calculating match%)

who (max 20)    match      total      match%
                2437050    3692695    66.11%
                2435687    3700891    66.07%
                2434353    3705093    66.04%
                2433664    3704046    66.02%
                2432744    3701675    65.99%
                2430848    3686804    65.94%
                2428306    3694261    65.87%
                2426687    3693003    65.83%
                2426199    3699992    65.82%
                2425495    3689762    65.80%
                2425367    3704620    65.79%
                2425329    3891969    65.79%
                2425187    3701386    65.79%
                2424564    3700057    65.77%
                2423742    3689285    65.75%
                2422688    3701289    65.72%
                2422618    3690296    65.72%
                2418653    3693179    65.61%
                2418281    3696098    65.60%
                2417841    3698167    65.59%

who: up to 20 most similar genomes in this database, sorted by number of shared SNP variants; match: number of shared SNP variants; total: total number of SNP variants in the similar genome; match%: similarity = (shared variants) / min(total variants in the tested genome, total variants in the similar genome);
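The similarity formula can be checked against the first row of the table. This is a minimal sketch that assumes the 3686353 in the section heading is the SNP total of the tested genome. The function name `match_percent` is hypothetical.

```python
def match_percent(common, total_tested, total_similar):
    """Similarity in percent: shared variants over the smaller variant total."""
    return 100.0 * common / min(total_tested, total_similar)


# Assumption: the total quoted in the heading belongs to the tested genome.
TESTED_TOTAL = 3686353

# First row of the table: match = 2437050, total = 3692695.
first_row = match_percent(2437050, TESTED_TOTAL, 3692695)
print(f"{first_row:.2f}%")  # prints 66.11%
```

Dividing by the smaller of the two totals keeps the score from being pulled down when one genome simply has more called variants than the other.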

Statistics

sequencing quality
Raw reads: 8865262
Raw data(G): 99.89
Effective(%): 0.03
Error(%): 97.19
Q20(%): 92.21
Q30(%): 44.03
GC(%): 43.81
mapping, coverage and depth
Total: 1204965204 (100%)
Duplicate: 154353249 (20.48%)
Mapped: 753655092 (62.55%)
Properly mapped: 733250398 (60.85%)
PE mapped: 752542222 (62.45%)
SE mapped: 2225740 (0.18%)
With mate on different chr: 15857940 (1.32%)
With mate on different chr (mapQ>=5): 10626783 (0.88%)
Average_sequencing_depth: 37.92
Coverage: 99.69%
Coverage_at_least_4X: 99.18%
Coverage_at_least_10X: 97.99%
Coverage_at_least_20X: 91.84%
number of SNPs
CDS: 23564
synonymous_SNP: 11886
missense_SNP: 11325
stopgain: 98
stoploss: 13
unknown: 259
intronic: 1284528
UTR3: 26828
UTR5: 6144
splicing: 71
ncRNA_exonic: 14748
ncRNA_intronic: 231603
ncRNA_splicing: 75
upstream: 23499
downstream: 24509
intergenic: 2167696
Total: 3804319
feature of SNPs
Total: 3804319
Het: 2336794
Hom: 1467525
transition: 2544115
transversion: 1260204
ts/tv: 2.02
dbSNP percentage: 3680600 (96.75%)
novel: 123719
novel ts: 69259
novel tv: 54460
novel ts/tv: 1.27
number of InDels
CDS: 716
frameshift_deletion: 161
frameshift_insertion: 118
nonframeshift_deletion: 220
nonframeshift_insertion: 194
stopgain: 7
stoploss: 1
unknown: 21
intronic: 310184
UTR3: 7117
UTR5: 1078
splicing: 54
ncRNA_exonic: 2072
ncRNA_intronic: 52163
ncRNA_splicing: 14
upstream: 6027
downstream: 6541
intergenic: 453048
Total: 839270
feature of InDels
Total: 839270
Het: 534889
Hom: 304381
dbSNP percentage: 738769 (88.03%)
novel: 100501
structural variants
DUP: 1126
INV: 1058
INS: 78
DEL: 3972
BND: 1323
copy number variants
gain_count: 27
gain_size: 1538000
loss_count: 217
loss_size: 49112000
total_count: 244
total_size: 50650000
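Several of the SNP figures above are derived from the others, so they can be recomputed as a consistency check. The sketch below uses the counts from the "feature of SNPs" block. The dictionary layout and variable names are hypothetical, and the formulas are the usual definitions (ts/tv as transitions divided by transversions, novel as total minus dbSNP), not a confirmed description of how the report was generated.

```python
# Counts copied from the "feature of SNPs" block.
snp = {
    "total": 3804319,
    "het": 2336794,
    "hom": 1467525,
    "ts": 2544115,  # transitions
    "tv": 1260204,  # transversions
    "dbsnp": 3680600,
    "novel_ts": 69259,
    "novel_tv": 54460,
}

het_plus_hom = snp["het"] + snp["hom"]            # should equal the total
ts_tv = snp["ts"] / snp["tv"]                     # transition/transversion ratio
dbsnp_pct = 100.0 * snp["dbsnp"] / snp["total"]   # share already in dbSNP
novel = snp["total"] - snp["dbsnp"]               # variants absent from dbSNP
novel_ts_tv = snp["novel_ts"] / snp["novel_tv"]

print(het_plus_hom == snp["total"])  # prints True
print(f"ts/tv = {ts_tv:.2f}")        # prints ts/tv = 2.02
print(f"dbSNP = {dbsnp_pct:.2f}%")   # prints dbSNP = 96.75%
print(f"novel = {novel}")            # prints novel = 123719
print(f"novel ts/tv = {novel_ts_tv:.2f}")  # prints novel ts/tv = 1.27
```

The same arithmetic applies to the InDel block (Het + Hom = Total, novel = Total minus dbSNP) and to the copy number variants (gain plus loss equals total for both count and size).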
powered by BioInfoBank Institute and AfterLife Fund