PromoSer alignment statistics

Source  Species   Analyzed   Mapped  Assigned
EPD     Human         1869   66.67%    68.49%
        Mouse          196   35.20%    40.82%
        Rat            119   21.01%    21.85%
RefSeq  Human        20468   96.69%    97.22%
        Mouse        16635   88.62%    90.46%
        Rat           4780   80.33%    81.55%
mRNA    Human       136794   81.02%    86.45%
        Mouse        50252   78.01%    86.30%
        Rat          11244   77.69%    81.22%
EST     Human      5471602   28.87%    74.75%
        Mouse      4056258   30.10%    74.09%
        Rat         517639   31.14%    52.28%
Gene    Human        73201    0.00%    58.69%
        Mouse        20973    0.00%    51.46%
        Rat           3988    0.00%    35.96%
Total             10386018   30.43%    73.49%

Analyzed: Total number of downloaded sequences that remained after keyword filtering.
Mapped: Sequences that passed level 1 filtering and were used for clustering (% of total analyzed).
Assigned: Mapped sequences plus those heuristically assigned to clusters (% of total analyzed).
Note that genomic records were not used for clustering and were always assigned to clusters afterwards.
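
As a consistency check on the table above, the Total row can be approximated from the per-source rows. The sketch below is illustrative only: it assumes the Total percentages are averages weighted by the Analyzed counts, and the names ROWS and weighted_percent are hypothetical. Because the published percentages are rounded to two decimals, the recomputed totals may differ from the table in the last digit.

```python
# (analyzed, mapped %, assigned %) per source and species, copied from the table.
ROWS = {
    ("EPD", "Human"): (1869, 66.67, 68.49),
    ("EPD", "Mouse"): (196, 35.20, 40.82),
    ("EPD", "Rat"): (119, 21.01, 21.85),
    ("RefSeq", "Human"): (20468, 96.69, 97.22),
    ("RefSeq", "Mouse"): (16635, 88.62, 90.46),
    ("RefSeq", "Rat"): (4780, 80.33, 81.55),
    ("mRNA", "Human"): (136794, 81.02, 86.45),
    ("mRNA", "Mouse"): (50252, 78.01, 86.30),
    ("mRNA", "Rat"): (11244, 77.69, 81.22),
    ("EST", "Human"): (5471602, 28.87, 74.75),
    ("EST", "Mouse"): (4056258, 30.10, 74.09),
    ("EST", "Rat"): (517639, 31.14, 52.28),
    ("Gene", "Human"): (73201, 0.00, 58.69),
    ("Gene", "Mouse"): (20973, 0.00, 51.46),
    ("Gene", "Rat"): (3988, 0.00, 35.96),
}


def weighted_percent(rows, column):
    """Overall percentage, weighting each row by its analyzed count.

    column is 1 for Mapped and 2 for Assigned.
    """
    analyzed = sum(row[0] for row in rows.values())
    hits = sum(row[0] * row[column] / 100 for row in rows.values())
    return 100 * hits / analyzed


total_analyzed = sum(row[0] for row in ROWS.values())
mapped_total = weighted_percent(ROWS, 1)
assigned_total = weighted_percent(ROWS, 2)

print(total_analyzed, round(mapped_total, 2), round(assigned_total, 2))
```

The Analyzed counts are integers, so their sum should reproduce the Total row exactly; the two percentages should agree with the table only up to rounding.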

Clustering statistics

Species    All   TSS 2-4    TSS 0    TSS 1    TSS 2    TSS 3   TSS 4
Human    46171     32514    9.81%   19.77%   38.61%   29.02%   2.79%
Mouse    38926     20470    6.78%   40.63%   19.57%   32.78%   0.24%
Rat      26973      4785   34.99%   47.27%    6.64%   11.01%   0.10%

The TSS columns give the number (TSS 2-4) or the fraction (TSS 0 to TSS 4) of all clusters whose highest-quality TSS is at the quality level shown.

[Figure: graph of the table data above. The x-axis represents the number of clusters.]
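
The relationship between the columns can be checked with a short sketch. It assumes that the TSS 2-4 count is the number of clusters whose best TSS has quality 2, 3 or 4, so it should be close to All multiplied by the sum of the TSS 2, TSS 3 and TSS 4 fractions. The names CLUSTERS and estimate_tss_2_4 are hypothetical, and small differences are expected because the fractions are rounded.

```python
# species -> (all clusters, listed TSS 2-4 count, [TSS 0..TSS 4 fractions in %])
CLUSTERS = {
    "Human": (46171, 32514, [9.81, 19.77, 38.61, 29.02, 2.79]),
    "Mouse": (38926, 20470, [6.78, 40.63, 19.57, 32.78, 0.24]),
    "Rat": (26973, 4785, [34.99, 47.27, 6.64, 11.01, 0.10]),
}


def estimate_tss_2_4(all_clusters, fractions):
    """Estimate the TSS 2-4 count from the rounded per-level fractions."""
    return all_clusters * sum(fractions[2:]) / 100


estimates = {}
for species, (all_clusters, listed, fractions) in CLUSTERS.items():
    estimates[species] = estimate_tss_2_4(all_clusters, fractions)
    print(f"{species}: listed {listed}, estimated {estimates[species]:.1f}")
```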


Genomic extensions for RefSeq data

Extension (bp)    Human    Mouse      Rat
<100             28.82%   37.95%   36.92%
<500             27.00%   27.14%   23.13%
<2000            12.11%    9.10%   11.72%
<5000             5.06%    6.09%    7.29%
<10000            4.63%    4.56%    6.13%
<20000            5.37%    4.40%    4.55%
<50000            7.14%    4.97%    5.59%
<100000           4.67%    2.79%    2.31%
<500000           4.56%    2.38%    2.13%
>500000           0.63%    0.63%    0.24%
>20              62.17%   62.79%   41.59%

The last row of the table shows the fraction of RefSeq sequences that had a genomic extension larger than 20 bp, out of all assigned RefSeq records (see top table). The distribution of those extensions (20 bp and above, as a fraction of all extensions over 20 bp) is shown in the figure.
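
Because the bin percentages are relative to extensions over 20 bp, expressing a bin as a share of all assigned RefSeq records requires multiplying by the last row. The sketch below illustrates that conversion. It assumes each bin covers extensions from the previous bound up to its own bound (so "<100" would mean more than 20 bp and under 100 bp); the names BINS, DISTRIBUTION, EXTENDED and share_of_all_assigned are hypothetical.

```python
BINS = ["<100", "<500", "<2000", "<5000", "<10000",
        "<20000", "<50000", "<100000", "<500000", ">500000"]

# Percent of extensions over 20 bp falling in each bin, copied from the table.
DISTRIBUTION = {
    "Human": [28.82, 27.00, 12.11, 5.06, 4.63, 5.37, 7.14, 4.67, 4.56, 0.63],
    "Mouse": [37.95, 27.14, 9.10, 6.09, 4.56, 4.40, 4.97, 2.79, 2.38, 0.63],
    "Rat": [36.92, 23.13, 11.72, 7.29, 6.13, 4.55, 5.59, 2.31, 2.13, 0.24],
}

# Percent of assigned RefSeq records with an extension over 20 bp (last row).
EXTENDED = {"Human": 62.17, "Mouse": 62.79, "Rat": 41.59}


def share_of_all_assigned(species):
    """Rescale each bin to a percentage of all assigned RefSeq records."""
    scale = EXTENDED[species] / 100
    return {b: round(p * scale, 2) for b, p in zip(BINS, DISTRIBUTION[species])}


for species in DISTRIBUTION:
    print(species, share_of_all_assigned(species))
```

Under these assumptions, the ten bins for each species should sum to roughly 100%, which is a quick way to confirm that the flattened table was read correctly.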