- DNA sequencing technologies
- Phred quality scores
- FASTQ quality control
- Sequencing as a random sampling
2019-10-18
@reference-seq-name1 ATCTATACTTTATCTTTATCTTTA + GFFFFBBBCBCCBBAAA:::;;;; @reference-seq-name2 ATTTTATCGCGTAGCTAGCTGGCT + FFFEEBBBCBCCBBAAA:::;;;;
\(S =\) “ATCG”, \(T =\) “TATCGTGA”
\(S =\) “AAA”, \(T =\) “AAAAAAAAAAAAAAAA”
\(S =\) “ACGG”, \(T =\) “GTGTGTGTGTGTGTGTGTGT”
See naive-alignment.py
.
How efficient is this method?
How many comparisons will we make, in the worst case?
Computer science: “big-O” notation
\(|S|\) comparisons for each shift of \(S\)
\(|T| - |S| + 1\) shifts in total
\(O(|S|(|T| - |S| + 1)) = O(|S||T|)\)
boyer-moore.py
..bt2
)