why nucleotides is more than 4?
Show older comments
hi as I know the no. of nucleotides is 4 letters. why in matlab consider it 17 letters as in table here:
thanks
Accepted Answer
More Answers (1)
Lucio Cetto
on 19 Nov 2011
1 vote
Ambiguous nucleotide symbols are used to characterize sequences that can have variations. It was introduced in the 80's and they are useful nowadays in certain cases, for example describing restriction enzymes. (e.g. http://www.chem.qmul.ac.uk/iubmb/misc/naseq.html). In my personal opinion I think that there are other situations in which we have better options, such as sequence motifs, sequence profiles and the more elaborated profile HMMs. If you plan to convert to aa, Matlab can actually use also ambiguous aa codes when possible, although this is no longer a standard practice; most people now uses only ACGT.
Categories
Find more on Nucleotide Sequence Analysis in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!