Raw plot data for 'Six-frame' plot:
Likelihood ratios:
- Frame.
- Alignment coordinate of window centre.
- Likelihood ratio: 'Window is coding in addition to the input
Known CDS(s)' versus 'Only the input Known CDS(s) are coding'.
- Alignment coordinate of window start.
- Alignment coordinate of window end.
- Sum over sequence pairs of (number of nt used in window) x
(pairwise sequence divergence), where here the sequence divergence
of each pair is the mean number of mutations per nucleotide for
the whole sequence (rather than just in the window), and the
number of nt used is generally equal to the window length, but may
be reduced for gaps, ambiguous nt codes or stop to non-stop
transitions. This is basically a measure of how much data is
available in each window to calculate the likelihood ratio statistic;
lower values correspond to lower signal-to-noise.
Stop codon positions:
- Frame.
- Sequence ID.
- Alignment coordinate.
Start codon positions:
- Frame.
- Sequence ID.
- Alignment coordinate.
Gap (and ambiguous nt) positions:
- Sequence ID.
- Alignment coordinate.
Summed sequence divergences:
- Alignment coordinate.
- Divergence of contributing sequence pairs (i.e. not gapped etc), summed
over phylogenetic tree.