Not exact matches
Circles at end
of lines indicate loss
of the TE
allele in that species after ILS, as the sequence
assembly contains an empty TE insertion site (SM10).
We disregarded CGG10023, due to insufficient genome coverage and high error rate, and Twilight, which represents the Thoroughbred horse used for generating the EquCab2.0
assembly and is therefore expected to show a strong deficit
of derived
alleles.
We used ∼ 9-fold whole - genome Sanger shotgun coverage to produce a ∼ 167 - megabase - pair
assembly that typically represents each locus once rather than splitting
alleles (Supplementary Notes 2 and 3) and captures ∼ 97 %
of the protein - coding gene content (Supplementary Note 2.5).