In a 2016 National Institutes of Health summer internship at the National Human Genomic Research Institute, Sappington conducted a third research project in which she demonstrated a fast, alignment - free computational method for
identifying orthologs — similar genes from species that are related by descent from a common ancestor.
Not exact matches
We were able to
identify 3,771
ortholog pairs by sequence similarity and investigated the
ortholog pair alignment length.
As expected, an
ortholog of AtDwf4 and OsDwf4 in S. viridis (SvDwf4; Sevir.9 G483600) was
identified.
In addition, one of the genes
identified — an
ortholog of human COL2A1 is commonly used as a chondrocytic maker in the development of cartilage (Zaucke et al., 2001).
Among the genes that were
identified to be up - regulated are
orthologs that encode for human collagen proteins: ADAMTS2, ADAMTS7 and COL2A1.
Whereas 12 of 16 annotations were confirmed by RT - PCR in human tissues, for only seven genes mouse
orthologs could be
identified and found to be expressed.
Benchmarking Universal Single - Copy
Orthologs (BUSCO) analysis showed that the pipeline was able to
identify at least 95 % of BUSCO.s plantae dataset.
One to one BLAST
orthologs were
identified by reciprocal BLASTP searches between D. plexippus and the B. mori OGS peptides (downloaded from SilkDB http://silkworm.genomics.org.cn/silkdb).