Optimization of sequence alignment for simple sequence repeat regions

cg.contactA.Jighly@cgiar.orgen_US
cg.contributor.centerInternational Center for Agricultural Research in the Dry Areas - ICARDAen_US
cg.contributor.centerGrains Research and Development Corporation - GRDCen_US
cg.contributor.crpCGIAR Research Program on Dryland Systems - DSen_US
cg.contributor.funderInternational Center for Agricultural Research in the Dry Areas - ICARDAen_US
cg.contributor.projectCommunication and Documentation Information Services (CODIS)en_US
cg.contributor.project-lead-instituteInternational Center for Agricultural Research in the Dry Areas - ICARDAen_US
cg.creator.idHamwieh, Aladdin: 0000-0001-6060-5560en_US
cg.identifier.doihttps://dx.doi.org/10.1186/1756-0500-4-239en_US
cg.issn1756-0500en_US
cg.journalBMC Research Notesen_US
cg.subject.agrovocagricultureen_US
cg.subject.agrovocsnpen_US
cg.subject.agrovocdna replicationen_US
cg.volume4: 239en_US
dc.contributorHamwieh, Aladdinen_US
dc.contributorOgbonnaya, Francis Chuksen_US
dc.creatorJighly, Abdul-Qaderen_US
dc.date.accessioned2017-07-23T21:31:12Z
dc.date.available2017-07-23T21:31:12Z
dc.description.abstractBackground: Microsatellites, or simple sequence repeats (SSRs), are tandemly repeated DNA sequences, including tandem copies of specific sequences no longer than six bases, that are distributed in the genome. SSR has been used as a molecular marker because it is easy to detect and is used in a range of applications, including genetic diversity, genome mapping, and marker assisted selection. It is also very mutable because of slipping in the DNA polymerase during DNA replication. This unique mutation increases the insertion/deletion (INDELs) mutation frequency to a high ratio - more than other types of molecular markers such as single nucleotide polymorphism (SNPs). SNPs are more frequent than INDELs. Therefore, all designed algorithms for sequence alignment fit the vast majority of the genomic sequence without considering microsatellite regions, as unique sequences that require special consideration. The old algorithm is limited in its application because there are many overlaps between different repeat units which result in false evolutionary relationships. Findings: To overcome the limitation of the aligning algorithm when dealing with SSR loci, a new algorithm was developed using PERL script with a Tk graphical interface. This program is based on aligning sequences after determining the repeated units first, and the last SSR nucleotides positions. This results in a shifting process according to the inserted repeated unit type. When studying the phylogenic relations before and after applying the new algorithm, many differences in the trees were obtained by increasing the SSR length and complexity. However, less distance between different linage had been observed after applying the new algorithm. Conclusions: The new algorithm produces better estimates for aligning SSR loci because it reflects more reliable evolutionary relations between different linages. It reduces overlapping during SSR alignment, which results in a more realistic phylogenic relationship.en_US
dc.formatPDFen_US
dc.identifierhttps://mel.cgiar.org/reporting/downloadmelspace/hash/QsOwWbye/v/cfcc44c590e5c6446f02a04d9ae5154aen_US
dc.identifier.citationAbdul-Qader Jighly, Aladdin Hamwieh, Francis Chuks Ogbonnaya. (18/7/2011). Optimization of sequence alignment for simple sequence repeat regions. BMC Research Notes, 4: 239, pp. 1-9.en_US
dc.identifier.statusOpen accessen_US
dc.identifier.urihttps://hdl.handle.net/20.500.11766/7164
dc.languageenen_US
dc.publisherBioMed Centralen_US
dc.rightsCC-BY-NC-4.0en_US
dc.sourceBMC Research Notes; 4: 239,(2011) Pagination 1-9en_US
dc.subjectsimple sequence repeats (ssrs)en_US
dc.subjectinsertion/deletion (indels)en_US
dc.subjectssr markersen_US
dc.titleOptimization of sequence alignment for simple sequence repeat regionsen_US
dc.typeJournal Articleen_US
dcterms.available2011-07-18en_US
dcterms.extent1-9en_US

Files