Diversity arrays technology

From Wikipedia, the free encyclopedia

Diversity Arrays Technology (DArT) is a high-throughput genetic marker technique that can detect allelic variations to provides comprehensive genome coverage without any DNA sequence information for genotyping and other genetic analysis.[1][2][3] The general steps involve reducing the complexity of the genomic DNA with specific restriction enzymes, choosing diverse fragments to serve as representations for the parent genomes, amplify via polymerase chain reaction (PCR), insert fragments into a vector to be placed as probes within a microarray, then fluorescent targets from a reference sequence will be allowed to hybridize with probes and put through an imaging system.[1][2] The objective is to identify and quantify various forms of DNA polymorphism within genomic DNA of sampled species.[1]

First reported in 2001 by Damian Jaccoud, Andrzej Kilian, David Feinstein, and Kaiman Peng, DArT prioritized significant advantages over other traditional primer-based methods like the ability to analyze large amounts of various samples from a low amount of initial DNA.[1][2][4][5] It also afforded low costs and faster results compared to related solid state DNA arrays that detected Single Nucleotide Polymorphisms (SNPs).[1][2] Since its inception, the technology has been a major instrument in the analysis of polyploid plants as well as in the construction of physical and genetic map to understand related on species based on similarities and allelic variances among their genomes.[1][2][6][7][8][3]


The concept was first developed by Damian Jaccoud, Andrzej Kilian, David Feinstein, and Kaiman Peng in 2001.[1] They aimed to establish a genomic DNA-polymorphism detection and quantification technique that increases throughput when compared to more traditional methods like Amplified Fragment Length Polymorphism (AFLP), Restriction Fragment Length Polymorphism (RFLP), Simple Sequence Repeats (SSR).[1][2][4][5] They also aimed to minimize cost and reliance on sequenced genomes to identify polymorphisms which is a consequence of early immobilized, solid-states DNA arrays, like DNA chips, who solely identify SNPs.[1][2] A byproduct of their discovery of a fast, low-cost whole-genome profiling method was that it also provided with the identification of SNPs as well as base-pair insertions, deletions, and shifts, which is an added layer of allelic variation between species analyzed.[1][2]

Jaccoud, Kilian, Feinstein, and Peng selected nine subspecies of rice as their source for genomic DNA and polymorphism analysis.[1] The analysis consisted of detecting the presence, or absence, of specific DNA polymorphisms with probing arrays as well as quantifying the strength of each signal, via fluorescence, within the subspecies. Upon selecting and extracting DNA samples from subjects, samples were digested with three specific restriction enzymes and ligated with T4 ligase. Following ligation into double stranded DNA, dilution as well as extraction of a short amount of mixture to use as a PCR template was performed. Products were placed into a pCR2.1-TOPO vector and subsequently transformed into E. coli, who were selected based on resistance to ampicillin and pigmentation from the X-gal interaction.[1][2] Cloned cells are amplified with PCR-amplified, purified, and introduced into a microarray. Reference DNA and samples were mixed with fluorescent dyes, Cy3 or Cy5, mixed, denatured, and allowed to hybridize to further reintroduce them into the microarray for further analysis. Results reported that the use of DArT was able to detect the presence or absence of polymorphism in an expedient manner as compared to RFLP as well as quantify the polymorphisms detected.[1] In addition, DArT was able to minimize the amount of initial DNA required to conduct the analysis significantly compared to other methods.[1]


The DArT is broken down into three essential steps: Complexity reduction, genomic representation, and DArT assay.[2]

Complexity reduction[edit]

This step of the process deals with reducing large complex genomic DNA of selected species into more, manageable fragmented components through the use of specific restriction enzymes. In addition, this step exclusively relies on digestion enzymes over a couple effort of digestion enzymes and primers due to the reported increased polymorphism identified across analyzed samples.[2] The PstI enzyme is a commonly used restriction enzyme for this step because of its specificity to the nonrepetitive, nonmethylated genome of species.[2][6][7][8][9]

Genomic representation[edit]

Once genomic DNA has been reduced to a manageable size from the previous step by incorporating one or two specific restriction enzymes, the next step involves selecting for the fragments that include largest amount of significant polymorphism across gene pool. These selected fragments are termed “representations” as they are smaller representations of the initial, larger genomic DNA. It is eminent to avoid repetitive sequences when selecting fragments as these will exhibit the lowest amount of polymorphism within analyzed genomic DNA.[2]

DArT assay[edit]

Digested sequences are ligated using T4 ligase to produce double stranded DNA. A small amount of ligated mixture will be diluted then amplified via PCR. During PCR, it is important to use primers complementary to the restriction-enzymes’ cutting sites and RedTaq polymerase, which is rarely inhibited. Mix product into an amplified, gene pool representation and ligate onto vector pCR2.1-TOPO. Following representation insertion into vector, transform vector into E. coli cells via electrical shocking or chemical means. Incubate cells and select based on ampicillin resistance and white-pigmentation from inactive β-galactosidase gene in a medium containing X-gal.[1][2] Inserts are then amplified via PCR and inserted as spotters into a microarray slide. Slides are centrifuge to isolate inserts, which are then purified.

Fluorescent dyes, Cy3 or Cy5, are added to the microarray targets, which are genomic representations. Following addition of the fluorescent dye, targets are added to microarray probes containing the amplified E. coli clones where denaturing and subsequent hybridization, if possible, takes place. Following hybridization, slides are washed and scanned with an imaging system that targets fluorescent signals with the incorporation of an open-source software called DArTsoft. Interactions and dissimilarities between probe and various targets are used to develop a histogram which quantifies and identifies several forms of DNA polymorphism among analyzed genomes.[1][2]


Molecular breeding[edit]

The ability to identify and quantify allelic variations among genomes without the need for a sequenced genome is of great value to DArT and has large implications in the molecular breeding sector.[1][2] By comparing crops with phenotypes such as higher yields of produce or resistance to certain environmental parasites, a phenotype can be directly linked to a DNA polymorphism identified among related species through DArT. DArT is also able to outperform other genotyping techniques with polyploids due to the absence of primer competition found in other techniques.[2] Polyploids are commonly found among agriculturally important crops.[2] For example, DArT has been used to conduct genome-wide analysis among Musa species, which includes bananas and plantains, which led to the development of a phylogenetic cladogram based on genetic markers derived from DArT techniques. These developments enhance breeding knowledge to obtain desirable yields and products.[3]

Expedited recognition of markers found with genes responsible for phenotypes is also being studied in animals with the help of DArT.[2][10] Mosquitoes’ resistance to insecticide has been linked to specific mutations in genes that confer resistance to certain species of mosquitoes over others.[10] Genotypic variations were found through markers while conducting DArT analysis on relevant samples.

Genomic mapping[edit]

Since DArT is able to find genetic relations among species within a metagenome in a cheap and expedited manner, it has been integral to developing physical and genetic maps of closely related species.[2][8] In its inception, DArT was used to develop phylogenetic cladograms of rice subspecies based on the presence or absence of DNA fragments in each species’ genome.[1] In the same manner, DArT was incorporated in fabricating genetic maps for A. thaliana by conducting an automated version of DArT.[2][9] Wheat, a hexaploid, is also another crop that has benefited from implementation of a DArT analysis as a Bacterial Artificial Chromosome (BAC) of the largest chromosome, 3B, was created from markers detected through DArT assays.[8][11]


  1. ^ a b c d e f g h i j k l m n o p q r Jaccoud D, Peng K, Feinstein D, Kilian A (February 2001). "Diversity arrays: a solid state technology for sequence information independent genotyping". Nucleic Acids Research. 29 (4): 25e–25. doi:10.1093/nar/29.4.e25. PMC 29632. PMID 11160945.
  2. ^ a b c d e f g h i j k l m n o p q r s t u Kilian A, Wenzl P, Huttner E, Carling J, Xia L, Blois H, et al. (2012). "Diversity arrays technology: a generic genome profiling technology on open platforms". In Pompanon F, Bonin A (eds.). Data Production and Analysis in Population Genomics. Methods in Molecular Biology. Vol. 888. Totowa, NJ: Humana Press. pp. 67–89. doi:10.1007/978-1-61779-870-2_5. ISBN 978-1-61779-870-2. PMID 22665276. Data Production and Analysis in Population Genomics: Methods and Protocols
  3. ^ a b c Risterucci AM, Hippolyte I, Perrier X, Xia L, Caig V, Evers M, et al. (October 2009). "Development and assessment of Diversity Arrays Technology for high-throughput DNA analyses in Musa". TAG. Theoretical and Applied Genetics. Theoretische und Angewandte Genetik. 119 (6): 1093–1103. doi:10.1007/s00122-009-1111-5. PMID 19693484. S2CID 23747800.
  4. ^ a b Vos P, Hogers R, Bleeker M, Reijans M, van de Lee T, Hornes M, et al. (November 1995). "AFLP: a new technique for DNA fingerprinting". Nucleic Acids Research. 23 (21): 4407–4414. doi:10.1093/nar/23.21.4407. PMC 307397. PMID 7501463.
  5. ^ a b Weber JL, May PE (March 1989). "Abundant class of human DNA polymorphisms which can be typed using the polymerase chain reaction". American Journal of Human Genetics. 44 (3): 388–396. PMC 1715443. PMID 2916582.
  6. ^ a b Rabinowicz PD, Schutz K, Dedhia N, Yordan C, Parnell LD, Stein L, et al. (November 1999). "Differential methylation of genes and retrotransposons facilitates shotgun sequencing of the maize genome". Nature Genetics. 23 (3): 305–308. doi:10.1038/15479. PMID 10545948. S2CID 19943394.
  7. ^ a b Wenzl P, Carling J, Kudrna D, Jaccoud D, Huttner E, Kleinhofs A, Kilian A (June 2004). "Diversity Arrays Technology (DArT) for whole-genome profiling of barley". Proceedings of the National Academy of Sciences of the United States of America. 101 (26): 9915–9920. Bibcode:2004PNAS..101.9915W. doi:10.1073/pnas.0401076101. PMC 470773. PMID 15192146.
  8. ^ a b c d Akbari M, Wenzl P, Caig V, Carling J, Xia L, Yang S, et al. (November 2006). "Diversity arrays technology (DArT) for high-throughput profiling of the hexaploid wheat genome". TAG. Theoretical and Applied Genetics. Theoretische und Angewandte Genetik. 113 (8): 1409–1420. doi:10.1007/s00122-006-0365-4. PMID 17033786. S2CID 12636193.
  9. ^ a b Wittenberg AH, van der Lee T, Cayla C, Kilian A, Visser RG, Schouten HJ (August 2005). "Validation of the high-throughput marker technology DArT using the model plant Arabidopsis thaliana". Molecular Genetics and Genomics. 274 (1): 30–39. doi:10.1007/s00438-005-1145-6. PMID 15937704. S2CID 34817585.
  10. ^ a b Bonin A, Paris M, Després L, Tetreau G, David JP, Kilian A (October 2008). "A MITE-based genotyping method to reveal hundreds of DNA polymorphisms in an animal genome after a few generations of artificial selection". BMC Genomics. 9 (1): 459. doi:10.1186/1471-2164-9-459. PMC 2579443. PMID 18837997.
  11. ^ Paux E, Sourdille P, Salse J, Saintenac C, Choulet F, Leroy P, et al. (October 2008). "A physical map of the 1-gigabase bread wheat chromosome 3B". Science. 322 (5898): 101–104. Bibcode:2008Sci...322..101P. doi:10.1126/science.1161847. PMID 18832645. S2CID 27686615.