Digital Automated Identification SYstem (DAISY)
|Developer(s)||Mark A. O'Neill|
2.1.0 / February 1, 2016
|Written in||C|
|Platform||IA-32 x86-64 ARM|
|License||Proprietary commercial software|
It was developed by Dr. Mark O'Neill during the mid-1990s. Development was supported by funding from the Darwin Initiative in 1997 and BBSRC. The intellectual property rights were acquired by O'Neill's company; Tumbling Dice Ltd, in February 2000 at the end of the grant funded Darwin Project. The system underwent further development resulting in an producing an exemplar which is web accessible and which can cope in near real time with groups (e.g. hawk moths) which contain several hundred taxa. On medium to high end PC server hardware (e.g. a blade server) an identification is possible in under a second for a 300 taxon group. Parallelisation of the critical Daisy classifier codes (using either bespoke FPGA technology or general purpose GPU programming technology such as CUDA) will give an order of magnitude increase in performance. This means that Daisy can be deployed to make real time identifications within groups containing thousands of taxa (e.g. true flies).
|taxon||image type||structure||training images||species||success(%)|
|UK macro moths||RGB||wing||744||37||98|
|Soft fruit pests||RGB||body||2634||23||91|
|description||image type||structure||training images||classes||success(%)|
|Industrial objects||RGB||unposed object||155||14||100|
|Foraminifera tests||RGB||unposed object||198||8||95|
|Pollen grains||RGB||unposed object||6601||12||99|
|Human faces||mono||unposed face||400||41||99|
Daisy has been used in several research projects by O'Neill and others, and featured in popular science TV and magazine articles. The project has also been the subject of a recent article in Science.
In 2011, the first DAISY installation capable of scaling to hundreds of taxa was installed at Natural History Museum in London. This server offered both VNC and Web Service based interfaces and was able to offload compute intensive pattern matching operations onto an NVIDIA GPU programmed using CUDA. This installation was capable of providing identification to species given a 300+ taxon dataset in less than a second in a multiple user environment.
More recently, under the aegis of Innovate UK funding, DAISY has been extensively modified to meet the needs of upstream activities within the Oil & Gas sector, in particular biostratigraphy. The resultant system, GeoDAISY represents a significant technological advance. It is capable of deep learning, knowledge encapsulation, pattern based data mining and (image based) content search and can efficiently handle training sets consisting of millions of patterns on commodity hardware using a combination of smart data caching and OpenMP. Further details of GeoDAISY, and the rationale for developing it are available as white papers on the Tumbling Dice LinkedIn page.
Daisy Linux interface (DFE)
- "Automating Insect Identification for Inventorying Costa Rican Biodiversity". Darwin Initiative. Defra. Retrieved 15 December 2010.
- "Daisy Overview" (PDF). Tumbling Dice Ltd. 2007. Retrieved 14 December 2010.
- "Daisy". Tumbling Dice. Retrieved 14 December 2010.
- Watson, Anna T.; O'Neill, Mark A.; Kitching, Ian J. (2003). "A qualitative study investigating automated identification of living macrolepidoptera using the Digital Automated Identification SYstem (DAISY)". Systematics & Biodiversity. 1: 287–300. doi:10.1017/S1477200003001208.
- Reed, Sarah (2010). "Pushing DAISY". Science. 328: 1628–1629. PMID 20576867. doi:10.1126/science.328.5986.1628.