Data mining in agriculture
This article has multiple issues. Please help improve it or discuss these issues on the talk page. (Learn how and when to remove these template messages)(Learn how and when to remove this template message)
Data mining in agriculture is a very recent research topic. It consists in the application of data mining techniques to agriculture. Recent technologies are nowadays able to provide a lot of information on agricultural-related activities, which can then be analyzed in order to find important information. A related, but not equivalent term is precision agriculture.
- 1 Applications
- 1.1 Relationship between sprays and fruit defects
- 1.2 Prediction of problematic wine fermentations
- 1.3 Predicting metabolizable energy of poultry feed using group method of data handling-type neural network
- 1.4 Detection of diseases from sounds issued by animals
- 1.5 Growth of sheep from genes polymorphism using artificial intelligence
- 1.6 Sorting apples by watercores
- 1.7 Optimizing pesticide use by data mining
- 1.8 Explaining pesticide abuse by data mining
- 1.9 Analyzing chicken performance data by neural network models
- 2 Literature
- 3 Conferences
- 4 References
Relationship between sprays and fruit defects
Fruit defects are often recorded (for a multitude of reasons, sometimes for insurance reasons when exporting fruit overseas). It may be done manually or through computer vision (detecting surface defects when grading fruit). Spray diaries are a legal requirement in many countries and at the very least record the date of spray and the product name. It is known that spraying can have affect different fruit defects for different fruit. Fungicidal sprays are often used to prevent rots from being expressed on fruit. It is also known that some sprays can cause russeting on apples. Currently much of this knowledge comes anecdotally, however some efforts have been in regards to the use of data mining in horticulture.
Prediction of problematic wine fermentations
Wine is widely produced all around the world. The fermentation process of the wine is very important, because it can impact the productivity of wine-related industries and also the quality of wine. If the fermentation could be categorized and predicted at the early stages of the process, it could be altered in order to guarantee a regular and smooth fermentation. Fermentations are nowadays studied by using different techniques, such as, for example, the k-means algorithm, and a technique for classification based on the concept of biclustering. Note that these works are different from the ones where a classification of different kinds of wine is performed. See the wiki page Classification of wine for more details.
Predicting metabolizable energy of poultry feed using group method of data handling-type neural network
A group method of data handling-type neural network (GMDH-type network) with an evolutionary method of genetic algorithm was used to predict the metabolizable energy of feather meal and poultry offal meal based on their protein, fat, and ash content. Published data samples were collected from literature and used to train a GMDH-type network model. The novel modeling of GMDH-type network with an evolutionary method of genetic algorithm can be used to predict the metabolizable energy of poltry feed samples based on their chemical content. It is also reported that the GMDH-type network may be used to accurately estimate the poultry performance from their dietary nutrients such as dietary metabolizable energy, protein and amino acids.
Detection of diseases from sounds issued by animals
The detection of animal's diseases in farms can impact positively the productivity of the farm, because sick animals can cause contaminations. Moreover, the early detection of the diseases can allow the farmer to cure the animal as soon as the disease appears. Sounds issued by pigs can be analyzed for the detection of diseases. In particular, their coughs can be studied, because they indicate their sickness. A computational system is under development which is able to monitor pig sounds by microphones installed in the farm, and which is also able to discriminate among the different sounds that can be detected.
Growth of sheep from genes polymorphism using artificial intelligence
Polymerase chain reaction-single strand conformation polymorphism (PCR-SSCP) method was used to determine the growth hormone (GH), leptin, calpain, and calpastatin polymorphism in Iranian Baluchi male sheep. An artificial neural network (ANN) model was developed to describe average daily gain (ADG) in lambs from input parameters of GH, leptin, calpain, and calpastatin polymorphism, birth weight, and birth type. The results revealed that the ANN-model is an appropriate tool to recognize the patterns of data to predict lamb growth in terms of ADG given specific genes polymorphism, birth weight, and birth type. The platform of PCR-SSCP approach and ANN-based model analyses may be used in molecular marker-assisted selection and breeding programs to design a scheme in enhancing the efficacy of sheep production.
Sorting apples by watercores
Before going to market, apples are checked and the ones showing some defects are removed. However, there are also invisible defects, that can spoil the apple flavor and look. An example of invisible defect is the watercore. This is an internal apple disorder that can affect the longevity of the fruit. Apples with slight or mild watercores are sweeter, but apples with moderate to severe degree of watercore cannot be stored for any length of time. Moreover, a few fruits with severe watercore could spoil a whole batch of apples. For this reason, a computational system is under study which takes X-ray photographs of the fruit while they run on conveyor belts, and which is also able to analyse (by data mining techniques) the taken pictures and estimate the probability that the fruit contains watercores.
Optimizing pesticide use by data mining
Recent studies by agriculture researchers in Pakistan (one of the top four cotton producers of the world) showed that attempts of cotton crop yield maximization through pro-pesticide state policies have led to a dangerously high pesticide use. These studies have reported a negative correlation between pesticide use and crop yield in Pakistan. Hence excessive use (or abuse) of pesticides is harming the farmers with adverse financial, environmental and social impacts. By data mining the cotton Pest Scouting data along with the meteorological recordings it was shown that how pesticide use can be optimized (reduced). Clustering of data revealed interesting patterns of farmer practices along with pesticide use dynamics and hence help identify the reasons for this pesticide abuse.
Explaining pesticide abuse by data mining
To monitor cotton growth, different government departments and agencies in Pakistan have been recording pest scouting, agriculture and metrological data for decades. Coarse estimates of just the cotton pest scouting data recorded stands at around 1.5 million records, and growing. The primary agro-met data recorded has never been digitized, integrated or standardized to give a complete picture, and hence cannot support decision making, thus requiring an Agriculture Data Warehouse. Creating a novel Pilot Agriculture Extension Data Warehouse followed by analysis through querying and data mining some interesting discoveries were made, such as pesticides sprayed at the wrong time, wrong pesticides used for the right reasons and temporal relationship between pesticide usage and day of the week.
Analyzing chicken performance data by neural network models
A platform of artificial neural network-based models with sensitivity analysis and optimization algorithms was used successfully to integrate published data on the responses of broiler chickens to threonine. Analyses of the artificial neural network models for weight gain and feed efficiency from a compiled data set suggested that the dietary protein concentration was more important than the threonine concentration. The results revealed that a diet containing 18.69% protein and 0.73% threonine may lead to producing optimal weight gain, whereas the optimal feed efficiency may be achieved with a diet containing 18.71% protein and 0.75% threonine.
Since this research topic is quite recent, there is only one reference book. Data Mining in Agriculture is published by Springer and it is co-authored by Antonio Mucherino, Petraq Papajorgji and Panos Pardalos. A quick survey of the book can be found at this address. There are a few precision agriculture journals, such as Springer's Precision Agriculture or Elsevier's Computers and Electronics in Agriculture, but those are not exclusively devoted to data mining in agriculture.
There are many conferences organized every year on data mining techniques and applications, but rather few of them consider problems arising in the agricultural field.
- Mucherino, A.; Papajorgji, P.J.; Pardalos, P. (2009). Data Mining in Agriculture, Springer.
- "Apple russeting". www.extension.umn.edu. Retrieved 2016-10-04.
- Hill, M. G.; Connolly, P. G.; Reutemann, P.; Fletcher, D. (2014-10-01). "The use of data mining to assist crop protection decisions on kiwifruit in New Zealand". Computers and Electronics in Agriculture. 108: 250–257. doi:10.1016/j.compag.2014.08.011.
- Urtubia, A.; Perez-Correa, J.R.; Meurens, M.; Agosin, E. (2004). "Monitoring Large Scale Wine Fermentations with Infrared Spectroscopy". Talanta. 64 (3): 778–784. doi:10.1016/j.talanta.2004.04.005. PMID 18969672.
- Mucherino, A.; Urtubia, A. (2010). "Consistent Biclustering and Applications to Agriculture". IbaI Conference Proceedings, Proceedings of the Industrial Conference on Data Mining (ICDM10), Workshop Data Mining in Agriculture (DMA10), Springer: 105–113.
- Ahmadi, H.; Golian, A.; Mottaghitalab, M.; Nariman-Zadeh, N. (2008-09-01). "Prediction Model for True Metabolizable Energy of Feather Meal and Poultry Offal Meal Using Group Method of Data Handling-Type Neural Network". Poultry Science. 87 (9): 1909–1912. doi:10.3382/ps.2007-00507. ISSN 0032-5791. PMID 18753461.
- Ahmadi, Dr H.; Mottaghitalab, M.; Nariman-Zadeh, N.; Golian, A. (2008-05-01). "Predicting performance of broiler chickens from dietary nutrients using group method of data handling-type neural networks". British Poultry Science. 49 (3): 315–320. doi:10.1080/00071660802136908. ISSN 0007-1668. PMID 18568756.
- Chedad, A.; Moshou, D.; Aerts, J.M.; Van Hirtum, A.; Ramon, H.; Berckmans, D. (2001). "Recognition System for Pig Cough based on Probabilistic Neural Networks". Journal of Agricultural Engineering Research 79(4). 79 (4): 449–457. doi:10.1006/jaer.2001.0719.
- Mojtaba, Tahmoorespur; Hamed, Ahmadi (2012-01-01). "neural network model to describe weight gain of sheep from genes polymorphism, birth weight and birth type". Livestock Science. ISSN 1871-1413.
- Schatzki, T.F.; Haff, R.P.; Young, R.; Can, I.; Le, L-C.; Toyofuku, N. (1997). "Defect Detection in Apples by Means of X-ray Imaging". Transactions of the American Society of Agricultural Engineers 40(5). 40 (5): 1407–1415. doi:10.13031/2013.21367.
- Abdullah, Ahsan; Brobst, Stephen; Pervaiz, Ijaz; Umar, Muhammad; Nisar, Azhar (2004). Learning Dynamics of Pesticide Abuse through Data Mining (PDF). Australasian Workshop on Data Mining and Web Intelligence, Dunedin, New Zealand.
- Abdullah, Ahsan; Hussain, Amir (2006). "Data Mining a New Pilot Agriculture Extension Data Warehouse" (PDF). Journal of Research and Practice in Information Technology. 38 (3): 229–249. Archived from the original (PDF) on 2010-09-23.
- Ahmadi, H.; Golian, A. (2010-11-01). "The integration of broiler chicken threonine responses data into neural network models". Poultry Science. 89 (11): 2535–2541. doi:10.3382/ps.2010-00884. ISSN 0032-5791. PMID 20952719.