
Article

Accuracy Analysis Mechanism for Agriculture Data Using the Ensemble Neural Network Method

Hsu-Yang Kung 1, Ting-Huan Kuo 2,*, Chi-Hua Chen 3,4 and Pei-Yu Tsai 1

1 Department of Management Information Systems, National Pingtung University of Science and Technology, Pingtung 91201, Taiwan; kung@mail.npust.edu.tw (H.-Y.K.); m10256005@mail.npust.edu.tw (P.-Y.T.)

2 Department of Computer Science and Engineering, National Sun Yat Sen University, Kaohsiung 80424, Taiwan

3 Telecommunication Laboratories, Chunghwa Telecom Co., Ltd., Taoyuan 32661, Taiwan; chihua0826@gmail.com

4 Department of Information Management and Finance, National Chiao Tung University, Hsinchu 30010, Taiwan

* Correspondence: daphnekuo0408@gmail.com; Tel.: +886-7-5252365

Academic Editor: Douglas H. Constance

Received: 13 June 2016; Accepted: 28 July 2016; Published: 1 August 2016

Abstract: With the rise and development of information technology (IT) services, the amount of data generated is increasing rapidly. Data from different sources are often inconsistent, so data capture, storage and analysis pose major challenges, and most data analysis methods cannot handle such large amounts of data. Many studies employ neural networks, mostly specifying the number of hidden layers and neurons according to experience or a formula. Different network topologies yield different results, and the best network model is then selected. This investigation proposes a system based on the ensemble neural network (ENN). It creates multiple network models, each with a different number of hidden layers and neurons. A model that does not reach the accuracy threshold is discarded. The proposed system derives the weighted average of all remaining network models to improve the accuracy of the prediction. This study applies the proposed method to agricultural yield prediction. The agricultural production process in Taiwan is more complex than those of manufacturing or other industries. The Council of Agriculture forecasts yield primarily from the planted area and experience, without considering the overall planting environment. The ENN-based method has a much lower error rate than traditional back-propagation neural networks, while multiple regression analysis has an error rate of 12.4%. Experimental results reveal that the ENN method outperforms both traditional back-propagation neural networks and multiple regression analysis.

Keywords: ensemble neural network; data mining; multiple regression analysis; stepwise regression; yield prediction models

1. Introduction

Crop production is important for people in Taiwan, and agricultural production faces more issues than manufacturing industries. These issues include climatic factors, pests, diseases and the treatment process. Hence, farmers engaged in production, and the agricultural agencies indirectly related to it, need to predict crop yield accurately to avoid imbalances in market supply and demand caused or hastened by poor harvest quality and poor yields. The agricultural forecasting provided by the Council of Agriculture predicts yield mainly from the planted area and experience, and does not consider the impact of the planting environment on yield.


Sustainability 2016, 8, 735 2 of 11

To understand the effect of important meteorological parameters, and to predict crop yields effectively, this work adopts stepwise regression and an ensemble neural network (ENN) method for analysis with the aim of improving the accuracy of crop yield prediction.

The rest of this study is organized as follows. The research backgrounds and related works of data mining methods, agricultural production forecasting, stepwise regression, and back-propagation neural networks (BPNs) are presented in Section 2. Section 3 proposes an ENN method to analyze agriculture data. The experimental results and discussions are illustrated in Section 4. Section 5 gives conclusions and future work.

2. Research Backgrounds and Related Works

The literature review of data mining methods, agricultural production forecasting, stepwise regression, and BPNs is discussed in the following subsections.

2.1. Data Mining Methods

Data mining is a part of knowledge discovery in databases. As the name suggests, it involves accumulating large amounts of data and extracting useful information from them. With the development of information technology, the growing amount of data and the diversification of data types and sources, big data has become a major research topic in recent years for governments and industries. Big data technology is still based on traditional data mining methods. The objective of data mining or big data analysis is to identify implicit information in data, and thus enhance the value of information. Data analysis can be conducted using many approaches, such as cluster analysis, classification and statistical analysis.

2.1.1. Cluster Analysis

Fahad et al. [1] divided cluster analysis methods into five types, namely segmentation-based, hierarchical-based, density-based, grid-based and model-based methods, as listed in Table 1.

Table 1. Clustering methods.

Type | Algorithm
segmentation-based | K-means, K-medoids, K-modes, PAM (partitioning around medoids), CLARANS (clustering large applications based on randomized search), CLARA (clustering large applications), FCM (fuzzy c-means)
hierarchical-based | BIRCH (balanced iterative reducing and clustering using hierarchies), CURE (clustering using representatives), ROCK (robust clustering algorithm), Chameleon, Echidna
density-based | DBSCAN (density-based spatial clustering of applications with noise), OPTICS (ordering points to identify the clustering structure), DBCLASD (density based clustering algorithms for mining large spatial databases), DENCLUE (density-based clustering)
grid-based | Wave-Cluster, STING (statistical information grid), CLIQUE (clustering in quest), OptiGrid (optimal grid-clustering)
model-based | EM (expectation maximization), Cobweb, CLASSIT (a robust concept formation system), SOMs (self-organizing maps)

2.1.2. Classification

A classification model is generated from the property values of existing data, then employed to predict the category of new data. The main goal of classification is to analyze the influence of each factor or variable on the forecast data values. Classification is a form of supervised learning; typical methods include neural networks and decision trees [2–4].


2.1.3. Statistical Analysis

Statistical analysis is based on mathematical principles, and can be categorized into descriptive statistics and inferential statistics [5–7].

Data mining creates high value for enterprises in sectors such as health and medical care, personal location information, retail and manufacturing [8]. Health care accounts for a very high proportion of spending in the US, and analyzing the massive amount of health care data could significantly reduce capital costs. The retail sector has long employed data mining techniques: customer purchase records are used to predict future purchases and to adjust marketing strategies or merchandise displays. The manufacturing sector, which is the backbone of global trade, has a complex and widely dispersed value chain; analyzing the available data would enable increased productivity, process improvements and reduced product delivery times.

2.2. Agricultural Production Forecasting

Many factors, mainly meteorological and environmental factors, influence crop yield. The weather-related variables include temperature, the amount of sunlight, and rain. Some studies concluded that temperature and rainfall affect the growth of crops, and thus the final yield. Environmental factors that affect crop growth include latitude and soil. Chen et al. (2008) [9] accumulated data about crop damage, the economic growth rate, pesticide sales, the rate of change in agricultural production, the index of agricultural production and the gross national product to determine how these economic variables affect the market output of fresh fruits and vegetables. Other investigations have observed that the usage of fertilizer and the mechanization of production are factors that affect crop yield.

Some agricultural prediction algorithms utilize neural networks. Zhang et al. (2010) [10] accumulated meteorological and crop growth data, and employed these to compare the performance of artificial neural networks, the k-nearest neighbors algorithm (kNN) and regression methods to predict soybean growth and flowering stages in the schedule model. Their results show that artificial neural networks predicted the soybean growth and flowering stages more accurately than the two other models. Tsai et al. (2004) [11] constructed a production forecast model based on meteorological factors and growth traits factors, and analyzed it using the back-propagation network and other methods. Their analytical results demonstrated that the BPN forecasting performed better than others. Ma et al. employed regression analysis, the genetic algorithm, the back-propagation neural network, and regression analysis combined with genetic algorithms to predict sales of pineapple, grapes and wax apples. According to their experimental data, the BPN best predicted wax apple sales, while regression analysis combined with genetic algorithms was most accurate for predicting pineapple and grape sales.

2.3. Stepwise Regression

Regression analysis measures how strongly one or more independent variables are correlated with a dependent variable, in order to understand the influence of each independent variable. The variable selection methods of regression analysis are the enter, forward, backward and stepwise methods. Stepwise regression combines the characteristics of forward and backward regression. It begins with the independent variable most strongly related to the dependent variable; at each step further independent variables are added to the regression equation, and those already included are re-examined to determine whether they should remain in the equation. Thus, forward and backward selection are used together to obtain the best regression model [12,13].

2.4. Back-Propagation Neural Network

An artificial neural network (ANN) simulates messaging between neurons in a biological neural network. It comprises a plurality of neurons, as depicted in Figure 1. Figure 2 illustrates the network structure, also called the network topology [14–16].


Figure 1. Neural network neuron model.

Figure 2. Neural network architecture diagram.

The traditional BPN, which is a supervised learning network, can be used for classification and prediction. In the learning stage, the BPN updates the weights among neurons in each iteration according to the error between the predicted output and the actual output, so that the error is reduced over successive iterations. The steps of the BPN method are described in the following [14–16].

(1) Setting the parameters (e.g., neural network structure, learning rate, etc.) of the BPN.

(2) Setting the weights (e.g., Wi,j in Figure 1) among neurons in the BPN.

(3) Setting the input neurons (e.g., Xi in Figure 1) and the output neurons (e.g., Yj in Figure 1).

(4) Calculating the output value of each neuron in the hidden layer in accordance with inputs and the output value of the neuron (e.g., Yj in Figure 1) in the output layer.

(5) Evaluating the error rate between the predicted output and actual output.

(6) Evaluating the error rate among the value of the output neuron, the output value of each neuron in the hidden layer, and the value of the input neurons.

(7) Updating the weights of neurons in accordance with error rates.

(8) Repeating Steps (4)–(7) until convergence.

While the BPN can analyze data and optimize the weights of the neural network, it may converge to a local optimal solution. Therefore, this study proposes an ENN to combine multiple BPNs with several compositions of data.
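The BPN steps listed above can be sketched in a few lines of Python. This is a minimal illustration and not the authors' Java implementation: the single hidden layer, the sigmoid activation, the squared-error measure, the toy samples and the names train_bpn and forward are all assumptions made for the example.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train_bpn(samples, n_hidden=3, learning_rate=0.1, epochs=3000, seed=0):
    """Train a one-hidden-layer BPN (assumed structure); return (predict, loss_history)."""
    rng = random.Random(seed)
    n_in = len(samples[0][0])
    # Steps (1)-(2): fix the structure and initialise the weights (last entry is a bias).
    w_hid = [[rng.uniform(-0.5, 0.5) for _ in range(n_in + 1)] for _ in range(n_hidden)]
    w_out = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden + 1)]

    def forward(x):
        # Step (4): hidden-layer outputs first, then the single output neuron.
        xb = list(x) + [1.0]
        h = [sigmoid(sum(w * v for w, v in zip(ws, xb))) for ws in w_hid]
        y = sigmoid(sum(w * v for w, v in zip(w_out, h + [1.0])))
        return h, y

    history = []
    for _ in range(epochs):  # Step (8): repeat for a fixed number of passes.
        total = 0.0
        for x, target in samples:  # Step (3): inputs with their known outputs.
            h, y = forward(x)
            err = target - y  # Step (5): error at the output neuron.
            total += err * err
            delta_out = err * y * (1.0 - y)
            # Step (6): propagate the error back to each hidden neuron.
            delta_hid = [delta_out * w_out[j] * h[j] * (1.0 - h[j]) for j in range(n_hidden)]
            # Step (7): update the weights in proportion to the errors.
            for j, v in enumerate(h + [1.0]):
                w_out[j] += learning_rate * delta_out * v
            xb = list(x) + [1.0]
            for j in range(n_hidden):
                for i, v in enumerate(xb):
                    w_hid[j][i] += learning_rate * delta_hid[j] * v
        history.append(total / len(samples))
    return (lambda x: forward(x)[1]), history


# Hypothetical toy data: two normalised inputs, one target in (0, 1).
samples = [([0.0, 0.0], 0.1), ([0.0, 1.0], 0.5), ([1.0, 0.0], 0.5), ([1.0, 1.0], 0.9)]
predict, history = train_bpn(samples)
print(history[0], history[-1])  # mean squared error of the first and last pass
```

Because the result depends on the random initial weights, different seeds can settle on different solutions, which is the local-optimum issue that motivates combining several networks.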

3. Materials and Methods

This investigation designs an accuracy analysis mechanism for agriculture data using the ENN method. The designed mechanism is employed for agricultural applications. Figure 3 shows the architecture of this mechanism.

3.1. Data Collection Mechanism

This is the underlying data analysis layer. It accumulates meteorological factors (e.g., relative humidity, precipitation, and air temperature), environmental factors (e.g., planting area, harvested area, harvest and harvest per unit volume), and economic factors (e.g., the cost of production and the market trading price), which are shown in Table 2, from many different open data sources. Figure 4 illustrates the data preprocessing stage, which involves data integration, data cleaning and data transformation. Each step is presented in the following paragraphs.

Table 2. Notations.

Parameter | Description
x1 | Relative humidity
x2 | Precipitation
x3 | Planting area
x4 | Air temperature
x5 | Cost of production
x6 | Market trading price
Y | Total harvest

Figure 3. Architecture of accuracy analysis mechanism for agricultural data based on the ENN method.

Figure 4. The process of data preprocessing.

(1) Data integration

The data from different databases (e.g., the Agriculture and Food Agency of Council of Agriculture in Taiwan) are collected and stored into a database.

(2) Data cleaning

Due to the wide range of sources of information, information may be incomplete, non-conformant or noisy. Therefore, the data are cleaned to ensure the integrity and accuracy of the information.

(3) Data transformation

The data are normalized by using Equations (1)–(3). For instance, the average of the relative humidity during the j-th month is defined as a1,j, and the mean and standard deviation of the relative humidity over the n months of the historical dataset are calculated by Equations (1) and (2), respectively. The normalized average of the relative humidity during the j-th month is then expressed as x1,j by Equation (3).

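$$\bar{a}_i = \frac{\sum_{j=1}^{n} a_{i,j}}{n} \qquad (1)$$

$$\sigma_{a_i} = \sqrt{\frac{\sum_{j=1}^{n} \left(a_{i,j} - \bar{a}_i\right)^2}{n - 1}} \qquad (2)$$

$$x_{i,j} = \frac{a_{i,j} - \bar{a}_i}{\sigma_{a_i}} \qquad (3)$$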
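Equations (1)–(3) amount to a z-score normalization with the sample (n − 1) standard deviation. A minimal sketch in Python follows; the function name normalize and the humidity values are hypothetical and serve only to illustrate the calculation.

```python
import statistics


def normalize(values):
    """Apply Equations (1)-(3) to one factor's monthly values."""
    mean = statistics.mean(values)   # Equation (1)
    std = statistics.stdev(values)   # Equation (2): sample deviation, n - 1 denominator
    return [(v - mean) / std for v in values]  # Equation (3)


# Hypothetical monthly averages of relative humidity (%), not taken from the data set.
humidity = [70.0, 75.0, 80.0, 85.0, 90.0]
normalized = normalize(humidity)
print(normalized)
```

After this transformation each factor has zero mean and unit sample standard deviation, so factors measured in different units (e.g., percent humidity and millimetres of precipitation) enter the models on a comparable scale.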

3.2. Stepwise Multiple Regression Mechanism

Selecting the input variables of the neural network is a very important issue. Irrelevant input variables may lead to high network error, and indirectly reduce the reliability of the network model. To discover the relationship between meteorological factors and yields, this work applies stepwise multiple regression with the yield as the dependent variable and the monthly average temperature, relative humidity, sunshine and precipitation as independent variables.

3.3. Ensemble Neural Network Analysis Mechanism

The ENN method is based on BPNs. The ENN mechanism randomly generates a plurality of neural networks, each with a different architecture. For instance, the numbers of hidden layers and hidden layer neurons are generated randomly. Figure 5 illustrates the main process, which is divided into three stages, namely learning, recall and forecast.


Figure 5. Prediction of random neural networks flowchart.

3.3.1. Learning Stage

This algorithm generates M neural networks, each with different numbers of hidden layers and neurons in each hidden layer. In the learning stage, the learning data set is input into the networks. The input parameters comprise the important meteorological parameters identified for the regional yield in the previous stage, together with environmental factors and economic factors. A neural network is a supervised learning network: in the learning stage, both the inputs and the corresponding target outputs are known. Table 3 summarizes the status of each parameter group. Hence, the main objective of the learning stage is to build the coupling between neurons by constantly modifying their weights, in order to establish a correspondence between the input and output data of the learning samples.

Table 3. ENN learning stage group summary.

Parameter | Status
Neurons of input layer | Known (learning data set)
Weight | Unknown (learned through constant learning and revision)
Neurons of output layer | Known

3.3.2. Recall Stage

Each network model retains the architecture and weights obtained in the preceding learning stage. The testing data set is entered into each network model, which computes its output from the learned correspondence. Table 4 presents the group summary. The actual output value is then compared with the target output value to obtain the accuracy of each network model. This accuracy is reused as the weight of the model in the prediction stage. Furthermore, an accuracy threshold is adopted as a heuristic: any model that does not reach the threshold is eliminated.


Table 4. Summary of recall stage group of ENN.

Parameter | Status
Neurons of input layer | Known (testing data set)
Weight | Known (learned through learning stage)
Neurons of output layer | Unknown (to verify the accuracy of the model output)

3.3.3. Prediction Stage

Any new data to be analyzed are entered into the remaining network models. Each network model determines its output based on its learning results. The network models that predicted more accurately in the recall stage have a greater impact on the overall result.
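The recall and prediction stages can be summarized as a filter followed by a weighted average. The sketch below assumes that each surviving model's recall-stage accuracy is used directly as its weight and that the weights are normalized by their sum; the function name ensemble_predict, the inclusive threshold comparison and the example values are illustrative assumptions.

```python
def ensemble_predict(outputs, accuracies, threshold=0.90):
    """Combine model outputs, weighting each surviving model by its accuracy."""
    # Recall stage: drop every model whose accuracy is below the threshold.
    kept = [(out, acc) for out, acc in zip(outputs, accuracies) if acc >= threshold]
    if not kept:
        raise ValueError("no model reached the accuracy threshold")
    # Prediction stage: accuracy-weighted average of the remaining outputs.
    total_weight = sum(acc for _, acc in kept)
    return sum(out * acc for out, acc in kept) / total_weight


# Hypothetical outputs of three models; the third is discarded (accuracy 0.60).
combined = ensemble_predict([100.0, 110.0, 500.0], [0.92, 0.94, 0.60])
print(combined)
```

With this design a poorly performing topology cannot distort the forecast, while the remaining models contribute in proportion to how well they reproduced the testing data.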

4. Analyses of Experimental Results

This section presents the experimental environments and applies traditional BPNs and the ENN to predict agricultural production.

4.1. Experimental Environments

All tomato data, meteorology data, environment data and economic data were accumulated. The total data set had 9953 records from the Agriculture and Food Agency of the Council of Agriculture in Taiwan from 1997 to 2014. The meteorological factors included the average air temperature, relative humidity, and precipitation; the environmental factors included the planting area, harvested area, harvest and harvest per unit volume; and the economic factors included the cost of production and the market trading price. In this study, the input parameters include the average air temperature, relative humidity, precipitation, planting area, cost of production, and market trading price; the output is harvest. The tools used in the experimental environments are listed in Table 5.

Table 5. Tools in experimental environments.

Item | Tool
Operating System | Windows 7 64-bit
Processor | Intel(R) Core(TM) i5 1.6GHz
Random Access Memory | 4 GB
Integrated Development Environment | Eclipse
Programming Language | Java SE 1.8 JDK
Statistics Tool | IBM SPSS Statistics 22.0

4.2. Experimental Results and Discussions

This study randomly generated five neural network models. Each network model was generated with a random number of hidden layers (up to five) and a random number of neurons in each hidden layer (up to five). Each network model used 60% of the available data for the learning data set, and the remaining 40% for the testing data set. The learning rate of each neural network was set as 0.1, and the accuracy threshold was set as 90%. That is, any model with accuracy below 90% was eliminated. Five tests were run in the learning stage. Table 6 shows the network model and network infrastructure for each test run.
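The experimental setup can be sketched as follows, assuming uniform random choices for the topology and a shuffled split of the records; the paper does not state how either was drawn, so both are assumptions, as are the names random_topologies and split_records.

```python
import random


def random_topologies(n_models=5, max_layers=5, max_neurons=5, seed=None):
    """Draw one list of hidden-layer sizes per model, e.g. [1, 3, 2]."""
    rng = random.Random(seed)
    return [
        [rng.randint(1, max_neurons) for _ in range(rng.randint(1, max_layers))]
        for _ in range(n_models)
    ]


def split_records(records, learn_fraction=0.6, seed=None):
    """Shuffle the records and split them into learning and testing sets."""
    rng = random.Random(seed)
    shuffled = list(records)
    rng.shuffle(shuffled)
    cut = int(round(len(shuffled) * learn_fraction))
    return shuffled[:cut], shuffled[cut:]


topologies = random_topologies(seed=1)
# Record indices stand in for the 9953 records of the data set.
learn, test = split_records(range(9953), seed=1)
print(topologies)
print(len(learn), len(test))
```

Each inner list plays the role of one entry in the hidden-neuron column of Table 6, with one neuron count per hidden layer.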

In the first experiment, the accuracy rates of network models 1–5 were 90.81%, 86.70%, 88.10%, 89.87% and 93.30%, respectively. Only network models 1 and 5 had accuracy above 90%. In back-propagation neural network research and analysis, a network model is only adopted if its accuracy rate has reached a threshold value. In this case, the model is used for later analysis to verify its prediction accuracy. The experimental conditions and parameters are fixed in this stage. A datum is randomly selected from the data set. The traditional BPN models and multiple regression analysis are then run on it, and their predictions are compared.

Table 6. Experimental results.

Test run | Model | Number of Hidden Layers | Number of Hidden Neurons | Correct Rate
First | Model 1 | 5 | {1,3,1,2,1} | 90.81%
First | Model 2 | 5 | {2,4,5,5,1} | 86.70%
First | Model 3 | 5 | {5,1,1,4,5} | 88.10%
First | Model 4 | 1 | {3} | 89.87%
First | Model 5 | 3 | {1,3,2} | 93.30%
Second | Model 1 | 5 | {2,5,3,1,5} | 86.03%
Second | Model 2 | 5 | {4,1,3,3,4} | 90.74%
Second | Model 3 | 2 | {2,5} | 82.99%
Second | Model 4 | 1 | {2} | 83.5%
Second | Model 5 | 1 | {3} | 90.01%
Third | Model 1 | 4 | {5,1,5,5} | 90.7%
Third | Model 2 | 4 | {5,4,4,2} | 93.81%
Third | Model 3 | 4 | {2,5,4,5} | 87.27%
Third | Model 4 | 1 | {3} | 94.75%
Third | Model 5 | 1 | {5} | 94.29%
Fourth | Model 1 | 5 | {5,3,3,2,5} | 85.93%
Fourth | Model 2 | 1 | {1} | 87.63%
Fourth | Model 3 | 3 | {3,2,3} | 90.06%
Fourth | Model 4 | 2 | {2,3} | 86.95%
Fourth | Model 5 | 4 | {5,1,4,3} | 94.46%
Fifth | Model 1 | 2 | {4,2} | 88.16%
Fifth | Model 2 | 3 | {4,3,4} | 91.02%
Fifth | Model 3 | 5 | {5,3,1,3,5} | 90.53%
Fifth | Model 4 | 3 | {2,5,2} | 89.85%
Fifth | Model 5 | 1 | {5} | 88.67%

4.2.1. Regression Analysis of Experimental Results

The regression model obtained by regression analysis is given in Equation (4). Each parameter in Table 2 was adopted into the regression model to predict harvests. This study used the root mean squared error (RMSE) to evaluate the error rate of the prediction method. The error rate of this method is about 12.4%, which is higher than the error rates of traditional BPNs and ENNs.
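As a minimal sketch of how the regression model and its error measure can be evaluated, the following Python code uses the coefficients of Equation (4), which follows, together with the standard RMSE definition. It assumes that x1 to x6 are the normalized factors of Table 2; the function names predict_yield and rmse and the sample inputs are hypothetical.

```python
import math

# Coefficients of x1..x6 and the constant term, copied from Equation (4).
COEFFICIENTS = [-0.873, -0.388, -0.969, 1.547, 0.074, -0.047]
INTERCEPT = -0.491


def predict_yield(x):
    """Evaluate Equation (4) for one record x = [x1, ..., x6]."""
    return sum(c * v for c, v in zip(COEFFICIENTS, x)) + INTERCEPT


def rmse(predicted, actual):
    """Root mean squared error between two equally long sequences."""
    squared = [(p - a) ** 2 for p, a in zip(predicted, actual)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical normalised inputs, used only to exercise the functions.
print(predict_yield([0.0] * 6))
print(predict_yield([1.0] * 6))
print(rmse([0.0, 0.0], [3.0, 4.0]))
```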

$$Y = -0.873x_1 - 0.388x_2 - 0.969x_3 + 1.547x_4 + 0.074x_5 - 0.047x_6 - 0.491 \qquad (4)$$

4.2.2. Experimental Results of Traditional Back-Propagation Neural Network Analysis

The same accuracy threshold of 90% was applied to the models of the first experiment. The production forecast was first obtained by Model 1, which has the neural network structure {1,3,1,2,1}. The output of Model 1 was 179,582 kg, and the actual yield was 191,500 kg. The result from Model 1 was thus 11,918 kg, or 6.64%, less than the actual production. The production forecast with Model 5 (i.e., neural network structure {1,3,2}) was 202,587 kg, which is 11,087 kg greater than the actual yield, giving a network model error of 5.47%.

4.2.3. Ensemble Neural Network Analysis of Experimental Results

The merit of this method is that it applies the accuracy threshold to every network model and combines the models that pass it. In the first experiment, the ENN combined the output values of Models 1 and 5, and the weighted average yield was found to be 191,240 kg. The error rate of the ENN in Experiment 1 was 1.30%, which is smaller than the error rate of traditional BPNs. The error rate was under 2% in Experiments 1, 3 and 4. The error rates of the individual models in Experiments 2 and 5 were higher, so the ENN error rates in Experiments 2 and 5 were also higher. However, considering the weighted average significantly reduced the error rate. Figure 6 depicts the error rate for each experiment, and Figure 7 shows the error rate comparisons of BPNs and the ENN.
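The figures reported for the first experiment can be checked with a few lines of Python. Two assumptions are made here because the paper does not state them explicitly: the BPN percentages are reproduced when the deviation is divided by the predicted value, and the weighted average of 191,240 kg is reproduced when the recall-stage accuracies of Models 1 and 5 are used as weights and normalized by their sum.

```python
actual = 191_500                  # actual yield (kg)
model_1, acc_1 = 179_582, 90.81   # Model 1 output (kg) and accuracy (%)
model_5, acc_5 = 202_587, 93.30   # Model 5 output (kg) and accuracy (%)

# Deviation relative to each model's own prediction (assumed convention).
err_1 = abs(actual - model_1) / model_1
err_5 = abs(model_5 - actual) / model_5

# Accuracy-weighted average of the two models that passed the 90% threshold.
weighted = (acc_1 * model_1 + acc_5 * model_5) / (acc_1 + acc_5)

print(f"Model 1 error: {err_1:.2%}")
print(f"Model 5 error: {err_5:.2%}")
print(f"Weighted average yield: {weighted:,.0f} kg")
```

The two single-model forecasts lie on opposite sides of the actual yield, which is why their weighted average falls much closer to it than either forecast alone.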

Figure 6. ENN results compared to experimental results of each experiment.

Figure 7. The error rates from BPNs and the ENN.

5. Conclusions and Future Work

With the advancement of information technology in various fields and the daily growth in data, neural networks are being widely adopted in industry, business, science and finance. However, the number of hidden layers and neurons is mostly determined by experience or a formula, so a variety of network models is rarely considered. This study utilized stepwise regression analysis and the ENN as design guidelines for agriculture forecast analysis. The ENN method randomly creates a plurality of networks for analysis and forecasting, and combines the results of all remaining network models in order to improve the accuracy of the analysis. Experimental results reveal that the ENN has the lowest error rate and highest accuracy, followed by traditional BPNs and multiple regression analysis.

Acknowledgments: This research was supported by the Ministry of Science and Technology of Taiwan under grant No. MOST 105-2221-E-020-024. This study was also conducted under the “Online and Offline Integrated Smart Commerce Platform (3/4)” of the Institute for Information Industry, which is subsidized by the Ministry of Economic Affairs of Taiwan.


Author Contributions: Hsu-Yang Kung and Chi-Hua Chen conceived and designed the experiments; Pei-Yu Tsai and Ting-Huan Kuo performed the experiments; Hsu-Yang Kung and Pei-Yu Tsai analyzed the data; Pei-Yu Tsai, Chi-Hua Chen and Ting-Huan Kuo contributed analysis tools; Pei-Yu Tsai and Chi-Hua Chen wrote the paper.

Conflicts of Interest: The authors declare no conflict of interest.

© 2016 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/).
