Estimation of Leaf Area in Paprika Based on Leaf Length,  Leaf Width,  and Node Number Using Regression Models and an Artificial Neural Network

Joonwoo   Lee; Taewon   Moon; Du Sung  Nam; Kyoung Sub  Park; Jung Eek  Son

doi:10.12972/kjhst.20180019

Preview

Horticultural Science and Technology. 30 April 2018. 183-192
https://doi.org/10.12972/kjhst.20180019

Estimation of Leaf Area in Paprika Based on Leaf Length, Leaf Width, and Node Number Using Regression Models and an Artificial Neural Network

Joonwoo Lee¹

Taewon Moon¹

Du Sung Nam¹

Kyoung Sub Park²

Jung Eek Son¹^*

¹Department of Plant Science and Research Institute of Agriculture and Life Sciences, Seoul National University

²Protected Horticulture Research Institute, National Institute of Horticultural and Herbal Science

^{*교신저자.}^{*Corresponding Author.}

License:

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

ABSTRACT

Leaf area directly affects growth responses and plays an important role in estimating individual leaf growth. Most studies on the subject have used non-destructive estimations of leaf area based on regression analysis of leaf length and width, with the assumption that the leaf shape is constant. For paprika, however, leaf shapes differ depending on the nodes where leaves are attached. The objective of this study was to estimate leaf area using not only the leaf length and width but also the node number. Paprika leaves were collected ten months after transplanting, and the leaf length, width, area, and shape ratio (= leaf length/width), as well as node number, were measured. Leaf length and width measurements led to the development of regression equations; among them, equations with strong correlations were chosen and used in validation. The measured leaf length and width and node number were used to train a selected artificial neural network (ANN, Google Tensorflow). A regression equation using only leaf area and width estimated leaf areas with high accuracy, while the accuracy significantly decreased when the equation was separately applied to the upper and lower leaves. This result was likely due to the shape characteristics of the leaves; newly-formed leaves were thin and long, whereas those of developed leaves were broad and thick. Therefore, the length/width ratios of the upper and lower leaves were different. The regressions including the node number in the model resulted in higher R2 values with higher estimation accuracy than the previous regression equations for a variety of leaf positions. The ANN estimated areas of leaves located in a variety of positions with higher accuracy using a simpler process than both regression equations. In conclusion, not only the leaf length and width but also the node number are important to estimate leaf area in paprika, and ANN is an effective tool to analyze growth characteristics using various indicators.

Keywords

canopy structure

leaf length-width ratio

leaf shape

machine learning

non-destructive method

MAIN

Introduction
Material and Methods
Growth Conditions
Measurement of Leaf Areas
Selection of Regression Models
Structure and Training of Artificial Neural Network (ANN)
Verification of Regression Models and ANN
Results and Discussion
Characteristics of Paprika Leaves
Selection of Regression Models
Estimation and Verification of Regression Model
Training and Verification of ANN
Discussion

Introduction

Paprika (Capsicum annuum L.), a high-value horticultural crop, requires a significant level of environmental control for successful cultivation. Under favorable greenhouse conditions, paprika grows rapidly to form a dense community with high stem elongation; therefore, a simple and accurate model to estimate growth conditions is needed (Cho et al., 2007). Leaf area is an important trait in plant growth and yield analysis, and affects transpiration, photosynthesis, and response to irrigation and fertilization. Thus, estimation of individual leaf growth in greenhouse studies is crucial (Launay and Guerif, 2003; Rosenthal and Vanderlip, 2004; Blanco and Folegatti, 2005). Destructive methods are commonly used to measure leaf area but are time-consuming, have high cost, and may affect light interception and growth patterns of neighbor crops. Therefore, a simple method is needed for estimating leaf area through indices that can be measured through non-destructive methods (Peksen, 2007). Among several indices, the leaf length and width are used to estimate the leaf area of various crops (Gamiely et al., 1991; Montero et al., 2000; Cho et al., 2007; Tai et al., 2009; Jung et al., 2016). These studies were conducted based on the assumption that all leaves from a single crop are constant in shape, so that leaf area can be estimated using a regression analysis of the leaf length and width. However, even genetically identical crops have varying leaf morphologies (Dickinson et al., 1987, Weight et al., 2008; Xia et al., 2013). Paprika, which has a number of growth stages, is known to have non-uniform leaf forms during varying leaf ages (Xia et al., 2013). According to de Swart et al. (2004), the estimation accuracy of paprika leaf areas via a modified model that accounted for leaf age was higher than that via a regression model using only the leaf length and width. Therefore, accounting for leaf age is crucial in the development of models that accurately estimate individual leaf areas. The age of leaves, however, needs to be continuously identified starting at early growth stage. Thus, estimating leaf area with measurements conducted in a specific aperiodic stage is difficult, and an indirect leaf age indicator is required. The objective of this study was to develop an accurate estimation model for paprika leaf areas by adding a node number variable to the model developed using leaf length and width. In addition to the existing regression models, an artificial neural network model was applied for more accurate and precise results.

Materials and Methods

Growth Conditions

The paprika (Capsicum annuum L. ‘Cupra’) plants to be measured were cultivated in a Venlo greenhouse at the Protected Horticulture Research Institute (RDA) based in Haman, Gyeongsangnam-do (35.249° N, 128.425° E), Korea. Paprika plants grown over ten months of transplanting were used in the experiment because all of their leaves could be sampled regardless of size. Nutrient solutions were maintained at an electrical conductivity (EC) of 2.5-3.0 dS·m^-1and a pH of 5.8-6.0. Air temperatures in the greenhouse were maintained at 25/18°C (day/night) with ventilation and heating.

Measurement of Leaf Areas

A total of 442 leaf samples were randomly selected from six individual paprikas. The widely-used traits leaf length and width were measured, and the number of nodes where the leaves were attached was recorded from the standard of the first node. Leaf length was measured from the end of the petiole to the opposite end of the leaf, and the leaf was measured after it was flattened. Leaf width was measured in the direction perpendicular to the leaf area, and the longest leaf width was measured in the middle part of the leaf. The leaf shape ratio (length/width) was calculated for each leaf, and leaf area was measured using a leaf area meter (Li-3100, LI-COR, Lincoln, NE, USA).

Selection of Regression Models

To estimate leaf areas, 18 types of regression equations were selected as standard models (Cho et al., 2007; Table 1). The regression model was used to estimate leaf area by expressing the leaf area (L) and width (W) and the number of nodes with leaves (N) as linear variables. Values with high correlations to leaf area, such as leaf length and width, were combined with multiplication. The number of nodes with leaves, however, was not strongly correlated with leaf area, so the number could not be combined with multiplication. Standard models (Eqs. 1-9) expressed with linear or multiplication equations of leaf area and width have been used widely. The modified model (Eqs. 10-18), which added the number of nodes with leaves corresponding to 1-9 as a variable, was also used.

Table 1. Regression models for estimating leaf area (LA). L, W, and N indicate leaf length, width, and node number, respectively

http://static.apub.kr/journalsite/sites/kshs/2018-036-02/N0130360204/images/Table_HST_36_02_04_T1.jpg

Regression analysis was performed using the SPSS statistical software (IBM, New York, NY, USA) for 18 models randomly selected from 337 samples of a combined 442 leaves. Sigmaplot (Systat Software, San Jose, CA, USA) was used to plot the graphs. The model with the highest R² value was selected for each regression analysis, and that with the lowest root mean square error (RMSE) was selected when the R² values were the same.

Structure and Training of Artificial Neural Network (ANN)

A machine learning method was also applied for estimating leaf areas using the node numbers in addition to the leaf area and width. The machine learning tool used in the present study was Tensorflow (v. 1.2.1, Google, Menlo Park, CA, USA), a numerical computation library based on the programming language Python. An ANN consists of input, hidden, and output layers, with each layer consisting of perceptrons. The perceptrons of the input layer correspond to leaf area and width and node number, and those of the output layer correspond to leaf area. In the hidden layer, the value received from the input layer was multiplied by the weight and the input information passed through the activation function. In the present study, each of the five hidden layers consisted of 512 perceptrons (Fig. 1). The rectified linear unit (ReLU) function was used as the active function, and learning proceeded to lower the mean square error (MSE).

http://static.apub.kr/journalsite/sites/kshs/2018-036-02/N0130360204/images/Figure_HST_36_02_04_F1.jpg

Fig. 1. Structure of artificial neural network including input (I), hidden (H₁- H₅), and output layers (O).

Verification of Regression Models and ANN

The verification procedure was conducted on the most reliable model derived from regression analysis and on the training results of the ANN. The regression models and the ANN were verified using 105 leaf samples excluded in regression modeling and ANN training. For comparison, regression analysis for 1: 1 line was conducted with SPSS program, and model reliability was verified by checking the R² value.

Results and Discussion

Characteristics of Paprika Leaves

The relationship between measured leaf length and width and estimated leaf area of 442 leaves was plotted (Fig. 2). As the leaf length and width increased, the leaf area grew linearly from 9.26 to 268.92 cm². In the relationship between node number and leaf area, an increase in the number of nodes with leaves correlated with an increase in leaf area (Fig. 3A). This result is due to physiological characteristics of Solanaceae crops, in which new nodes are developing at the top of the previous node, and to the characteristics of the defoliation of cultivated paprika, in which only one or two leaves are left in each node (Lohr and Sudkamp, 1989). The leaf shape ratio of paprika plants also changed depending on growth stage (Fig. 3B). The newly-emerged leaves had a longer length with a higher leaf ratio, but the ratio decreased as leaf growth progressed. It is known that leaves constituting the paprika are not uniform in leaf shape because they occurred at different times and have been grown under different environmental conditions (de Swart et al. 2013). Nevertheless, leaf ratio of the paprika leaves was correlated with node number (Fig. 3B). Therefore, the present study confirmed that the node where the leaves are present is an appropriate indicator of paprika leaf ages and shapes.

http://static.apub.kr/journalsite/sites/kshs/2018-036-02/N0130360204/images/Figure_HST_36_02_04_F2.jpg

Fig. 2. Relationships between leaf length, width, and area in paprika (n = 442).

http://static.apub.kr/journalsite/sites/kshs/2018-036-02/N0130360204/images/Figure_HST_36_02_04_F4.jpg

Fig. 3. Relationships between leaf area and node number (A) and between leaf shape ratio (= leaf length/leaf width) and node number in paprika (B) (n = 442).

Selection of Regression Models

Table 2 shows the regression results for the leaf areas of 18 paprika plants according to leaf length and width and node number. Among the regression results of Eqs. 1-9, which were expressed as leaf length and width, Eq. 5 consisted of the multiplication terms of the leaf length and width and showed the highest accuracy, with an R² and RMSE of 0.92 and 16.301, respectively. This model was similar to that selected in previous studies on paprika (Tai et al., 2009). The regression analysis of the modified model equations (Eqs. 10-18) improved accuracy thanks to the addition of node number as a new variable. Eqs. 10 and 12, modified from Eqs. 1 and 3, respectively, had relatively higher accuracy, while Eqs. 13 and 18, modified from Eqs. 4 and 9, showed no increase in R² but a lower RMSE. In general, the modified models that included the node number had higher accuracy and lower RSMEs than the standard models.

Table 2. Coefficients of regression models for estimating leaf areas (LA) in paprika (n = 337). L, W, and N indicate leaf length, width, and node number, respectively
All variables in the models above are significant at p = 0.05.

Estimation and Verification of Regression Model

Equations 5 and 14 were selected as the most reliable models for estimating leaf area in the standard and modified models, respectively. Verification was performed between the estimated and measured leaf areas for 105 leaves, which were separately classified without using the regression process (Fig. 4A). As a result, Eqs. 5 and 6 showed the highest accuracy, each with an R² of 0.95.

Leaf size was classified into three groups: big (over 150 cm²), middle (150-50 cm²), and small (under 50 cm²; Fig. 4B-4D). Accuracy differed when Eqs. 5 and 14 were applied to the three groups. When the standard model Eq. 5 was applied to each group, the areas of middle and small size leaves showed stable prediction accuracy (Fig. 4B and 4C). The highest accuracy was seen in the middle size because most paprika leaves were of middle size, and this consequently affected the regression analysis. Accuracy was poor, however, among the big size leaves (R²= 0.48), indicating the inapplicability of Eq. 5 to leaf area prediction for large leaves. On the other hand, the modified model Eq. 14 showed a higher R² than Eq. 5 for all leaf sizes. Even among the big leaf sizes accuracy was relatively higher (R² = 0.83), because the node number was crucial in reflecting the change of leaf shape over growth stages.

Fig. 4. Comparison of measured and estimated leaf areas in paprika using Eqs. 5 and 14 [A, whole verification data (n = 105); B, small size group (n = 20); C, middle size group (n = 66); D, big size group (n = 19)]. The equations are listed in Table 2.

The mean difference in the total leaf area of each plant estimated by Eq. 5 was larger than those estimated by Eq. 14 (Fig. 5). The accuracy of the standard model was low for big size leaves compared to other methods. The p-values for Eqs. 5 and 14 in the Fisher least-significant difference post-test were 0.427 and 0.904, respectively (Table 3). Therefore, it would be more adequate to apply the modified model using node numbers. Under regression analysis using only leaf length and width, non-uniformity was shown in the leaf areas of each leaf size group. The use of the same number of leaves by growth stage in regression analysis has been reported to solve this problem (Jung et al., 2016). It is difficult, however, to estimate the leaf area of an individual paprika plant because old leaves of over 150 cm² occupy a high portion of the paprika plant, while their number is lower than that of new or growing leaves. The leaf area of each individual paprika plant in a specific growth period was considered especially useful in the prediction of irrigation control (Ferrara et al., 2011; Patanè, 2011; Shin et al., 2011), estimation of photosynthesis (Díaz-Pérez, 2013), and distribution management of assimilation products (Marcelis et al., 2004; González-Real et al., 2009). Therefore, use of the modified model including node numbers to non-destructively estimate leaf area in paprika is appropriate, in addition to utilizing leaf length and width.

http://static.apub.kr/journalsite/sites/kshs/2018-036-02/N0130360204/images/Figure_HST_36_02_04_F5.jpg

Fig. 5. Mean difference in the total leaf area of each plant measured and estimated by Eq. 5, Eq. 14, and the artificial neural network (ANN) for six paprika plants. Vertical bars represent the mean ± SD (n = 6).

Table 3. One-way ANOVA on the difference in leaf area between measured and estimated by Eq. 5, Eq. 14, and the artificial neural network (ANN)
All variables in the models above are significant at p = 0.05.

Training and Verification of ANN

The training frequency of the ANN was 70,000 times, assuming that the number of training times was one when all data were used, and training accuracy had an R² of 0.92. Using the trained ANN with leaf length and width and node number, verification performed between the estimated and measured leaf areas of 105 leaves showed an R² of 0.97, which was higher than those of Eqs. 5 and 14 (Fig. 6A). For all groups classified by leaf size, the ANN also showed higher accuracy than regression analysis (Fig. 6B-6D). The p-value was 0.917, showing a higher accuracy of individual leaf area estimation than regression models (Table 3).

http://static.apub.kr/journalsite/sites/kshs/2018-036-02/N0130360204/images/Figure_HST_36_02_04_F6.jpg

Fig. 6. Comparison of measured and estimated leaf areas in paprika using the artificial neural network [A, whole verification data (n = 105); B, small size group (n = 20); C, middle size group (n = 66); D, big size group (n = 19)].

ANN is a suitable algorithm to use big data that are difficult to analyze manually, and its accuracy becomes higher with the use of more data (Trejo-Perea et al., 2009). Because this study only used 1,768 points (leaf length, leaf width, node number, and leaf area of 442 leaves), the ANN did not show a dramatic improvement in estimation accuracy. However, the application of ANN to leaf area estimation has advantages over the regression analysis method, in addition to higher accuracy. In this case, adding only the node number increased the model accuracy, without the use of additional equipment or labor for measuring leaf areas. However, many variables such as SPAD, leaf length and shape ratio, and environmental factors have been suggested for improving the accuracy of leaf area estimation (Lu et al., 2004; Serdar and Demirsoy, 2006; Cho et al., 2007; Antunes et al., 2008; Jung et al., 2016). Use of ANNs is suitable for finding optimal combinations through fast and simple methods with high accuracy, while deriving the relationship between factors and results in complicated systems via regression analysis is difficult (Vaidyanathan et al., 2015; Taormina and Chau, 2016; Wang et al., 2016). Therefore, ANN is anticipated to be a more accurate and faster method of nondestructive estimation of crop growth indicators such as leaf area. Since leaf area is an important growth index related to crop growth and yield, the nondestructive prediction of leaf area with simple indicators is important.

The present study proposed the node number of leaves as a measurable variable that requires no additional equipment or labor and can improve the accuracy of models for leaf area estimation. Furthermore, use of ANN allowed estimation of leaf areas with higher precision than the regression analysis method. Therefore, the results of the present study may help devise an effective method to estimate leaf area in paprika.

References

Aminifard MH, Aroiee H, Ameri A, Fatemi, H (2012) Effect of plant density and nitrogen fertilizer on growth, yield and fruit quality of sweet pepper (Capsicum annuum L.). Afr J Agric Res 7:859-866

Antunes WC, Pompelli MF, Carretero DM, DaMatta FM (2008) Allometric models for non-destructive leaf area estimation in coffee (Coffea arabica and Coffea canephora). Ann Appl Biol 153:33-40. doi.org/10.1111/j.1744-7348.2008.00235.x

Blanco FF, Folegatti MV (2005) Estimation of leaf area for greenhouse cucumber by linear measurements under salinity and grafting. Sci Agric 62:305-309. doi.org/10.1590/S0103-90162005000400001

Cho YY, Oh SB, Oh MM, Son JE (2007) Estimation of individual leaf area, fresh weight, and dry weight of hydroponically grown cucumbers (Cucumis sativus L.) using leaf length, width, and SPAD value. Sci Hortic 111:330-334. doi.org/10.1016/j.scienta.2006. 12.028

de Swart EAM, Groenwold R, Kanne HJ, Stam P, Marcelis LFM, Voorrips RE (2004) Non-destructive estimation of leaf area for different plant ages and accessions of Capsicum annuum L. J Hortic Sci Biotechnol 79:764-770. doi.org/10.1080/14620316.2004.11511840

Díaz-Pérez JC (2013) Bell pepper (Capsicum annuum L.) crop as affected by shade level: Microenvironment, plant growth, leaf gas exchange, and leaf mineral nutrient concentration. HortScience 48:175-182

Díaz-Pérez JC (2014) Bell Pepper (Capsicum annuum L.) crop as affected by shade level: Fruit yield, quality, and postharvest attributes, and incidence of Phytophthora blight (caused by Phytophthora capsici Leon.). HortScience 49:891-900

Dickinson TA, Parker WH, Strauss RE (1987) Another approach to leaf shape comparisons. Taxon 36:1-20. doi.org/10.2307/1221345

Ferrara A, Lovelli S, Di Tommaso T., Perniola M (2011) Flowering, growth and fruit setting in greenhouse bell pepper under water stress. J Agron 10:12-19. doi.org/10.3923/ja.2011.12.19

Gamiely S, Randle WM, Mills HA, Smittle DA (1991) Greenhouse energy consumption prediction using neural networks models. Int J Agric Biol 11:1-6

González-Real MM, Liu HQ, Baille A (2009) Influence of fruit sink strength on the distribution of leaf photosynthetic traits in fruit-bearing shoots of pepper plants (Capsicum annuum L.). Environ Exp Bot 66:195-202. doi.org/10.1016/j.envexpbot.2009.01.005

Jung DH, Cho YY, Lee JG, Son JE (2016) Estimation of leaf area, leaf fresh weight, and leaf dry weight of irwin mango grown in greenhouse using leaf length, leaf width, petiole length, and SPAD value. Protected Hortic Plant Fac 25:146-152. doi.org/10.12791/ KSBEC.2016.25.3.146

Jung DH, Kim D, Yoon HI, Moon TW, Park KS, Son JE (2016) Modeling the canopy photosynthetic rate of romaine lettuce (Lactuca sativa L.) grown in a plant factory at varying CO₂ concentrations and growth stages. Hortic Environ Biotechnol 57:487-492

Launay M, Guérif M (2003) Ability for a model to predict crop production variability at the regional scale: an evaluation for sugar beet. Agronomie 23:135-146. doi.org/10.1051/agro:2002078

Lohr VI, Sudkamp AB (1989) Pruning responses of tissue-cultured plantlets of Rhododendrons. J Environ Hortic 7:23-25

Lu HY, Lu CT, Wei ML, Chan LF (2004) Comparison of different models for nondestructive leaf area estimation in taro. Agron J 96:448-453. doi.org/10.2134/agronj2004.4480

Marcelis LFM, Heuvelink E, Baan Hofman-Eijer LR, Den Bakker J, Xue LB (2004) Flower and fruit abortion in sweet pepper in relation to source and sink strength. J Exp Bot 55:2261-2268. doi.org/10.1093/jxb/erh245

Montero F, de Juan JA, Cuesta A, Brasa A (2000) Nondestructive methods to estimate leaf area in Vitis vinifera L. HortScience 35:696-698

Park KS, Bekhzod K, Kwon JK, Son JE (2016) Development of a coupled photosynthetic model of sweet basil hydroponically grown in plant factories. Hortic Environ Biotechnol 57:20-26

Patanè C (2011) Leaf area index, leaf transpiration and stomatal conductance as affected by soil water deficit and VPD in processing tomato in semi-arid Mediterranean climate. J Agron Crop Sci 197:165-176. doi.org/10.1111/j.1439-037X.2010.00454.x

Peksen E (2007) Non-destructive leaf area estimation model for faba bean (Vicia faba L.). Sci Hortic 113:322–328. doi.org/10.1016/ j.scienta.2007.04.003

Rosenthal WD, Vanderlip RL (2004) Simulation of individual leaf areas in grain sorghum. Agronomie 24:493-501. doi.org/10.1051/ agro:2004046

Serdar U, Demirsoy H (2006) Non-destructive leaf area estimation in chestnut. Sci Hortic 108:227-230. doi.org/10.1016/j.scienta. 2006.01.025

Sharma VK, Semwal CS, Uniyal SP (2010) Genetic variability and character association analysis in bell pepper (Capsicum annuum L.). J Hortic Forest 2:58-65

Shin JH, Ahn TI, Son JE (2011) Modeling of transpiration of paprika (Capsicum annuum L.) plants based on radiation and leaf area index in soilless culture. Hortic Environ Biotechnol 52:265-269. doi.org/10.1007/s13580-011-0216-3

Tai NH, Ahn TI, Park JS, Son JE (2009) Estimation of leaf area, fresh weight, and dry weight of Paprika (Capsicum annuum L.) using leaf length and width in rockwool-based soilless culture. Hortic Environ Biotechnol 50:422-426

Taormina R, Chau KW (2015) Neural network river forecasting with multi-objective fully informed particle swarm optimization. J Hydroinform 17:99-113. doi.org/10.2166/hydro.2014.116

Vaidyanathan S (2015) 3-cells cellular neural network (CNN) attractor and its adaptive biological control. Int J Pharmtech Res 8:632-640

Wang T, Gao H, Qiu J (2016) A combined adaptive neural network and nonlinear model predictive control for multirate networked industrial process control. IEEE Trans Neural Netw Learn Syst 27:416-425. doi.org/10.1109/TNNLS.2015.2411671

Weight C, Parnham D, Waites R (2008) Technical advance: LeafAnalyser: a computational method for rapid and large‐scale analyses of leaf shape variation. Plant J 53: 578-586. doi.org/10.1111/j.1365-313X.2007.03330.x

Xia C, Lee JM, Li Y, Song YH, Chung BK, Chon TS (2013) Plant leaf detection using modified active shape models. Biosyst Eng 116:23-35. doi.org/10.1016/j.biosystemseng.2013.06.003

Horticultural Science and Technology 원예과학기술지 ISSN:1226-8763(Print) 2465-8588(Online)

Preview

Estimation of Leaf Area in Paprika Based on Leaf Length, Leaf Width, and Node Number Using Regression Models and an Artificial Neural Network

ABSTRACT

MAIN

References