Application of fuzzy DEA and machine learning algorithms in efficiency estimation of paddy producers of rural Eastern India

Abstract
Data envelopment analysis (DEA) has wide applications in the agricultural sector to evaluate the efficiency with crisp input and output data. However, in agricultural production, impreciseness and uncertainty in data are common. As a result, the data obtained from farmers vary. This impreciseness in crisp data can be represented in fuzzy sets. This paper aims to employ a combination of fuzzy data envelopment analysis (FDEA) approach to yield crisp DEA efficiency values by converting the fuzzy DEA model into a linear programming problem and machine learning algorithms for better evaluation and prediction of the variables affecting the farm efficiency. DEA applications are focused on the use of a common two-step approach to find crucial factors that affect efficiency. It is important to identify impactful variables for minimizing production adversities. In this study, first, FDEA was applied for efficiency estimation and ranking of the paddy growers. Second, the support vector machine (SVM) and random forest (RF) were used for identifying the key leading factors in efficiency prediction. The proposed research was conducted with 450 paddy growers. In comparison to the general DEA approach, the FDEA model evaluates fuzzy DEA efficiency giving the user the flexibility to measure the performance at different possibility levels. The use of machine learning applications introduces advanced strategies and important factors influencing agricultural production, which may help future research in farms' performance.