Optimizing Crop Yield Prediction Using C4.5 Classification Algorithm for Sustainable Agriculture
  • Author(s): Sani Umar; J. S. Mshelia; Solomon Makasda Dickson; Aisha Muhammad Hussein; Ahmad Jidda M
  • Paper ID: 1713074
  • Page: 1910-1915
  • Published Date: 25-12-2025
  • Published In: Iconic Research And Engineering Journals
  • Publisher: IRE Journals
  • e-ISSN: 2456-8880
  • Volume/Issue: Volume 9 Issue 6 December-2025
Abstract

Accurate crop yield prediction is a critical component of sustainable agricultural planning, particularly in regions affected by climatic variability and limited resources. This study evaluates the effectiveness of the C4.5 decision tree classification algorithm for crop yield prediction using agro-climatic and soil-related variables. Empirical data obtained from farms in Mubi North, Adamawa State, Nigeria, comprising soil nutrients (N, P, K), pH, organic matter, rainfall, temperature, humidity, and management practices, were used to classify crop yield into low, medium, and high categories. The performance of the optimized C4.5 model was compared with ensemble learning approaches, namely Random Forest and Gradient Boosting classifiers. Model enhancement techniques, including pruning, cost-sensitive learning, and probability calibration, were applied to improve generalization and reliability. Experimental results show that the pruned C4.5 model achieved superior performance, recording 1.00 accuracy, macro-F1 score, and Cohen’s Kappa on the test dataset, outperforming the ensemble models under the given experimental conditions. Feature importance and SHAP-based explainability analysis identified rainfall and soil nutrient levels as the most influential predictors of yield variability. The findings demonstrate that interpretable decision tree models remain highly effective for crop yield prediction in data-constrained agricultural environments and provide actionable insights for sustainable farm management.

Keywords

C4.5, Classification Algorithm, Decision tree, Random forest, Gradient boosting

Citations

IRE Journals:
Sani Umar, J. S. Mshelia, Solomon Makasda Dickson, Aisha Muhammad Hussein, Ahmad Jidda M "Optimizing Crop Yield Prediction Using C4.5 Classification Algorithm for Sustainable Agriculture" Iconic Research And Engineering Journals Volume 9 Issue 6 2025 Page 1910-1915 https://doi.org/10.64388/IREV9I6-1713074

IEEE:
Sani Umar, J. S. Mshelia, Solomon Makasda Dickson, Aisha Muhammad Hussein, Ahmad Jidda M "Optimizing Crop Yield Prediction Using C4.5 Classification Algorithm for Sustainable Agriculture" Iconic Research And Engineering Journals, 9(6) https://doi.org/10.64388/IREV9I6-1713074