Open Access Open Access  Restricted Access Subscription Access

doi:10.3808/jei.202500545
Copyright © 2025 ISEIS. All rights reserved

A Systematic Study of Hyperparameter Tuning for Environmental Text Classification: Implications for Environmental Management

J. J. Kim1, J. Adamowski2, S. Park3, K. Lim3, 4, and H. Jeong1, 5 *

  1. Institute of Environmental Technology, Seoul National University of Science and Technology, Seoul 01811, Republic of Korea
  2. Department of Bioresource Engineering, McGill University, Ste-Anne-de-Bellevue, Quebec H9X 3V9, Canada
  3. Department of Applied Artificial Intelligence, Seoul National University of Science and Technology, Seoul 01811, Republic of Korea
  4. Graduate School of Culture Technology, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea
  5. Department of Environmental Engineering, Seoul National University of Science and Technology, Seoul 01811, Republic of Korea

*Corresponding author. Tel.: +82-2-970-6630. E-mail address: hanjeong@seoultech.ac.kr (H. Jeong)

Abstract


Environmental management increasingly relies on rapid and precise information analysis to resolve critical environmental problems. This study evaluated the effectiveness of hyperparameter tuning and its impact on automatic environmental text classification performance using different Machine Learning (ML) classifiers and term-weighting schemes. Our results indicated that hyperparameter tuning generally enhanced classification performance, with the eXtreme Gradient Boosting (XGBoost) classifier showing the highest performance. The study also highlighted the trade-off between performance improvement and computational cost, i.e., enhanced classification accuracy at the expense of increased execution time. Notably, hyperparameter sensitivity varied among ML classifiers. For example, the Multinomial Naive Bayes classifier was less sensitive to hyperparameter tuning under certain term-weighting schemes. These findings provide new insights into the relationships between hyperparameter optimization, classification performance, and computational efficiency in environmental text classification. They offer valuable guidance for selecting and tuning classifiers to support better-informed decisions in environmental management.

Keywords: Korean news articles, machine learning classifier, environmental big data, term-weighting schemes, text mining in environmental informatics


Full Text:

PDF SI

Supplementary Files:

Refbacks

  • There are currently no refbacks.