Abstract
The quality of red wine is crucial for both consumers and producers, influencing purchasing decisions and product improvements. This study aims to enhance red wine quality prediction models through effective feature selection and model optimization. By employing feature engineering to construct and assess feature contributions, the study identifies the best feature combinations and utilizes a five-dimensional evaluation framework of accuracy, precision, recall, F1 score, and Area Under the Receiver Operating Characteristic Curve (AUC-ROC) to screen various models. The research integrates new feature combinations with the optimal model and compares performance before and after feature selection through cross-validation, focusing on stability and generalization. The findings reveal that the Random Forest model, when combined with feature selection, outperforms models using original features in terms of generalization and stability. Key features such as alcohol content and free Sulphur dioxide significantly enhance prediction accuracy. However, new feature construction does not always improve model performance and may introduce noise. These results not only offer practical insights for production and quality control but also underscore the importance of careful feature selection in model prediction, contributing valuable academic knowledge to the field.
Original language | English |
---|---|
Title of host publication | Proceedings of the 3rd International Conference on Financial Technology and Business Analysis |
Editors | Ursula Faura-Martínez |
Publisher | EWA Publishing |
Pages | 1-8 |
Number of pages | 8 |
ISBN (Electronic) | 9781835588284 |
ISBN (Print) | 9781835588277 |
DOIs | |
Publication status | Published - Jan 2025 |
Event | 3rd International Conference on Financial Technology and Business Analysis - Murcia, Spain Duration: 4 Dec 2024 → 4 Dec 2024 https://www.icftba.org/3nd.html https://www.ewadirect.com/proceedings/aemps/volume/view/614 |
Publication series
Name | Advances in Economics Management and Political Sciences |
---|---|
Publisher | EWA Publishing |
Volume | 139 |
ISSN (Print) | 2754-1169 |
Conference
Conference | 3rd International Conference on Financial Technology and Business Analysis |
---|---|
Country/Territory | Spain |
City | Murcia |
Period | 4/12/24 → 4/12/24 |
Internet address |
User-Defined Keywords
- Red Wine Quality
- Feature Selection
- Random Forest
- Model Optimization