eprintid: 72441 rev_number: 10 eprint_status: archive userid: 12460 dir: disk0/00/07/24/41 datestamp: 2025-08-21 03:51:11 lastmod: 2025-08-21 03:51:11 status_changed: 2025-08-21 03:51:11 type: thesis metadata_visibility: show contact_email: muh.khabib@uin-suka.ac.id creators_name: Ibnu Raju Humam, NIM.: 21106050047 title: PENGARUH KARAKTERISTIK DATA DAN OPTIMALISASI MODEL TERHADAP KINERJA ALGORITMA KLASIFIKASI ENSEMBLE ispublished: pub subjects: 004. divisions: Informatika(S1) full_text_status: restricted keywords: Klasifikasi, Ensemble Learning, Boosting, Bagging, LASSO, Hyperparameter Tuning note: Dr. Shofwatul Uyun, S.T., M.Kom. abstract: Data classification is a fundamental task in machine learning for recognizing patterns. However, the diversity of data types, such as numerical, categorical, and mixed, poses a challenge in selecting the optimal model. Tree-based algorithms, such as Decision Trees, are frequently used, including ensemble techniques like Random Forest and boosting methods like AdaBoost, Gradient Boosting, LightGBM, and XGBoost. This study aims to evaluate the impact of data type on the performance of these classification algorithms. Additionally, the study assesses the effectiveness of feature selection using LASSO and hyperparameter tuning optimization. The research methodology involves comparing models across three scenarios: (1) a baseline model using all features, (2) a model with LASSO feature selection, and (3) a model with LASSO optimized through hyperparameter tuning. The results show that ensemble boosting algorithms (Gradient Boosting, LightGBM, XGBoost) consistently perform best on numerical and mixed datasets. On the other hand, the effectiveness of optimization through LASSO and tuning showed varying results. However, it has the potential to improve both the F1-Score and computational efficiency, as there is often a trade-off between the two. Evaluation of purely categorical data faces limitations due to the difficulty in finding suitable datasets. date: 2025-07-21 date_type: published pages: 94 institution: UIN SUNAN KALIJAGA YOGYAKARTA department: FAKULTAS SAINS DAN TEKNOLOGI thesis_type: skripsi thesis_name: other citation: Ibnu Raju Humam, NIM.: 21106050047 (2025) PENGARUH KARAKTERISTIK DATA DAN OPTIMALISASI MODEL TERHADAP KINERJA ALGORITMA KLASIFIKASI ENSEMBLE. Skripsi thesis, UIN SUNAN KALIJAGA YOGYAKARTA. document_url: https://digilib.uin-suka.ac.id/id/eprint/72441/1/21106050047_BAB-I_IV-atau-V_DAFTAR-PUSTAKA.pdf document_url: https://digilib.uin-suka.ac.id/id/eprint/72441/2/21106050047_BAB-II_sampai_SEBELUM-BAB-TERAKHIR.pdf