Information Gain and Random Forest for Sex Classification Based on Craniometric Measurements

Nabilla Alya Firana, Iis Afrianty, Novriyanto Novriyanto, Febi Yanto

Sari


Sex identification from human skulls is a crucial aspect of forensic anthropology; however, traditional methods still face limitations such as subjective assessment and inter-population variation. This study proposes the application of Information Gain as a feature selection technique and Random Forest as a classification algorithm for sex determination based on craniometric data. The dataset used is the Howells dataset consisting of 2,524 samples with 83 skull measurement features. Feature selection using Information Gain was performed with threshold values of 0.01, 0.05, and 0.09, followed by additional testing across a threshold range of 0.01 to 0.09. Model evaluation was conducted using 10-Fold Cross Validation with default Random Forest parameters. The results show that a threshold of 0.02 produced 57 selected features from the original 83, achieving the best performance with an accuracy of 87.40%, precision of 87.53%, recall of 87.40%, and F1-score of 87.41%. These results outperform the baseline model without feature selection, which achieved an accuracy of 86.57%. This study demonstrates that Information Gain feature selection can reduce data dimensionality by 31.3% while simultaneously improving sex classification performance based on craniometric data.

Teks Lengkap:

PDF

Referensi


G. Mason and H. Yusuf, “Physics and Geology as a crime solving science,” Criminology, pp. 2162–2175, 2024, [Online]. Available: https://jicnusantara.com/index.php/jicn

F. R. Amalia et al., “Identification of Forensic Odontology in Investigation Case of Human Skeletal Findings: Case Report,” vol. 06, no. 04, pp. 996–1006, 2025, doi: 10.37899/journallamedihealtico.v6i4.

M. Zhang, “The Application of Forensic Imaging to Sex Estimation: Focus on Skull and Pelvic Structures,” Perspect. Leg. Forensic Sci., vol. 1, no. 1, pp. 10005–10005, 2024, doi: 10.35534/plfs.2024.10005.

I. Afrianty, D. Nasien, and H. Haron, “Performance Analysis of Support Vector Machine in Sex Classification of The Sacrum Bone in Forensic Anthropology,” J. Tek. Inform., vol. 15, no. 1, pp. 63–72, Jun. 2022, doi: 10.15408/jti.v15i1.25254.

N. S. Appel and H. J. H. Edgar, “A Pilot Study of Age Estimation and Cause of Death: Insights into Skeletal Aging,” Forensic Sci., vol. 4, no. 4, pp. 508–522, Oct. 2024, doi: 10.3390/forensicsci4040034.

L. Sugara, F. Yanti, and Hanif, “Jurnal Kesehatan Dan Ilmu Kedokteran ( JUKIK ),” J. Kesehat. dan ilmu Kedokt., vol. 06, no. 3, pp. 8–24, 2024.

T. Suryadi, M. J. Ramadhanif, R. P. Sari, A. Wulandari, and F. Kamila, “Identifikasi Pada Jenazah Yang Ditemukan Di Pinggir Pantai,” Indones. J. Leg. Forensic Sci., vol. 11, no. 2, p. 112, 2021, doi: 10.24843/ijlfs.2021.v11.i02.p06.

Arthy, R. Goel, and M. Sreenivas, “Determination of sex by osteometry of third metatarsal,” Indian J. Forensic Med. Toxicol., vol. 14, no. 3, pp. 1–6, 2020, doi: 10.37506/ijfmt.v14i3.10315.

F. Curate, “The Estimation of Sex of Human Skeletal Remains in the Portuguese Identified Collections: History and Prospects,” Forensic Sci., vol. 2, no. 1, pp. 272–286, Mar. 2022, doi: 10.3390/forensicsci2010021.

M. J. S. Mota et al., “Enhancing sex determination in forensic anthropology: A comparative analysis of cranial measurements using artificial neural network,” Forensic Sci. Int. Reports, 2025, doi: 10.1016/j.fsir.2025.100422.

X. Wang, G. Liu, Q. Wu, Y. Zheng, F. Song, and Y. Li, “Sex estimation techniques based on skulls in forensic anthropology: A scoping review,” PLoS One, vol. 19, no. 12, pp. 1–22, 2024, doi: 10.1371/journal.pone.0311762.

D. Nasien, M. H. Adiya, I. Afrianty, N. A. Ali, A. A. Samah, and Y. Rahayu, “Determination of Sex and Race in Forensic Anthropology: A Comparison of Artificial Neural Network and Support Vector Machine,” in 2021 4th International Conference of Computer and Informatics Engineering (IC2IE), IEEE, Sep. 2021, pp. 51–55. doi: 10.1109/IC2IE53219.2021.9649182.

P. Mesejo, R. Martos, Ó. Ibáñez, and J. Novo, “applied sciences A Survey on Artificial Intelligence Techniques for Biomedical Image Analysis in Skeleton-Based Forensic Human Identification,” 2020, doi: 10.3390/app10144703.

S. S. Rahayu, I. Afrianty, E. Budianita, and F. Syafria, “Klasifikasi Tulang Tengkorak Berdasarkan Jenis Kelamin dalam Antropologi Forensik Menggunakan Metode Support Vector Machine,” INOVTEK Polbeng - Seri Inform., vol. 9, no. 1, pp. 243–256, 2024, doi: 10.35314/isi.v9i1.4046.

E. Nikita and P. Nikitas, “On the use of machine learning algorithms in forensic anthropology,” Leg. Med., vol. 47, p. 101771, Nov. 2020, doi: 10.1016/j.legalmed.2020.101771.

E. Faisal and T. L. Rogers, “A review of the literature on the applications of machine learning in forensic anthropology,” Forensic Sci. Int., vol. 376, p. 112579, Nov. 2025, doi: 10.1016/j.forsciint.2025.112579.

W. Yang, M. Zhou, P. Zhang, G. Geng, X. Liu, and H. Zhang, “Skull Sex Estimation Based on Wavelet Transform and Fourier Transform,” Biomed Res. Int., vol. 2020, no. 1, Jan. 2020, doi: 10.1155/2020/8608209.

S. Knecht, F. Santos, Y. Ardagna, V. Alunni, P. Adalian, and L. Nogueira, “Sex estimation from long bones: a machine learning approach,” Int. J. Legal Med., vol. 137, no. 6, pp. 1887–1895, Nov. 2023, doi: 10.1007/s00414-023-03072-4.

M. Stephanie Cole, “Developing Subadult Sex Estimation Standards Using Adult Morphological Sex Traits and an Ontogenetic Approach,” 2022.

P. S. Prabha, A. Ganesan, K. C. Lakshmi, and A. J. Murugan, “Sex determination through analysis of mandibular indices using lateral cephalogram: An Artificial intelligence diagnostics,” Discov. Artif. Intell., vol. 5, no. 1, 2025, doi: 10.1007/s44163-025-00371-0.

I. K. Hasan, R. Resmawan, and J. Ibrahim, “Perbandingan K-Nearest Neighbor dan Random Forest dengan Seleksi Fitur Information Gain untuk Klasifikasi Lama Studi Mahasiswa,” Indones. J. Appl. Stat., vol. 5, no. 1, p. 58, May 2022, doi: 10.13057/ijas.v5i1.58056.

N. Tsawaabul Khair, I. Afrianty, F. Syafria, E. Budianita, and S. Kurnia Gusti, “Penerapan Information Gain Untuk Seleksi Fitur Pada Klasifikasi Jenis Kelamin Tulang Tengkorak Menggunakan Backpropagation,” Media Online), vol. 5, no. 4, pp. 666–678, 2025, doi: 10.47065/bulletincsr.v5i4.637.

S. N. Salsabila, B. N. Sari, and R. Mayasari, “Klasifikasi Ulasan Pengguna Aplikasi Discord Menggunakan Metode Information Gain Dan Naïve Bayes Classifier,” INFOTECH J., vol. 9, no. 2, pp. 383–392, Jul. 2023, doi: 10.31949/infotech.v9i2.6277.

A. Demos, “Penerapan Metode Random Forest Dan Information Gain Untuk Prediksi Kanker Payudara,” 2023, [Online]. Available: https://repository.ithb.ac.id/id/eprint/275/

M. A. N. Anargya, W. Ghozi, and F. A. Rafrastara, “Optimizing IoV Attack Detection Using Random Under Sampling Techniques,” J. Inform. J. Pengemb. IT, vol. 10, pp. 11–19, 2025, doi: 10.30591/jpit.v10i1.8034.

V. W. Lumumba, D. Kiprotich, M. Lemasulani Mpaine, N. Grace Makena, and M. Daniel Kavita, “Comparative Analysis of Cross-Validation Techniques: LOOCV, K-folds Cross-Validation, and Repeated K-folds Cross-Validation in Machine learning Models,” SSRN Electron. J., vol. 13, no. 5, pp. 127–137, 2025, doi: 10.2139/ssrn.5266507.

A. Sunyoto and H. Al Fatta, “Klasifikasi Penyakit Jantung Menggunakan Random Forest Classifier,” J. Sist. Komput. dan Kecerdasan Buatan, vol. VII, no. September, pp. 31–40, 2023.

M. R. Adriana, M. P. Putra, M. H. Rafialdy, and N. A. Rakhmawati, “Perbandingan Metode Klasifikasi Random Forest Dan Svm Pada Analisis Sentimen Psbb,” J. Inform. UPGRIS, vol. 7, no. 1, pp. 36–40, 2021, [Online]. Available: https://doi.org/10.26877/jiu.v7i1.7099




DOI: http://dx.doi.org/10.30811/jaise.v6i2.9409

Refbacks

  • Saat ini tidak ada refbacks.


Indexing :

Creative Commons License
Journal of Artificial Intelligence and Software Engineering (JAISE) licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.