Inflammatory bowel diseases can be severe, but with access to metagenomic data, we can diagnose them and take the necessary steps to prevent further complications. The key to identifying the composition in the human body that causes the disease is carefully selecting features from the metagenomic data. Our research has demonstrated that using the Random Forest machine learning technique to rank the relative abundance of features for disease prediction tasks is reliable. We have also discovered that selecting features ranging from 1 to 50 improves the accuracy of diagnosis. In addition, we have performed an intersection on the Top 10, 20, 30, 40, and 50 features to determine which ones appear in all datasets. Our experiments on six inflammatory bowel disease-related datasets have yielded better results than previous studies.
Tạp chí khoa học Trường Đại học Cần Thơ
Lầu 4, Nhà Điều Hành, Khu II, đường 3/2, P. Xuân Khánh, Q. Ninh Kiều, TP. Cần Thơ
Điện thoại: (0292) 3 872 157; Email: tapchidhct@ctu.edu.vn
Chương trình chạy tốt nhất trên trình duyệt IE 9+ & FF 16+, độ phân giải màn hình 1024x768 trở lên