BERT-Promoter: An improved sequence-based predictor of DNA promoter using BERT pre-trained model and SHAP feature selection

Hướng dẫn

Tìm kiếm nâng cao

Tựa bài viết

Tìm

Tác giả

Năm xuất bản

Tóm tắt

Lĩnh vực

Phân loại

Số tạp chí

Bản tin định kỳ

Báo cáo thường niên

Tạp chí khoa học ĐHCT

Tạp chí tiếng anh ĐHCT

Tạp chí trong nước

Tạp chí quốc tế

Kỷ yếu HN trong nước

Kỷ yếu HN quốc tế

Book chapter

BERT-Promoter: An improved sequence-based predictor of DNA promoter using BERT pre-trained model and SHAP feature selection

Số tạp chí 99:107732(2022) Trang:

Tác giả: Lê Nguyễn Quốc Khánh, Hồ Quang Thái, Nguyễn Văn Núi, Jung Su Chang

Tạp chí: Computational Biology and Chemistry

Liên kết: https://doi.org/10.1016/j.compbiolchem.2022.107732

Tóm tắt

A promoter is a sequence of DNA that initializes the process of transcription and regulates whenever and wherever genes are expressed in the organism. Because of its importance in molecular biology, identifying DNA promoters are challenging to provide useful information related to its functions and related diseases. Several computational models have been developed to early predict promoters from high-throughput sequencing over the past decade. Although some useful predictors have been proposed, there remains short-falls in those models and there is an urgent need to enhance the predictive performance to meet the practice requirements. In this study, we proposed a novel architecture that incorporated transformer natural language processing (NLP) and explainable machine learning to address this problem. More specifically, a pre-trained Bidirectional Encoder Representations from Transformers (BERT) model was employed to encode DNA sequences, and SHapley Additive exPlanations (SHAP) analysis served as a feature selection step to look at the top-rank BERT encodings. At the last stage, different machine learning classifiers were implemented to learn the top features and produce the prediction outcomes. This study not only predicted the DNA promoters but also their activities (strong or weak promoters). Overall, several experiments showed an accuracy of 85.5 % and 76.9 % for these two levels, respectively. Our performance showed a superiority to previously published predictors on the same dataset in most measurement metrics. We named our predictor as BERT-Promoter and it is freely available at https://github.com/khanhlee/bert-promoter.

Các bài báo khác

First-principles study of electronic and optical properties of small edge-functionalized penta-graphene quantum dots

Số tạp chí 12(2022) Trang:

Tác giả: Đặng Minh Triết, Phạm Thị Bích Thảo, Trần Thị Ngọc Thảo, Nguyễn Thành Tiên

Tạp chí: AIP Advances

Tóm tắt

Effect of phosphorus doping positions on electronic transport properties in the sawtooth penta-graphene nanoribbon- First-principles insights

Số tạp chí 353(2022) Trang: 114859

Tác giả: Vo Trung Phuc, Nguyễn Thành Tiên, Phạm Thị Bích Thảo, Rajeev Ahuja

Tạp chí: Solid State Communications

Tóm tắt

A comparison study of the structural, electronic and electronic transport properties of nanoribbons based on Penta-graphene, Penta-P2C and Penta-SiC2

Số tạp chí 32(2022) Trang:

Tác giả: Trần Yến Mi, Huỳnh Anh Huy, Nguyễn Thành Tiên

Tạp chí: Materials Today Communications

Tóm tắt

Determining atmospheric electric fields using MGMR3D

Số tạp chí 105(2022) Trang:

Tác giả: Trịnh Thị Ngọc Gia, O. Scholten, S. Buitink, K. D. de Vries, P. Mitra, Nguyễn Thanh Phong, D. T. Si

Tạp chí: Physical Review D

Tóm tắt

Identifying SNARE Proteins Using an Alignment-Free Method Based on Multiscan Convolutional Neural Network and PSSM Profiles

Số tạp chí 62 (19), 4820-4826(2022) Trang: 4820-4826

Tác giả: Kha Quang Hiền, Hồ Quang Thái, Lê Nguyễn Quốc Khánh

Tạp chí: Journal of Chemical Information and Modeling

Tóm tắt

Deep transformers and convolutional neural network in identifying DNA N6-methyladenine sites in cross-species genomes

Số tạp chí 204(2022) Trang: 199-206

Tác giả: Lê Nguyễn Quốc Khánh, Hồ Quang Thái

Tạp chí: Methods

Tóm tắt

A new species of Bulbophyllum from Northern of Vietnam

Số tạp chí 542(2022) Trang: 095-099

Tác giả: Nguyễn Văn Tú, Leonid V. Averyanov, Đặng Văn Sơn, Tatiana Maisak, Bùi Văn Hướng, Đặng Minh Quân, Sung Min Boo, Trương Bá Vương

Tạp chí: Phytotaxa

Tóm tắt

A new species of Lasianthus (Rubiaceae) from Kon Chu Rang Nature Reserve in central highlands of Vietnam

Số tạp chí 541(2022) Trang: 291–296

Tác giả: Nguyễn Đình Hiệp, Đặng Minh Quân, Le Ngan Thi Kim, Quach Van Toan Em, Pham Van Ngot, Le Van Tho, Trương Bá Vương, Akiyo Naiki, Đặng Văn Sơn

Tạp chí: Phytotaxa

Tóm tắt

Two new species of Lasianthus Jack (Rubiaceae) from southern Vietnam

Số tạp chí 806(2022) Trang: 19–31

Tác giả: Đặng Minh Quân, Nguyen Manh Ha, Hoang Ngia Son, Le Van Tho, Nguyen Thi Mai Huong, Ho Nguyen Quynh Chi, Trương Bá Vương, Đặng Văn Sơn

Tạp chí: European Journal of Taxonomy

Tóm tắt

Effects of Substrate Temperature on Nanomechanical Properties of Pulsed Laser Deposited Bi2Te3 Films

Số tạp chí 12(2022) Trang: 871

Tác giả: Hui-Ping Cheng, Lê Thị Cẩm Tuyên, Lê Hữu Phước, Sheng-Rui Jian, Yu-Chen Chung, I-Ju Teng, Chih-Ming Lin, Jenh-Yih Juang

Tạp chí: Coatings

Tóm tắt

Resistance Induction by Salicylic Acid Formulation in Cassava Plant against Fusarium solani

Số tạp chí 38(2022) Trang: 212-219

Tác giả: Chanon Saengchan, Lê Thanh Toàn, Piyaporn Phansak, Kanjana Thumanu, Supatcharee Siriwong, Rungthip Sangpueak, Wannaporn Thepbandit, Narendra Kumar Papathoti, Natthiya Buensanteai

Tạp chí: The Plant Pathology Journal

Tóm tắt

Spatial and temporal variabilities of surface water and sediment pollution at the main tidal-influenced river in Ca Mau Peninsular, Vietnamese Mekong Delta

Số tạp chí 41(2022) Trang: 1-17

Tác giả: Lê Văn Mười, Chotpantarat Srilert, Văn Phạm Đăng Trí, Phạm Văn Toàn

Tạp chí: Journal of Hydrology: Regional Studies

Tóm tắt

Evaluation of water loss and solute uptake during osmotic treatment of white radishes (Raphanus sativus L.) in saltsucrose solution

Số tạp chí x(2022) Trang: 1-7

Tác giả: Nguyễn Minh Thủy, Võ Quang Minh, Nguyen Thi Ngoc Tham, Phạm Thanh Vũ, Ngô Văn Tài

Tạp chí: PLANT SCIENCE TODAY

Tóm tắt

Effect of conventional and ultrasonic-assisted extracts on betacyanin content of red dragon fruit (Hylocereus polyrhizus)

Số tạp chí 6(2022) Trang: 389-395

Tác giả: Nguyễn Minh Thủy, Phạm Thị Bé Ngọc, Ngô Văn Tài

Tạp chí: Food Research

Tóm tắt

Replacing a part of wheat flour with starchy food containing high levels of resistant starch in noodles processing

Số tạp chí 6(2022) Trang: 396-402

Tác giả: Kiều Minh Vương, Nguyễn Minh Thủy, Nguyễn Bích Trâm, Lê Ngọc Tuyền, Lê Thị Tường Vy, Ngô Văn Tài

Tạp chí: Food Research

Tóm tắt

Effect of Foaming Conditions on Foam Properties and Drying Behavior of Powder from Magenta (Peristrophe roxburghiana) Leaves Extracts

Số tạp chí 8(2022) Trang: 546

Tác giả: Nguyễn Minh Thủy, Võ Quốc Tiến, Ngô Văn Tài, Võ Quang Minh

Tạp chí: Horticulturae

Tóm tắt

Effect of different cooking conditions on resistant starch and estimated glycemic index of macaroni

Số tạp chí 10(2022) Trang: 151-157

Tác giả: Nguyễn Minh Thủy, Ngô Văn Tài

Tạp chí: Journal of Applied Biology & Biotechnology

Tóm tắt

Changes in quality properties of anthocyanin, protein and amylose contents in colored rice grains during storage

Số tạp chí 16(2022) Trang: 389-393

Tác giả: Lê Thị Kim Loan, Nguyễn Minh Thủy, Lê Hữu Hải, Nguyen Thi Tho, Tran Dang Khanh

Tạp chí: Australian Journal of Crop Science

Tóm tắt

Employing the lens of andragogy theory to understand Vietnamese tertiary EFL lecturers’ perceived needs for professional development (PD)

Số tạp chí 2022(2022) Trang: 1-17

Tác giả: Ngô Huỳnh Hồng Nga, Sue Cherrington

Tạp chí: Studies in the Education of Adults

Tóm tắt

Resistant starch in various starchy vegetables and the relationship with its physical and chemical characteristics

Số tạp chí 10(2022) Trang: 181-188

Tác giả: Nguyễn Minh Thủy, beverly cheruto Too, Kiều Minh Vương, Phan Thị Trúc Lan, Phan Thị Thanh Tuyền, Nguyễn Bích Trâm, Lê Thị Tường Vy, Lê Ngọc Tuyền, Ngô Văn Tài

Tạp chí: Journal of Applied Biology & Biotechnology

Tóm tắt

Application of butterfly pea flower extract in processing some Vietnamese traditional foods

Số tạp chí 10(2022) Trang: 143-150

Tác giả: Nguyễn Minh Thủy, Trần Chí Bên, Phạm Thị Bé Ngọc, Ngô Văn Tài

Tạp chí: Journal of Applied Biology & Biotechnology

Tóm tắt

Effect of thickness of polyethylene packaging and temperature on quality of solar-dried oyster mushroom (Pleurotus sajor-caju)

Số tạp chí 9(2022) Trang: 722-727

Tác giả: Nguyễn Thị Ngọc Giang, Nguyễn Minh Thủy, Tran Van Khai

Tạp chí: Plant Science Today

Tóm tắt

Chitosan Nanoparticles-Based Ionic Gelation Method: A Promising Candidate for Plant Disease Management

Số tạp chí 14(2022) Trang: 1-25

Tác giả: Nguyễn Huy Hoàng, Lê Thanh Toàn, Rungthip Sangpueak, Jongjit Treekoon, Chanon Saengchan, Wannaporn Thepbandit, Narendra Kumar Papathoti, Anyanee Kamkaew, Natthiya Buensanteai

Tạp chí: Polymers

Tóm tắt

Lactic Acid Fermentation of Radish and Cucumber in Rice Bran Bed

Số tạp chí 87(2022) Trang: 245-252

Tác giả: Nguyễn Minh Thủy, Hồ Thị Ngân Hà, Ngô Văn Tài

Tạp chí: Agriculturae Conspectus Scientificus

Tóm tắt

THE INFLUENCE OF PACKAGING AND STORAGE TEMPERATURE ON THE CHEMICAL COMPOSITION OF FRESH OYSTER MUSHROOMS (PLEUROTUS SAJOR-CAJU) DURING STORAGE

Số tạp chí 21(2022) Trang: 261-269

Tác giả: Nguyễn Thị Ngọc Giang, Nguyễn Minh Thủy, Tran Van Khai

Tạp chí: Acta Sci. Pol. Technol. Aliment.

Tóm tắt

An Integrated Framework of Professional Development for Vietnamese Lecturers of English as a Foreign Language

Số tạp chí 2022(2022) Trang: 1-16

Tác giả: Ngô Huỳnh Hồng Nga, Sue Cherrington, David Crabbe

Tạp chí: RELC Journal

Tóm tắt

Efficacy of Chitosan Nanoparticle Loaded-Salicylic Acid and -Silver on Management of Cassava Leaf Spot Disease

Số tạp chí 14(2022) Trang: 1-23

Tác giả: Nguyễn Huy Hoàng, Lê Thanh Toàn, Wannaporn Thepbandit, Jongjit Treekoon, Chanon Saengchan, Rungthip Sangpueak, Narendra Kumar Papathoti, Anyanee Kamkaew, Natthiya Buensanteai

Tạp chí: Polymers

Tóm tắt

Optimization of ingredient levels of reduced-calorie blackberry jam using response surface methodology

Số tạp chí 10(2022) Trang: 68-75

Tác giả: Nguyễn Minh Thủy, Huỳnh Mạnh Tấn, Ngô Văn Tài

Tạp chí: Journal of Applied Biology & Biotechnology

Tóm tắt

Dietary Effects of Carotenoid on Growth Performance and Pigmentation in Bighead Catfish (Clarias macrocephalus Günther, 1864).

Số tạp chí 7(2022) Trang: 2-16

Tác giả: Trần Thị Thanh Hiền, Trịnh Văn Lộc, Trần Lê Cẩm Tú, Trần Minh Phú, Phạm Minh Đức, Hứa Thái Nhân, Phạm Thanh Liêm

Tạp chí: Fishes

Tóm tắt

Đầu tiên Trước 19 20 21 22 23 24 25 26 27 28 Tiếp Cuối

Vietnamese | English

Tạp chí khoa học Trường Đại học Cần Thơ
Khu II, Đại học Cần Thơ, Đường 3/2, Phường Ninh Kiều, Thành phố Cần Thơ, Việt Nam
Điện thoại: (0292) 3 872 157; Email: tapchidhct@ctu.edu.vn

Chương trình chạy tốt nhất trên trình duyệt IE 9+ & FF 16+, độ phân giải màn hình 1024x768 trở lên

Vui lòng chờ...