<b>Human Colon Cancer Drug Release Nanoparticle Systems Design: </b><b>Dataset Compilation and Machine Learning Modeling</b>
- Munteanu, Cristian R.
- He, Shan
- de Bilbao, Begoña
- Bediaga, Harbil
- Casanola-Martin, Gerardo M.
- Ascencio, Estefania
- Chelu, Mariana
- Magdalena Musuc, Adina
- Arrasate, Sonia
- Pazos Sierra, Alejandro
- Rasulev, Bakhtiyor
- González-Díaz, Humberto
Resum
Nanoparticles (NPs) are interesting for Human Colon Cancer (HCC) therapy. In these systems either the Drug (D) and/or the NP <i>per se</i>, by exert biological activity. However, the high number of combinations of HCC cell lines, anti-cancer drugs, and/or NP to be tested slows down the assay process of new HCCNP systems. Nevertheless, the data scarcity and high complexity of these systems make difficult AI/ML studies. Here, a new dataset of HCCNP systems was compiled from literature. Herein 11 different Artificial Intelligence/Machine Learning (AI/ML) algorithms were used to seek the predictive models. The LDA and Random Forest (RF) models showed high values of sensitivity and specificity (> 0.9) in training/validation series and 3-fold cross validation respectively. The new AI/ML models are able to predict 14 output properties (CC<sub>50 </sub>(µM), EC<sub>50 </sub>(µM), Inhibition (%), <i>etc</i>.) for all combinations of 54 different NP cores classes <i>vs</i>. 15 different coats and <i>vs</i>. 41 different cell lines allowing to short list the more interesting results for experimental assays. It may reduce the cost of the traditional trial and error procedures.