Application of SMILES strings to identification of functional groups responsible for biological activity in medicinal compounds
Abstract
An efficient and practical approach to identification of important functional groups in the structure of medicinal molecules that are main factor to create biological activity by use of SMILES line notation system is described. Simplicity, high proficiency and fast timing are the main of current method. In this study we aim to find an association between some of the identified functional groups, using SMILES code and their corresponding biological properties in the Canada Drug database. In this study, each functional group and its category which has been tested is presented in the corresponding number of occurrences in the category and the total number is shown as well. The p-value for each functional group – category is calculated using proportion test and R statistical software. The tabular results, the last column indicates the impact of our hypothesis for example, sulfonylurea and 5-thio-1H-tetrazole functional groups are associated with their corresponding category and are significant at 0.05 level. Penicillin and 3-aminopropane-1,2-diol are also significant in the majority of their categories. we have developed a method to create a logical and robust relationship between functional groups and biological activity of molecules. According to existing protocol, finding functional groups responsible for the biological activity of medicinal or chemical compounds is possible.
Downloads
References
Law, V., Knox, C., Djoumbou, Y., Jewison, T., Guo, A., Liu, Y., Maciejewski, A., Arndt, D., Wilson, M., Neveu, V., Tang, A., Gabriel, G. y Adamjee, C, Dame, Z., Han, B., Zhou, y Wishart, D. (2013. DrugBank 4.0: shedding new light on drug metabolism. Nucleic acids research 2013;42(D1):D1091-D7.
Lawal, B. (2014). Applied statistical methods in agriculture, health and life Sciences: Springer.
Lehmann, E., Romano, J. (2005). Testing Statistical Hypotheses (Springer Texts in Statistics). Lemke, T. (2003). Review of organic functional groups: introduction to medicinal organic chemistry: Lippincott Williams & Wilkins.
Team RC. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. 2013. ISBN 3-900051-07-0; 2014.
Weininger D. SMILES (1988). A chemical language and information system. 1. Introduction to methodology and encoding rules. Journal of chemical information and computer sciences; 28(1): 31-6.
Wishar, D., Knox, C., Guo, A., Shrivastava, S., Hassanali, M., Stothard, P., Chang, Z., Woolsey, J. (2006). DrugBank: a comprehensive resource for in silico drug discovery and exploration. Nucleic Acids Research; 34(suppl_1): D668-D72.