Dr Y-H Taguchi – In Silico Drug Discovery for COVID-19 Using an Unsupervised Feature Extraction Method

Sep 8, 2021 | Life Sciences & Biology, Medical & Health Sciences

In silico drug discovery is useful for screening and identifying large numbers of drug candidate compounds in a way that is not possible using classical experimental approaches. Dr Y-H Taguchi at Chuo University, Japan, has developed a computational technique known as ‘tensor decomposition-based unsupervised feature extraction’. He has successfully applied this as an in silico phenotype-based drug discovery method to repurpose known drugs for severe acute respiratory syndrome coronavirus 2 and has successfully identified various known anti-viral drugs as viable candidates for the successful treatment of COVID-19.

A Mathematical Framework for In Silico Drug Discovery

Since January 2020, the COVID-19 pandemic has critically affected communities worldwide, prompting scientists to identify new, effective drugs that could tackle the disease. To repurpose old drugs toward the treatment of COVID-19, we must first understand the mechanism by which SARS-CoV-2 successfully invades human cells, causing the onset of disease. Driven by advances in information technology, a new approach, known as in silico experimentation, has generated reports of a large number of candidate drug compounds that may be useful for treating COVID-19. In biomedical research, an in silico experiment is one that is conducted with the aid of computer simulations.

Dr Y-H Taguchi and his team from the Department of Physics, Chuo University, Japan, have developed computational techniques that can support in silico experimentation, allowing researchers to predict the function of proteins, discover potential drug-like molecules and identify disease-causing genetic mutations.

Since disease alters gene expression, it is not surprising that there are specific sets of genes for which altered expression patterns can act as biomarkers to identify the presence of disease and estimate disease progression. Dr Taguchi and his collaborators had previously used a mathematical method known as ‘tensor decomposition (TD)-based unsupervised feature extraction (FE)’ and applied it to a gene expression profile dataset obtained from mouse liver infected with the mouse hepatitis virus, regarded by many as a suitable model of human coronavirus infection. The results of the study were recently published in April 2021.

The main purpose of the methods developed by Dr Taguchi is to perform feature selection, which means selecting a small or limited number of critical variables from a very large number of variables. Feature selection strategies can be classified into supervised ones and unsupervised ones. Generally, supervised strategies are more popular than unsupervised ones. This is because the purpose of feature selection is usually clear to the user. Despite this, the use of unsupervised feature selections provides a better choice when class labels for large sets of data are unclear or unavailable.

In September 2020, Dr Taguchi’s team published the results of the successful application of an unsupervised strategy able to predict anti-COVID-19 drug candidate compounds without prior knowledge of effective known compounds. The team analysed the gene expression profiles of multiple lung cancer cell lines infected with SARS-CoV-2, in the presence or the absence of several antiviral drugs. All the gene expression profiles were obtained from a public database.

SBDD can find drug candidate compounds in the absence of structural similarity to known drugs and requires massive computational resources for ‘docking’ simulations between compounds and proteins. Dr Taguchi’s TD-based unsupervised FE approach successfully overcame the limitations associated with SBDD, predicting a set of effective drug candidate compounds that are able to treat COVID-19.

Tensor Decomposition as a Feature Extraction Method

One classic approach used to identify significant variables is to conduct a statistical test. This test would compute the probability that a desired property can appear by chance rather than being associated with a specific feature. For example, if the alteration of a gene, or set of genes, follows the onset of disease, the probability of it happening by chance would be rather small. In scenarios where there are very large numbers of variables and a small number of observations, as in genomic science, this strategy often fails. To perform feature selection in these scenarios, Dr Taguchi has successfully applied a mathematical approach known as tensor decomposition.

Tensors are a feature of linear algebra and are at the top of a hierarchy that includes scalars, vectors and matrices. Scalars are simple numerical values, such as the mass of an object or the price of an item for sale. Vectors are composed of a set of scalars. The elements that make up vectors are represented by adding a suffix to scalars, e.g., xj, where x is the scalar value and j is a suffix that represents a whole number. This means that the value of the vector depends on both x and j.

As vectors are composed of scalars, matrices, X, are composed of x vectors. Any x vector belonging to a matrix will have to suffixes j and i (xij). For example, the ‘j’ component of vectors in a matrix could be an item such as ‘Bread’, ‘Fish’, or ‘Pork’, which can vary in value within certain categories denoted as ‘i’, with i1, for example, being ‘Mass’, i2 being ‘Price’, i3 being ‘Calories’.

As vectors are composed of scalars and matrices are composed of vectors, tensors can be composed of matrices. Suppose we have some samples of foods in two different shops. Now, we can define a tensor, Xijk, that describes the jth feature, attributed to the ith food, in the kth shop.

The technique of tensor decomposition can be applied to a large number of experimental conditions. For example, if gene expression is measured for various tissues taken from different patients, gene expression is better represented, not in a matrix, but as a tensor, where patients vs tissues vs genes, are the parameters that define the tensor.

Ivermectin: A Promising COVID-19 Treatment

TD-based unsupervised FE was applied to the gene expression profiles of multiple lung cancer cell lines infected with SARS-CoV-2. Five cell lines underwent two different treatments: one with SARS-CoV-2 and one with a ‘mock treatment’. There were 30 samples in the end, as each pair cell line/treatment was analysed in triplicate (5 cell lines x 2 treatments x 3 replicates = 30 samples). Since there is currently a lack of known drugs that are effective in treating SARS-CoV-2, a ligand based drug discovery approach would not be useful because it is based on the known structures of compounds. On the other hand, SBDD requires massive computational resources, like supercomputers, whereas Dr Taguchi’s method can be performed with standard computational servers that can be purchased even with reduced budgets.

The researchers identified several candidate compounds that could significantly alter the expression of the 163 genes selected by TD-based unsupervised FE. The 163 selected genes are all responsible for expressing proteins that significantly interact with the proteome of the SARS-CoV virus, which is closely related to SARS-CoV-2. Numerous drugs were successfully identified, especially antiviral drugs, including fluticasone, atorvastatin, gentamicin, among many others. The screening process detected ivermectin as the promising treatment for COVID-19. Ivermectin, which was previously identified as an anti-parasite drug, was recently included in clinical trials for SARS-CoV-2.

Summing up: Remarkable Progress

Dr Taguchi and his collaborators proposed an advanced unsupervised machine learning method for identifying numerous promising drug candidate compounds that could treat COVID-19 infection. When applied to the expression profiles of a pool of genes from lung cancer cell lines infected by SARS-CoV-2, the method identified numerous drug compounds that significantly altered the expression of the genes, indicating a change in the progression of the disease. The study was aimed at consolidating a similar strategy previously employed by Dr Taguchi to understand the infectious process of mouse hepatitis virus, a well-studied model for COVID-19.

In order to confirm the significance of the 163 genes in the context of human disease, Dr Taguchi and his collaborators compared the genes with those identified to be interacting with SARS-CoV-2 in humans. The 163 genes identified in this study turned out to be associated with human genes previously reported to interact with the SARS-CoV-2 proteome, contributing to disease progression.

Although ivermectin was recently reported to inhibit the replication of SARS-CoV-2 in vitro, to Dr Taguchi’s knowledge, his team was the first to report the in silico detection of ivermectin as a possible SARS-CoV-2 drug through an unsupervised feature extraction method. Most in silico drug discovery methods are supervised strategies that require known target-drug relations or drug-disease relations, which are currently not available for SARS-CoV-2. Furthermore, as ivermectin was first identified as an anti-parasite drug, no previous supervised in silico approach considered it, confirming the remarkable effectiveness of the unsupervised approach devised by Dr Taguchi and his collaborators.

Reference
https://doi.org/10.33548/SCIENTIA727

Meet the researcher


Dr Y-h. Taguchi

Department of Physics
Chuo University
Tokyo
Japan

Dr Y-h. Taguchi obtained his PhD in the theory of statistical mechanics of spin systems, from the Tokyo Institute of Technology in 1988. In the same year, he started his academic career as Assistant Professor at the Department of Physics at the Tokyo Institute of Technology. In 1997, he joined the Department of Physics at Chuo University, Tokyo, where he became Full Professor in 2006. Dr Taguchi’s most recent research interest revolves around the development of tensor decomposition methods applied to bioinformatics, particularly in relation to proteomics and gene expression patterns. Dr Taguchi has published a monograph and several peer-reviewed publications. As an outstanding scientist in his field, Dr Taguchki has received numerous prestigious honours and awards for his contributions to bioinformatics.

CONTACT

E: tag@granular.com

W: https://researchmap.jp/Yh_Taguchi/

Twitter: @Yh_Taguchi

KEY COLLABORATORS

Dr Turki Turki, King Abdulaziz University, Jeddah

FUNDING

KAKENHI (grant numbers 19H05270, 20H04848 and 20K12067)

Deanship of Scientific Research at King Abdulaziz University, Jeddah (grant number KEP-8-611-38)

FURTHER READING

YH Taguchi, T Turki, Application of Tensor Decomposition to Gene Expression of Infection of Mouse Hepatitis Virus Can Identify Critical Human Genes and Effective Drugs for SARS-CoV-2 Infection, IEEE Journal of Selected Topics in Signal Processing, 2021, 15(3), 746–758.

YH Taguchi, T Turki, A new advanced in silico drug discovery method for novel coronavirus (SARS-CoV-2) with tensor decomposition-based unsupervised feature extraction, PLoS ONE, 2020, 15(9), e0238907.

YH Taguchi, Unsupervised feature extraction applied to bioinformatics: PCA and TD based approach, 2020, Switzerland: Springer International.

Want to republish our articles?

 

We encourage all formats of sharing and republishing of our articles. Whether you want to host on your website, publication or blog, we welcome this. Find out more

Creative Commons Licence
(CC BY 4.0)

This work is licensed under a Creative Commons Attribution 4.0 International License. Creative Commons License

What does this mean?

Share: You can copy and redistribute the material in any medium or format

Adapt: You can change, and build upon the material for any purpose, even commercially.

Credit: You must give appropriate credit, provide a link to the license, and indicate if changes were made.


More articles you may like

Dr Bernhard Lüscher – Dr Patricia Korn | The Role of ADP-ribosylation in Innate Immunity

Dr Bernhard Lüscher – Dr Patricia Korn | The Role of ADP-ribosylation in Innate Immunity

Innate immunity, our standing defence against viruses, bacteria, and other infections, is comprised of a vast network of biomolecules working in orchestrated balance to defend against pathogens. Researchers have theorised that ADP-ribosylation – where ADP-ribose groups are added to modify host and viral biomolecules – may constitute an active part of the innate immune system. Dr Bernhard Lüscher and Dr Patricia Korn from RWTH Aachen University lead a team that studies the role of ADP-ribosylation and viral function.

Professor Sung-Hou Kim | New Insights into Ethnic and Genomic Diversity

Professor Sung-Hou Kim | New Insights into Ethnic and Genomic Diversity

Does our ethnic diversity translate to genomic diversity? New findings suggest that it might not and point instead to considerable genomic similarities across multiple ethnicities. Professor Sung-Hou Kim at the University of California, Berkeley, and his colleagues classified 164 ethnic groups into 14 genomic clusters spread across various geographical regions. Their findings reveal important new insights into our shared human genetic heritage.

Professor Andrei Khrennikov – Professor Emmanuel Haven | How Quantum-like Models Illuminate Complex Systems

Professor Andrei Khrennikov – Professor Emmanuel Haven | How Quantum-like Models Illuminate Complex Systems

Professor Andrei Khrennikov (Linnaeus University – Sweden) and Professor Emmanuel Haven (Memorial University of Newfoundland – Canada) utilise the mathematical framework of quantum theory to offer novel insights into the complexities of biological systems, cognition, decision-making, and other areas of social science, such as economics and finance. Their work showcases the potential of quantum-like models to decode the intricacies of non-physical systems, paving the way for innovative approaches to understanding the dynamics of life and thought, social processes and financial markets.

Professor Aristides Marcano Olaizola | Innovation in the Production of Singlet Oxygen

Professor Aristides Marcano Olaizola | Innovation in the Production of Singlet Oxygen

A novel method for producing singlet oxygen via stimulated Raman scattering (SRS) offers an innovative alternative to traditional photosensitizer-based techniques, promising safer and more efficient applications in disinfection, cancer treatment, and beyond. Professor Aristides Marcano Olaizola at Delaware State University is driving innovation in this critical field.