Big Data techniques find biomarkers for ASD
An algorithm based on levels of metabolites found in a blood sample can accurately predict whether a child is on the Autism spectrum of disorder (ASD), based upon a recent study. The algorithm, developed by researchers at Rensselaer Polytechnic Institute, is the first physiological test for autism and opens the door to earlier diagnosis and potential future development of therapeutics.
"Instead of looking at individual metabolites, we investigated patterns of several metabolites and found significant differences between metabolites of children with ASD and those that are neurotypical. These differences allow us to categorise whether an individual is on the Autism spectrum," said Juergen Hahn, lead author, systems biologist, professor, and head of the Rensselaer Department of Biomedical Engineering.
"By measuring 24 metabolites from a blood sample, this algorithm can tell whether or not an individual is on the Autism spectrum, and even to some degree where on the spectrum they land."
Big data techniques applied to biomedical data found different patterns in metabolites relevant to two connected cellular pathways (a series of interactions between molecules that control cell function) that have been hypothesised to be linked to ASD: the methionine cycle and the transulfuration pathway.
The methionine cycle is linked to several cellular functions, including DNA methylation and epigenetics, and the transulfuration pathway results in the production of the antioxidant glutathione, decreasing oxidative stress.
Autism Spectrum Disorder is estimated to affect approximately 1.5% of individuals and is characterised as "a developmental disability caused by differences in the brain," according to the Centers for Disease Control and Prevention. The physiological basis for ASD is not known, and genetic and environmental factors are both believed to play a role.
People with ASD "may communicate, interact, behave, and learn in ways that are different from most other people." According to the CDC, the total economic costs per year for children with ASD in the United States are estimated between $11.5 billion and $60.9 billion.
Research shows that early intervention can improve development, but diagnosis currently depends on clinical observation of behavior, an obstacle to early diagnosis and treatment. Most children are not diagnosed with ASD until after age 4 years.
Hahn's research, titled "Classification and Adaptive Behavior Prediction of Children with Autism Spectrum Disorder based upon Multivariate Data Analysis of Markers of Oxidative Stress and DNA Methylation," appears in PLOS Computational Biology, an open access journal published by the Public Library of Science.
In the article, Hahn describes an application of Fisher Discriminant Analysis - a big data analysis technique - to data from a group of 149 people, about half on the Autism spectrum. Deliberately omitting data from one of the individuals in the group, Hahn subjects the dataset to advanced analysis techniques, and uses the results to generate a predictive algorithm.
The algorithm then makes a prediction about the data from the omitted individual. Hahn cross-validated the results, swapping a different individual out of the group and repeating the process for all 149 participants. His method correctly identified 96.1% of all neurotypical participants and 97.6% of the ASD cohort.
"Because we did everything possible to make the model independent of the data, I am very optimistic we will be able to replicate our results with a different cohort," said Hahn, a member of the Rensselaer Center for Biotechnology and Interdisciplinary Studies (CBIS). "This is the first physiological diagnostic and it's highly accurate and specific."
Researchers have looked at individual metabolites produced by the methionine cycle and the transulfuration pathways and found possible links with ASD, but the correlation has been inconclusive. Hahn said the more sophisticated techniques he applied revealed patterns that would not have been apparent with earlier efforts.
"A lot of studies have looked at one biomarker, one metabolite, one gene, and have found some differences, but most of the time those differences weren't statistically significant or the results could not be reliably replicated," Hahn said. "Our contribution is using big data techniques that are able to look at a suite of metabolites that have been correlated with ASD and make statistically a much stronger case."
"Professor Hahn's innovative work to improve the diagnosis for Autism as well as other ongoing efforts to advance diagnostics and develop new treatments for Alzheimer's and neurodegenerative diseases at CBIS show how new breakthroughs in health care are possible when we focus our work at the interface between traditional boundaries," said Deepak Vashishth, director of the CBIS.
Hahn's interdisciplinary research at Rensselaer combines systems engineering, applied mathematics, and computer science to develop new ways of analysing nonlinear systems found in biological or chemical processes. He began investigating ASD about two years ago. His recently published work on ASD offers additional insights into the disorder.
For example, a recent paper published in the journal PLOS ONE with the research group of Arizona State University Professor James Adams made use of the same big data analysis techniques and showed a difference in the excretion of certain elements in the urine between children on the Autism spectrum and their neurotypical peers; these results further strengthen the findings as differences in the transulfuration pathway can be responsible for differences seen in the urine.
Furthermore, work published in the Journal of Theoretical Biology with the research group of Arkansas Children's Hospital researcher Jill James found differences in the parameter distribution functions of reaction rates in the methionine cycle and the transulfuration pathway.
The full results of Hahn's work on ASD diagnosis are publicly available and Hahn is hopeful his work will lead to a widely available test that can support early diagnosis, although he does not intend to commercialise his results.
For Hahn, the next step is to replicate the results with a new cohort working with his clinical collaborators. In the long run, Hahn hopes the model and diagnostic tool will aid in developing treatment options.
"If these pathways are different, what happens if I can manipulate the pathway so that it works similarly to the neurotypical ones?" said Hahn. "What do I need to prod? Which molecules do I need to add or take away? Having a model that describes these pathways makes it a lot easier to adjust them."