Sep 7, 2015

Mathematical 'Gingko trees' reveal mutations in single cells that characterize diseases

Seemingly similar cells often have significantly different genomes. This is often true of cancer cells, for example, which may differ one from another even within a small tumor sample, as genetic mutations within the cells spread in staccato-like bursts. Detailed knowledge of these mutations, called copy number variations, in individual cells can point to specific treatment regimens.

The problem is that current techniques for acquiring this knowledge are difficult and produce unreliable results. Today, scientists at Cold Spring Harbor Laboratory (CSHL) publish a new interactive analysis program called Gingko that reduces the uncertainty of single-cell analysis and provides a simple way to visualize patterns in copy number mutations across populations of cells.

The open-source software, which is freely available online, will improve scientists' ability to study this important type of genetic anomaly and could help clinicians better target medications based on cells' specific mutation profiles. The software is described online today in Nature Methods.

Mutations come in many forms. For example, in the most common type of mutation, variations may exist among individual people--or cells--at a single position in a DNA sequence. Another common mutation is a copy number variation (CNV), in which large chunks of DNA are either deleted from or added to the genome. When there are too many or too few copies of a given gene or genes, due to CNVs, disease can occur. Such mutations have been linked not only with cancer but a host of other illnesses, including autism and schizophrenia.

Researchers can learn a lot by analyzing CNVs in bulk samples--from a tumor biopsy, for example--but they can learn more by investigating CNVs in individual cells. "You may think that every cell in a tumor would be the same, but that's actually not the case," says CSHL Associate Professor Michael Schatz.

"We're realizing that there can be a lot of changes inside even a single tumor," says Schatz. "If you're going to treat cancer, you need to diagnose exactly what subclass of cancer you have." Simultaneously employing different drugs to target different cancer subclasses could prevent remission, scientists have proposed.

One powerful single-cell analytic technique for exploring CNV is whole genome sequencing. The challenge is that, before sequencing can be done, the cell's DNA has to be amplified many times over. This process is rife with errors, with some arbitrary chunks of DNA being amplified more than others. In addition, because many labs use their own software to examine CNVs, there is little consistency in how researchers analyze their results.

To address these two challenges, Schatz and his colleagues created Gingko. The interactive, web-based program automatically processes sequence data, maps the sequences to a reference genome, and creates CNV profiles for every cell that can then be viewed with a user-friendly graphical interface. In addition, Gingko constructs phylogenetic trees based on the profiles, allowing cells with similar copy number mutations to be grouped together.

Importantly, Gingko, which Schatz and his colleagues validated by reproducing the findings of five major single-cell studies, also analyzes patterns in the sequence reads in order to recognize, and greatly reduce, amplification errors.

Read more at Science Daily

No comments:

Post a Comment