A Java tool for visualization of genomic aberrations using Affymetrix SNP arrays.
Contents
Müller A, Holzmann KH, Kestler HA. Visualization of genomic aberrations using Affymetrix SNP arrays. Bioinformatics 23(4): 496-497, 2007.
The IdeogramBrowser is open source software and is hosted on github. You can downlaod the source code and legacy versions at our project page. Or you can just download the latest version.
The IdeogramBrowser can be used to easily compare a set of output files from the Affymetrix Chromosome Copy Number Analysis Tool. Loss of heterozygosity (LOH) or loss/gain markers are plotted against their corresponding chromosomal region in a karyotypic representation together with genes.
IdeogramBrowser provides:
|
The software requires the Java runtime environment 1.5.0 (JRE 5.0) which can be downloaded from java.sun.com.
IdeogramBrowser is packed into a single zip archive which must be unzipped (e.g. using WinZip) into the desired directory (e.g. "C:\Program Files\" or your home directory). Call ideogram.sh in the IdeogramBrowser directory from the command line (Linux or Mac OS X) or double click ideogram.bat under Microsoft Windows in the file explorer.
When importing many files IdeogramBrowser may report an out of memory error. In these cases open ideogram.sh (Linux or Mac OS X) or ideogram.bat (Windows) with a text editor (e.g. vi, Emacs, or notepad) and change the memory size by editing the two -X options of the java call. The -X options specify the memory size (in MB) assigned to the software. These can be adapted depending on the available memory and the testset size (number of imported .cnt files). The standard settings are usually too small for the most problems (2MB for Sun Java 1.5 on Linux and 1MB on win). Replace the two parameters e.g. by -Xmx1024m -Xms1024mfor assigning 1024MB memory to the software.
IdeogramBrowser includes the following open source Java libraries:
The following steps lead to a chromosomal displayed marker set:
Import File Formats IdeogramBrowser can import two different file formats: Tabbed Text File FormatThis is a tab delimited ASCII file format with one aberration value in each line which can easily be generated by e.g. Microsoft Excel or OpenOffice.
Each marker line should contain five columns:
CHROMOSOME (1..22,X,Y)
START_BP (Integer > 0)
STOP_BP (Integer > 0)
NAME (String)
VALUE (Integer)
A short example can be downloaded: marker_example.txt. The VALUEfield represents a copy number which is defined by the following ranges:
VALUE < 0 | Illegal value |
0 < VALUE < 2 | Loss |
2 | No aberration |
VALUE > 2 | Gain |
CNAT Export File Format IdeogramBrowser can import the Affymetrix Chromosome Copy Number Analyis Tool (CNAT) output file format (.cnt files). The tab-delimited TXT format of the CNAT tool is currently not supported. A brief summary of using CNAT to generate these files is given in the CNAT manual. Example files can be downloaded: Affymetrix sample data sets (these have to be first processed by CNAT 3.0 or newer to compute the copy numbers). The Affymetrix example data sets MCF7, Ref103, SKBR3, and ZR75-30 were processed by CNAT 3.0 with the standard settings and the corresponding CNT files can be downloaded ideo-examples.zip Importing Input Files Tabbed Text File Choose the menu point Load Markers in the File menu and navigate to a file in the format described in text file format. CNAT Export File Choose the menu point Load Markers in the File menu and navigate to a file in the format described in CNT file format. Filtering The imported marker sets can be thresholded and filtered. All required parameters can be controlled via the filtering panel. Upon pressing the OKbutton the visualization is immediately adapted.
The following parameters can be adapted:
Interactive ExplorationIdeogramBrowser allows the interactive exploration of the marker sets.
The upper (ideogram) panel shows the 24 chromosomes and the imported markers. When moving the mouse slowly over the chromosomes, the corresponding genome region under the mouse is shown in a small tool tip window. Also the GeneID and gene names are shown when moving the mouse over the blue bar on the right side of the chromosomes.
When a gene is selected with the left mouse button the details of the selection are listed in the info panel below the ideogram panel. If a marker is selected (blinking) the marker info and a summary of intersecting genes is displayed on the info panel.
The Karyogram Panel
Depending how many datasets are loaded into IdeogramBrowser one marker line appears on the left and the right side of the ideogram. A data slot can be selected with the mouse by clicking on the gray marker line. Once a data slot is selected all SNPs (independent of the merge mode and the number of losses/gains or LOHs) are shown on the left side for an assessment of the coverage of the genome.
The Info Panel
The lower (info) panel shows the information for the selected items on the ideogram panel. Marker info and/or gene info is available. Clicking on the hyperlinked gene names opens a browser window displaying additional information from the www.genecards.org webpage.
Tool Tip Info Window When moving the mouse over chromosomes, genes, or markers a small tool tip window appears showing information about the chromosomal region, the genes or the SNP markers. Popup Menu
By clicking the right mouse button on a marker/marker line the popup menu opens. It provides Zoom In and Zoom Out fields for zooming into the chromosomes. Show All shows the whole chromosome. If a marker is selected additional the Remove Marker File option is shown. By choosing this command the complete marker line containing the selected marker is removed from the workbench. The context menu item Merge marker file enables to merge the currently selected marker line with another CNT file. This feature is useful for importing split files such as those originating from the CNAT analysis of 100k chips. |
Keyboard and Mouse CommandsThe ideogram panel could be controlled by clicking the folowing keys:
Saving a Karyogram Select File/Save to save the current karyogram as JPEG, PDF, or SVG (scalable vector graphics). Hint: SVG export may take a very long time.
The original version of this page is available at http://www.informatik.uni-ulm.de/ni/staff/HKestler/ideo/doc.html
PrintingSingle chromosomes or the whole ideogram can be printed via the two menu items
Hint:The printing may not work under Linux in combination with CUPS - this is a Java 1.5/CUPS problem: export the diagrams as pdf and print e.g. with Acrobat reader.
The gene/ideogram database The two database table ideogram.gz and seq_gene.md.gz are packed into the jar archive of IdeogramBrowser. The current version of this database are available on the NCBI ftp site: ftp://ftp.ncbi.nlm.nih.gov/genomes/H_sapiens/mapview/. Older versions can be found in the archives ftp://ftp.ncbi.nlm.nih.gov/genomes/H_sapiens/ARCHIVE. Just place the above mentioned files into the ideogram/data directory in the archive (unjar and jar the archive with the manifest.txtfile). As the file format of these tables changes for (nearly) every release it may be necessary to
CNT files of the Affymetrix data sets MCF7, Ref103, SKBR3, and ZR75-30:
ideo-examples.zip
This work is licensed under a Creative Commons Attribution 2.5 License.
Wissenschaftlicher Mitarbeiter (m/w/d)
"Recent Trends and Future Challenges in Learning from Data" has been published with Springer.
Our paper "Permutation-invariant linear classifiers" has been published in Machine Learning.
Our paper "Prediction of resistance to bevacizumab plus FOLFOX in metastatic colorectal cancer-Results of the prospective multicenter PERMAD trial" has been published in PLoS One.
Our paper "Segmentation-based cardiomegaly detection based on semi-supervised estimation of cardiothoracic ratio" has been published in Scientific Reports.
"Prospective study validating a multidimensional treatment decision score predicting the 24-month outcome in untreated patients with clinically isolated syndrome and early relapsing–remitting multiple sclerosis, the ProVal-MS study" has been published in Neurological Research and Practice.
Our paper "GatekeepR: an R shiny application for the identification of nodes with high dynamic impact in boolean networks" has been published online first in Bioinformatics.
Our paper "The Necessity of Interoperability to Uncover the Full Potential of Digital Health Devices" has been published in JMIR Medical Informatics.
"Multicentric pilot study to standardize clinical whole exome sequencing (WES) for cancer patients" has been published in npj Precision Oncology.
Our paper "AMBAR-interactive alteration annotations for molecular tumor boards" has been published in Computer Methods and Programs in Biomedicine.
"A protocol for the use of cloud-based quantum computers for logical network analysis of biological systems" has been published in STAR Protocols.
Our paper "A systems biology approach to define mechanisms, phenotypes, and drivers in PanNETs with a personalized perspective" has been published in npj systems biology and applications.
"Supporting SURgery with GEriatric Co-Management and AI (SURGE-Ahead): A study protocol for the development of a digital geriatrician" has been published in PLoS One.
"Self-Assessment of Having COVID-19 With the Corona Check Mhealth App" has been published in IEEE Journal of Biomedical and Health Informatics.
Our first quantum computing paper "Leveraging quantum computing for dynamic analyses of logical networks in systems biology" has been published in Patterns.