FRI > Biolab > Supplements > FragViz: Visualization of Fragmented Networks

This set of web pages provides supplementary material to the following paper submitted to Bioinformatics:

FragViz: Visualization of Fragmented Networks
Miha Stajdohar, Minca Mramor, Blaz Zupan and Janez Demsar


Abstract:

Background: Researchers in systems biology use network visualization to summarize the results of their analysis. Such networks often include unconnected components, which popular network alignment algorithms place arbitrarily with respect to the rest of the network. This can lead to misinterpretations due to the proximity of otherwise unrelated elements.

Results: We propose a new network layout optimization technique called FragViz which can incorporate additional information on relations between unconnected network components. It uses a two-step approach by first arranging the nodes within each of the components and then placing the components so that their proximity in the network corresponds to their relatedness. In the experimental study with the leukemia gene networks we demonstrate that FragViz can obtain network layouts which are more interpretable and hold additional information that could not be exposed using classical network layout optimization algorithms.

Conclusions: Network visualization relies on computational techniques for proper placement of objects under consideration. These algorithms need to be fast so that they can be incorporated in responsive interfaces required by the explorative data analysis environments. Our layout optimization technique FragViz meets these requirements and specifically addresses the visualization of fragmented networks, for which standard algorithms do not consider similarities between unconnected components. The experiments confirmed the claims on speed and accuracy of the proposed solution.

Availability: Source code is available at http://orange.biolab.si/

Supplementary information on the following topics is available:

  • Network details, five leukemia gene networks described in the article are explained in more details. Network data files and distance matrices are also attached.
  • Exploring networks in Orange, a tutorial on how to explore networks in Orange, an open-source data visualization and analysis suite.

References

1. Lee TI, Rinaldi NJ, Robert F, Odom DT, Bar-Joseph Z, Gerber GK, Hannett NM, Harbison CT, Thompson CM, Simon I, Zeitlinger J, Jennings EG, Murray HL, Gordon DB, Ren B, Wyrick JJ, Tagne J, Volkert TL, Fraenkel E, Gifford DK, Young RA: Transcriptional Regulatory Networks in Saccharomyces cerevisiae. Science 2002, 298(5594):799-804.

2. Lehner B, Fraser A: A first-draft human protein-interaction map. Genome Biology 2004, 5(9):R63.

3. Rhodes DR, Tomlins SA, Varambally S, Mahavisno V, Barrette T, Kalyana-Sundaram S, Ghosh D, Pandey A, Chinnaiyan AM: Probabilistic model of the human protein-protein interaction network. Nat Biotech 2005, 23(8):951-959.

4. McKinney BA, Reif DM, Ritchie MD, Moore JH: Machine learning for detecting gene-gene interactions: a review. Applied Bioinformatics 2006, 5(2):77-88.

5. Goh K, Cusick ME, Valle D, Childs B, Vidal M, Barabasi A: The human disease network. Proceedings of the National Academy of Sciences 2007, 104(21):8685-8690.

6. Pavlopoulos G, Wegener AL, Schneider R: A survey of visualization tools for biological network analysis. BioData Mining 2008, 1:12.

7. Batada NN, Reguly T, Breitkreutz A, Boucher L, Breitkreutz B, Hurst LD, Tyers M: Stratus Not Altocumulus: A New View of the Yeast Protein Interaction Network. PLoS Biol 2006, 4(10):e317.

8. Iorio F, Tagliaferri R, di Bernardo D: Identifying Network of Drug Mode of Action by Gene Expression Profiling. Journal of Computational Biology 2009, 16(2):241-251.

9. Fruchterman TMJ, Reingold EM: Graph drawing by force-directed placement. Software: Practice and Experience 1991, 21(11):1129-1164.

10. Kamada T, Kawai S: An algorithm for drawing general undirected graphs. Information Processing Letters 1989, 31:7-15.

11. Frick A, Ludwig A, Mehldau H: A fast adaptive layout algorithm for undirected graphs (extended abstract and system demonstration). In Graph Drawing, Springer:388-403.

12. Saris C, Horvath S, van Vught P, van Es M, Blauw H, Fuller T, Langfelder P, DeYoung J, Wokke J, Veldink J, van den Berg L, Ophoff R: Weighted gene co-expression network analysis of the peripheral blood from Amyotrophic Lateral Sclerosis patients. BMC Genomics 2009, 10:405.

13. Onay V, Briollais L, Knight J, Shi E, Wang Y, Wells S, Li H, Rajendram I, Andrulis I, Ozcelik H: SNP-SNP interactions in breast cancer susceptibility. BMC Cancer 2006, 6:114.

14. Bhavnani S, Eichinger F, Martini S, Saxman P, Jagadish H, Kretzler M: Network analysis of genes regulated in renal diseases: implications for a molecular-based classification. BMC Bioinformatics 2009, 10(Suppl 9):S3.

15. Torgerson W: Multidimensional scaling: I. Theory and method. Psychometrika 1952, 17(4):401-419.

16. Kruskal JB, Wish M: Multidimensional Scaling. Sage University Paper series on Quantitative Application in the Social Sciences 1978.

17. Walshaw C: A multilevel algorithm for force-directed graph drawing. In Graph Drawing, Springer 2000:31-55.

18. Archambault D, Munzner T, Auber D: GrouseFlocks: steerable exploration of graph hierarchy space. IEEE transactions on visualization and computer graphics14(4):900-13.

19. Archambault D, Munzner T, Auber D: TugGraph: Path-preserving hierarchies for browsing proximity and paths in graphs. 2009 IEEE Pacific Visualization Symposium 2009, :113-120.

20. von Landesberger T, Gorner M, Schreck T: Visual analysis of graphs with multiple connected components. IEEE Symposium on Visual Analytics Science and Technology 2009, :155-162.

21. Eades P, Huang M: Navigating clustered graphs using force-directed methods. Journal of Graph Algorithms and Applications 2000, 4(3):157-181.

22. Morrison A, Ross G, Chalmers M: A Hybrid Layout Algorithm for Sub-Quadratic Multidimensional Scaling. In INFOVIS '02: Proceedings of the IEEE Symposium on Information Visualization (InfoVis'02), Washington, DC, USA: IEEE Computer Society 2002:152.

23. Herman I, Melancon G, Marshall M: Graph visualization and navigation in information visualization: A survey. IEEE Transactions on Visualization and Computer Graphics 2000, 6:24-43.

24. Sokal RR, Michener CD: A statistical method for evaluating systematic relationships. University of Kansas Scientific Bulletin 1958, 28:1409-1438.

25. de Leeuw J, Mair P: Multidimensional Scaling Using Majorization: SMACOF in R. Department of Statistics, UCLA. Department of Statistics Papers 2008.

26. Golub TR, Slonim DK, Tamayo P, Huard C, Gaasenbeek M, Mesirov JP, Coller H, Loh ML, Downing JR, Caligiuri MA, Bloomfield CD, Lander ES: Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring. Science 1999, 286(5439):531-537.

27. Pagel P, Kovac S, Oesterheld M, Brauner B, Dunger-Kaltenbach I, Frishman G, Montrone C, Mark P, Stump en V, Mewes HW, Ruepp A, Frishman D: The MIPS mammalian protein-protein interaction database. Bioinformatics (Oxford, England) 2005, 21(6):832-4.

28. Tan PN, Steinbach M, Kumar V: Introduction to Data Mining. Addison Wesley, us ed edition 2005.

29. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP: From the Cover: Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. PNAS 2005, 102(43):15545-15550.

30. Huttenhower C, Haley EM, Hibbs MA, Dumeaux V, Barrett DR, Coller HA, Troyanskaya OG: Exploring the human genome with functional maps. Genome Research 2009, 19(6):1093-1106.

31. Watts DJ, Strogatz SH: Collective dynamics of 'small-world' networks. Nature 1998, 393(6684):440-442.

32. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 2000, 25:25-29.

33. Scholar EM, Calabresi P: Identification of the Enzymatic Pathways of Nucleotide Metabolism in Human Lymphocytes and Leukemia Cells. Cancer Res 1973, 33:94-103.

34. Pui CH, Evans WE: Treatment of Acute Lymphoblastic Leukemia. N Engl J Med 2006, 354(2):166-178.

35. The Hallmarks of Cancer. Cell 2000, 100:57-70.

36. White DM, Smith AG, Smith JL: Assessment of proliferative activity in leukaemic bone marrow using the monoclonal antibody Ki-67. Journal of clinical pathology 1994, 47(3):209-13.

37. Kaaijk P, Kaspers GJL, Van Wering ER, Broekema GJ, Loonen AH, Hahlen K, Schmiegelow K, Janka-Schaub GE, Henze G, Creutzig U, Veerman AJP: Cell proliferation is related to in vitro drug resistance in childhood acute leukaemia. British journal of cancer 2003, 88(5):775-81.

38. Keshava Prasad TS, Goel R, Kandasamy K, Keerthikumar S, Kumar S, Mathivanan S, Telikicherla D, Raju R, Shafreen B, Venugopal A, Balakrishnan L, Marimuthu A, Banerjee S, Somanathan DS, Sebastian A, Rani S, Ray S, Harrys Kishore CJ, Kanth S, Ahmed M, Kashyap MK, Mohmood R, Ramachandra YL, Krishna V, Rahiman BA, Mohan S, Ranganathan P, Ramabadran S, Chaerkady R, Pandey A: Human Protein Reference Database-2009 update. Nucleic acids research 2009, 37(Database issue):D767-72.

39. Demsar J, Zupan B, Leban G: Orange: From Experimental Machine Learning to Interactive Data Mining 2004, [orange.biolab.si]. [Published: Faculty of Computer and Information Science, University of Ljubljana].