22 March 2016

Modelling Static Networks in Bioinformatics Applications Using Advanced Computing

Alhadi Bustamam, June 2011
The University of Queensland

Abstract
Molecular Biology and Bioinformatics or Computational Biology have helped enrich each other and together have helped us in understanding the fundamental question in biology, ”what is life?”. They have also provided insights at the molecular level and via systems biology to improve the quality of human health/life and the cure of diseases. Starting with DNA sequences analysis, Bioinformatics has become a major tool to support far wider research areas, thus allowing us to deal with the rapid increase of biological dataset sizes, including genomes, proteomes and metabolomes. Furthermore, advanced computing technology and tools underpins the progress of supporting molecular biology and systems biology research. Meanwhile, the empirical study of networks has enlightened our understanding of many application domains, including the topology of biological systems. Many successful network algorithms have been adopted and applied in bioinformatics applications including the Markov clustering algorithm (MCL). In this thesis, we introduce some different approaches for parallel implementations of fast Markov clustering algorithm, using two different advanced computing environments and sparse data structures, in order to improve theMCL performance to enable the analysis of massive, sparse biological networks. These two approaches include: (1) the MPI-MCL implementation in the supercomputer SGI Altix symmetric multiprocessing (SMP) clusters using the message passing interface (MPI) tool, Fortran 90 language, and compressed storage row (CSR) sparse data structure; and (2) the CUDA-MCL implementation in the graphic processing unit (GPU) streaming processors on desktop and laptop computers using the computed device architecture (CUDA) tool from NVIDIA, C language, and ELLPACK-R sparse data structure. The first environment for MPI-MCL represents a classical, huge, high maintenance cost and very expensive supercomputer machines with multi-core CPUs (SMP clusters). The second one for CUDA-MCL represents a new architecture, low maintenance cost, cheaper, scalable and is widely-available in the market with massively-parallel streaming processors (many-core GPUs). Recently, the original MCL algorithm developed for clustering graphs, has been successfully adopted for clustering biological networks. The MCL is now becoming an effective algorithm for the extraction of complexes from interaction networks and also a key algorithm within Bioinformatics for determining clusters in networks. For instance, clustering protein-protein interaction (PPI) networks is helping to find genes implicated in diseases such as cancer. However, with fast sequencing and other technologies generating vast amounts of data in the biological networks, performance and scalability issues are becoming a critical limiting factor in bioinformatics applications. The highest computational cost of the MCL algorithm is in the expansion and inflation operations on the stochastic (Markov)matrix associated with a graph. In this thesis, we introduce parallel implementation approaches, especially for reducing the computational costs of the expansion and inflation parts. The results show the significant improvements of the parallel Markov clustering implementation in both the SPM clusters or the GPUs. However, with the low maintenance cost, scalable architecture and widely-availability of the GPUs unit in the market, the CUDA-MCL is considered to be the best option for improving the performance in the future. In MPI-MCL, the central construct is message passing by packaging information into a message and sending it from one process or processor to another and computing the data concurrently. To achieve an efficient MPI implementation, we evaluate different parallel schemes such as point-to-point or collective communication approaches using the Single Program Multiple Data (SPMD) model. The SPMD model is the dominant method for structuring parallel programs. We evaluate the scalability and performance of our parallel MPI-MCL using CSR sparse data format on small, medium, and large PPI-network datasets. Our results demonstrate a performance of about 80% in terms of efficiency over a wide range of processors. In addition, this good performance is highly scalable with an increasing number of processors. Meanwhile, the GPU as a new advanced computing environment which uses massively-parallel thread is becoming a very powerful, efficient and a low-cost option for achieving substantial performance gains over the CPU approaches. The CUDA-MCL uses the Single Instruction Multiple Threads (SIMT) model adopted from SPMD model. The central construct is using CUDA tool to allow the GPU to perform parallel sparse matrix-matrix computations, and parallel sparse Markov matrix normalizations which are at the heart of the clustering algorithm. The key to optimizing our CUDA Markov Clustering (CUDA-MCL) is in using the ELLACK-R sparse data format to allow for effective and fine-grain massively-parallel processing. The CUDA also allows us to use the on-chip memory of the GPU efficiently to lower the latency time, thus circumventing a major issue in other parallel computing environments, such as the Message Passing Interface. In CUDA-MCL, comparing the GPU computation times against a modern quad-core CPU on the published (relatively sparse) standard BIOGRID protein interaction networks with 5156 and 23175 nodes, speed factors of four and nine times are obtained, respectively. On the Human Protein Reference Database, the speed of the clustering of 19599 proteins was improved by a factor of seven with the GPU algorithm. However, on artificially-generated densely-connected networks with 1600 to 4800 nodes, speedups by a factor of 40 to 120 times were readily obtained. As the results show, in all cases, the GPU implementation is significantly faster than the original MCL running on a CPU. In conclusion, our parallel Markov clustering implementations have a significant performance improvement and are scalable on both multiple CPU processors using MPI and (even better) in many-core GPU streaming processors. Such approaches, especially the GPU implementation, are allowing large-scale parallel-computations on off-the-shelf desktop machines that were previously only possible on super-computing architectures. They have the potential to significantly change the way bioinformaticians and biologists compute and interact with their data. Moreover, with the economically low setup and maintenance costs and the rapid improvement and continuous support from their vendor (such as NVIDIA CUDA), GPU computing provides considerable hope for further implementations of this parallel Markov clustering algorithm.

Keywords: MCL, MPI-MCL, CUDA-MCL, PPI networks, clustering, bioinformatics, computational biology, system biology, parallel computing, GPU computing. Australian and New Zealand Standard Research Classification (ANZSRC): 060102 Bioinformatics 50%, 080301 Bioinformatics Software 25%, 080501 Distributed and Grid Systems 25%.

Share this article on:

Australia Awards in Indonesia

Modelling Static Networks in Bioinformatics Applications Using Advanced Computing

Related Articles

Quick Links

Our Programs

Contact Info

Related Articles

06 August 2019

Austempered Ductile Iron Production Technology From Base Material Produced By Ferro-Casting Industry In Indonesia

Rianti Dewi Wulansari Sulamet Ariobimo, 2003 University of Queensland Abstract The quality of... Read more
06 August 2019

10 January 2019

Postharvest Physiology Of Fresh-Cut Tomato Slices

Darwin H. Pangaribuan, 2005 University of Queensland Abstract Fresh-cut products are becoming... Read more
10 January 2019

30 October 2018

Global Issues in Adolescent Health

Sartiah Yusran, 2017 The University of Melbourne Abstract This qualitative study was conducte... Read more
30 October 2018

19 July 2018

Non-Compliance in Public Financial Management: A Case Study of a Local Government in Indonesia

Budi Cahyono Curtin University Abstract Utilising content analysis of external audit reports... Read more
19 July 2018

21 June 2018

Predictors Of Employees’ Intention To Whistleblow Using Theory Of Planned Behaviour: A Case Study Of An Indonesian Government Department

Bitra Suyatno Victoria University Abstract The purpose of this study was to facilitate opport... Read more
21 June 2018

28 May 2018

Decentralization and Development: The case of Talaud Islands Regency, Indonesia, 2004-2007

Johannes Aldrin Timbuleng Flinders University Abstract The correlation between decentralizati... Read more
28 May 2018

04 May 2018

Tsunami Vulnerability Assessment in Mandurah (Western Australia)

Hermina Manlea The University of Western Australia Abstract A novel tsunami vulnerability as... Read more
04 May 2018

04 May 2018

In Pursuit of Sustainable Alternative Online Media for Indonesia: Lessons from Australia

Ika Krismantari Monash University The research looks at the business models of three leading alt... Read more
04 May 2018

15 March 2018

Marriage Payment, Social Change and Women’s Agency Among Bimanese Muslims of Eastern Indonesia

Atun Wardatun Western Sydney University Abstract This thesis draws on ethnographic research t... Read more
15 March 2018

12 March 2018

Development of a Cost-Effective River Water Quality Index: A Case Study of West Java Province, Indonesia

Arief Dhany Sutadian Victoria University, Australia Abstract Having good water quality is imp... Read more
12 March 2018

08 February 2018

Environmental Factors and an Eco-epidemiological Model of Malaria in Indonesia

Ermi M. L. Ndoen Griffith University Abstract Indonesia is one of the countries in Southeast... Read more
08 February 2018

24 January 2018

Experience of Indonesian Pre-Service English As Foreign Language Teachers in Implementing Technology in Teaching Practicum: An Investigation Through Tpack Framework

Effendi Limbong Flinders University Abstract The increasing presence of ICT in foreign learni... Read more
24 January 2018

24 January 2018

The Physiological and Metabolic Effects of Stressors Associated with Long Duration Transportation on Male Bos Indicus Cattle

Cardial Leverson Octovianus, Leo-Penu James Cook University Abstract A series of experiments... Read more
24 January 2018

18 January 2018

Field Evaluation and Modelling of Water and Nitrogen Management Streategies in Tropical Lowland Rice–Based Production Systems

Ahmad Suriadi University of Southern Queensland Abstract With increased competition f... Read more
18 January 2018

29 December 2017

The Indonesian Gender Responsive Budgets Policy: Examining The Effectiveness of The GRB Initiative in Indonesia

Salbiyah, 2015 Flinder University Abstract This research aims to make contribution to bo... Read more
29 December 2017

14 December 2017

Quality Assurance in The Six State Islamic Universities in Indonesia Developing a Desirable Model of Quality Assurance for Islamic State Universities

Jejen Jaenudin, 2016 University of New South Wales Abstract In this study I address the probl... Read more
14 December 2017

06 December 2017

Synthesis and Structure of Metal Complexes and Coordination Polymers of 3-Pyrazol-1-yl Based Ligands

Yuniar Ponco Prananto, 2009 Monash University Abstract The objectives of this research are to... Read more
06 December 2017

04 December 2017

Analysis of Strategies of Reduction Diarrhea in Rural Areas Based On the Comprehensive Primary Health Care Concepts for Implementing in Rural Central Sulawesi Province, Indonesia

Tasnim, 2009 The Flinders University of South Australia Objectives: Diarrhoea remains a leading... Read more
04 December 2017

28 November 2017

CHANGING CONNECTIONS: Ontogenetic Ecophysiology of Secondary Hemi-epiphytic Vines

Yansen, 2012 James Cook University Abstract The objectives of this research are to explore th... Read more
28 November 2017

27 November 2017

Conservation of a Critically Endangered Orchid Drakaea Elastica Lindl. in the Context of Nutritional Requirements and Saprophytic Competency of the Mycorrhizal Fungus and its Propagation

Siti Nurfadilah, 2010 The University of Western Australia Abstract Drakaea elastica is a crit... Read more
27 November 2017

27 November 2017

Within-host Deterministic Modelling of Artesunate Resistance in Plasmodium falciparum

Marselinus Ulu F, 2015 The University of Melbourne Abstract A deterministic within-host pharm... Read more
27 November 2017

13 November 2017

English Language Lecturers' communication Strategies : A Case Study in Aceh Province, Indonesia

Muhammad Aulia, 2016 University of Technology Sydney Abstract This research critically analys... Read more
13 November 2017

21 August 2017

Eva Rahmi Kasim: Make Indonesia a Home for All

In 2016, the Indonesian government passed a new law supporting the needs of people with disabilities... Read more
21 August 2017

18 July 2017

Being Muslim in Bima of Sumbawa, Indonesia: Practice, Politics and Cultural Diversity By Muhammad Adlin

Muhammad Adlin Sila, 2014 The Australian National University Abstract This thesis argues that... Read more
18 July 2017

18 July 2017

Masculinities, Islam and Domestic Violence in Java

Rachmad Hidayat, 2009 Monas University Abstract This research explores the links between masc... Read more
18 July 2017

18 July 2017

Muslim Masculinities In Australia

Rachmad Hidayat, 2015 Monash University Abstract This study examined how migrant Muslim men&r... Read more
18 July 2017

18 July 2017

Identifying Shocks on the Economic Fluctuations in Indonesia and US: The Role of Oil Price Shocks in a Structural Vector Autoregression Model

Alfan Mansur, 2011 The Australian National University Abstract In this paper the sources of t... Read more
18 July 2017

18 July 2017

Effective Community Engagement for Climate Change Adaptation in Indonesia

Meuthia Alvernia Naim, 2013 Griffith University Abstract In regard to the highly vulnera... Read more
18 July 2017

18 July 2017

Evaluation of Performance Auditing in Indonesia: A Critical Systemic Approach to Addressing Public Accountability

Agus Bambang Irawan, 2015 Flinders University Abstract The research explores the concept of p... Read more
18 July 2017

18 July 2017

A Case Study Of The Use Of A Competency Framework In The Australian Army For Performance Management And Development

Eri Radityawara Hidayat, 2005 University of Sydney Abstract To improve staff performance in t... Read more
18 July 2017

Modelling Static Networks in Bioinformatics Applications Using Advanced Computing

Related Articles

Austempered Ductile Iron Production Technology From Base Material Produced By Ferro-Casting Industry In Indonesia

Postharvest Physiology Of Fresh-Cut Tomato Slices

Global Issues in Adolescent Health

Non-Compliance in Public Financial Management: A Case Study of a Local Government in Indonesia

Predictors Of Employees’ Intention To Whistleblow Using Theory Of Planned Behaviour: A Case Study Of An Indonesian Government Department

Decentralization and Development: The case of Talaud Islands Regency, Indonesia, 2004-2007

Tsunami Vulnerability Assessment in Mandurah (Western Australia)

In Pursuit of Sustainable Alternative Online Media for Indonesia: Lessons from Australia

Marriage Payment, Social Change and Women’s Agency Among Bimanese Muslims of Eastern Indonesia

Development of a Cost-Effective River Water Quality Index: A Case Study of West Java Province, Indonesia

Environmental Factors and an Eco-epidemiological Model of Malaria in Indonesia

Experience of Indonesian Pre-Service English As Foreign Language Teachers in Implementing Technology in Teaching Practicum: An Investigation Through Tpack Framework

The Physiological and Metabolic Effects of Stressors Associated with Long Duration Transportation on Male Bos Indicus Cattle

Field Evaluation and Modelling of Water and Nitrogen Management Streategies in Tropical Lowland Rice–Based Production Systems

The Indonesian Gender Responsive Budgets Policy: Examining The Effectiveness of The GRB Initiative in Indonesia

Quality Assurance in The Six State Islamic Universities in Indonesia Developing a Desirable Model of Quality Assurance for Islamic State Universities

Synthesis and Structure of Metal Complexes and Coordination Polymers of 3-Pyrazol-1-yl Based Ligands

Analysis of Strategies of Reduction Diarrhea in Rural Areas Based On the Comprehensive Primary Health Care Concepts for Implementing in Rural Central Sulawesi Province, Indonesia

CHANGING CONNECTIONS: Ontogenetic Ecophysiology of Secondary Hemi-epiphytic Vines

Conservation of a Critically Endangered Orchid Drakaea Elastica Lindl. in the Context of Nutritional Requirements and Saprophytic Competency of the Mycorrhizal Fungus and its Propagation

Within-host Deterministic Modelling of Artesunate Resistance in Plasmodium falciparum

English Language Lecturers' communication Strategies : A Case Study in Aceh Province, Indonesia

Eva Rahmi Kasim: Make Indonesia a Home for All

Being Muslim in Bima of Sumbawa, Indonesia: Practice, Politics and Cultural Diversity By Muhammad Adlin

Masculinities, Islam and Domestic Violence in Java

Muslim Masculinities In Australia

Identifying Shocks on the Economic Fluctuations in Indonesia and US: The Role of Oil Price Shocks in a Structural Vector Autoregression Model

Effective Community Engagement for Climate Change Adaptation in Indonesia

Evaluation of Performance Auditing in Indonesia: A Critical Systemic Approach to Addressing Public Accountability

A Case Study Of The Use Of A Competency Framework In The Australian Army For Performance Management And Development

Recruiting and Maintaining SME Involvement when Designing Voluntary Inter-Organisational IS

Improving the Quality of Pearls from Pinctada Maxima

The Impact Of A Gender Quota On Gender Equality In The Indonesian Parliament During The Consolidation Of Democracy

Biological Oceanography of Larval Fish Diversity and Growth off Eastern Australia

Characterisation of the Innate Immune Responses of Marron

Apoplastic Water Fraction and Rrehydration Techniques introduce Significant Errors in Measurements of Relative Water Content and Osmotic Potential in Plant Leaves

Patients’ Experience of Using Primary Care Services in the Context of Indonesian Universal Health Coverage Reforms

Young New Smoker Response to Pictorial Health Warnings on Cigarette Packages in Indonesia

An Exploration Of Linguistic Politeness Phenomena As A Reflection Of The Interplay Between Language And Power

Teacher Professional Development in Indonesia: The Influences of Learning Activities, Teacher Characteristics and School Conditions

Studies conducted in a Western context have shown that there are multiple factors coming into play to make Teachers Professional Development (TPD) a strategic and powerful tool for improving teacher instructions. However, there have been few studies in In

Accommodating English, Islam, and Secular Values: An Exploration of Pre-service English Teacher Education Curriculum in Islamic and Secular Public Universities in Indonesia

Corporate Social Disclosures by Indonesian Listed Companies

Assessing the impact of a marine protected area on coastal livelihoods: A case study from Pantar Island, Indonesia

Understanding Thermophilic Spore-forming Bacteria in Milk Powders

Indonesian Teachers’ Perspectives on the Impact of Child Work on Students’ Learning Outcomes

Dividends Family Ownership: Evidence from Indonesia

Insights into Photocatalytic Oxidation by Bare and Platinised TiO2: The Impact of Adding Hydroxyl Functional Groups to Butanedioic Acid and Propanol

Landscape Scale Carbon Stock Assessment of Tropical Peat Swamp Forests Using an Integrated Field Measurement and Remote Sensing Technique: A Case Study in PT Diamond Raya Timber, Rokan Hilir District, Riau Province, Indonesia

Genetic Diversity and Potential High Temperature Tolerance in Brassica Rapa L

From Institute of Teacher Training and Education (IKIP) to Makassar State University (UNM): Power Struggles in the Field of Teacher Education in Indonesia

Suppression Subtractive Hybridization to Investigate Viruses in the Lymphoid Organ of Penaeus Merguiensis and the Gills of Cherax Quadricarinatus

Countercyclical Fiscal Policy in Indonesia

Challenges and Opportunities in the Delimitation of Indonesia’s Maritime Boundaries: A Legal and Technical Approach

Towards an Integrated Coastal Disaster Management Framework: Bridging Conceptual and Practical Applications Using the Indonesian Legal and Planning Context as a Case Study

State Structural and Institutional Transformation and the New Dynamics of Business Power, Corruption and Clientelism In Indonesia

Associations between Community Orofacial Pain and Experimental Orofacial Pain with Physical, Social and/or Psychological Variables

Building Resilience to Disasters and Climate Change: Pathways for Adaptive and Integrated Disaster Resilience in Indonesia

Determining Features of Aerobic Granular Sludge Formation, Stability and Performance in Sequencing Batch Reactors

Reform Movements and Local Politics in Indonesia

Hospitalised Abortions in Yogyakarta: Characteristics and Implications

Studies on the Role of Gonadotropin-Inhibitory Hormone (GnIH) in the Neuroendocrine Regulation of Reproduction in The Sheep

Instructional Leadership in Indonesian School Reform: Local Perceptions and Practices

Visual Data Exploration of Temporal Cluster Changes Using Self-Organizing Maps

The Implementation Gap: Financial Management Reform in Indonesia 2003-2010

The Effects of Shrub Removal and Grazing on Vegetation and Soils in A Shrub-Encroached Australian Woodland

The Relationship between Political and Business Actors: Comparative Studies of Decentralised, Post-Authoritarian Indonesia and Russia

Spatial and Temporal Variations in the Response of the Vegetation Indices to Surface Temperature

Decentralization and Development Planning in Indonesia: A Case Study of Two Districts in Lombok

Policy Development for Effective Transitions to Climate Change: Adaptation at the Indonesian Local Government Level

Framework, Approach and System of Intelligent Fault Tree Analysis for Nuclear Safety Assessment

A Sensitivity Comparison of Neuro-fuzzy Feature Extraction Methods from Bearing Failure Signals

Studies of the Taxonomy of Banana Blood Disease Bacterium and Related Bacteria

In Vitro Digestion Study of Factors Affecting Sugar Availability from Fruits and Vegetables

Embodying the True Islam: Face-veiled Women in Contemporary Indonesia

Ecology of Palms in Response to Cyclonic Disturbances in North Queensland, Australia

Women interrupted: Determinants of Women’s Employment Exit and Return in Indonesia

Essays on the Economics of Education in Indonesia