Development of a New Platform Through Distributed Storage System for the Bioinformatics Analysis

Ali Osman CIBIKDIKEN, Mucahit BUYUKYILMAZ, Mehmet Akif NACAR, Ahmet Ercan TOPCU, Yusuf TAMBAG, Abdullah KARADAG

Abstract


Data security has become a major issue in recent years. It becomes a requirement to store the sensitive data with at least two backup systems in geographically distributed data centers in order to provide the security. In addition, there is a huge need for the computational power to analyze the big data generated through various high-throughput multiomics applications such as next-generation sequencing and mass spectrometry platforms. In this study, a geographically distributed asynchronous replication application and architecture, called LUNGBASE, based on the OpenStack Swift has been introduced. This study enables to replicate the data generated through the LUNGMARK project, a lung cancer molecular profiling study through bioinformatics-supported integrative multiomics analysis, carried out at the Molecular Oncology Laboratory of the TUBITAK-Marmara Research Center (MRC), Genetic Engineering and Biotechnology Institute (GEBI), in different geographical regions. In this paper, we proposed an architecture and application model that can perform an effective file transfer operations and smart and automated data movement on distributed environment.

Keywords


Object storage, Data replication, Cloud computing, Bio informatics


DOI
10.12783/dtcse/aiie2017/18233

Refbacks

  • There are currently no refbacks.