Storage aware data management system for Genomics

Conference paper


Shah, Z. and Farid, M. 2024. Storage aware data management system for Genomics. 5th International Conference on Big-data Service and Intelligent Computation. ACM Press. https://doi.org/10.1145/3633624
AuthorsShah, Z. and Farid, M.
TypeConference paper
Abstract

In recent years, nucleotide sequencing has become increasingly instrumental in both research and clinical settings. This has led to explosive growth in sequencing data produced worldwide along with an increase in complex analysis algorithms. As the amount of data and analysis increases, so does the need for automated solutions for processing and analysis. The concept of workflows has gained favor in the bioinformatics community, but there is little in the scientific literature describing end-to-end operational automation systems. We provided an automation system that aims at providing a solution to the genomics related operational challenges that face sequencing of both research and clinical facilities. We built on existing open-source technologies, with a modular design allowing for a community-driven effort to create plug and play services. In this research, we describe the system and elaborate on the underlying conceptual framework. Which can be reduced to 3 conceptual levels: Data tagging (using metadata automation), Classifying Storage systems (the steps involved in the classification of storage systems), and execution (using a series of rules to move data around on an operational level).

Keywordsnucleotide sequencing; clinical settings ; bioinformatics
Year2024
Conference5th International Conference on Big-data Service and Intelligent Computation
PublisherACM Press
Digital Object Identifier (DOI)https://doi.org/10.1145/3633624
Web address (URL)https://dl.acm.org/doi/abs/10.1145/3633624.3633628
Journal citationpp. 23 - 27
ISBN 9798400708923
File
License
File Access Level
Open
Output statusPublished
Publication dates
Online29 Jan 2024
Publication process dates
Deposited07 Aug 2024
Permalink -

https://repository.derby.ac.uk/item/q7qv2/storage-aware-data-management-system-for-genomics

Download files


File
3633624.3633628.pdf
License: CC BY 4.0
File access level: Open

  • 15
    total views
  • 8
    total downloads
  • 1
    views this month
  • 1
    downloads this month

Export as

Related outputs

Neurotechnological solutions for post-traumatic stress disorder: A perspective review and concept proposal
Laugharne, R., Farid, M., James, C., Dutta, A., Mould, C., Molten, N., Laugharne, J. and Shankar, R. 2023. Neurotechnological solutions for post-traumatic stress disorder: A perspective review and concept proposal. Healthcare Technology Letters. 10 (6), pp. 133-138. https://doi.org/10.1049/htl2.12055
Comparative study of the scaling behavior of the Rényi entropy for He-like atoms
Farid, M, Abdel-Hady, A, Nasser, I and Farid, Mohsen 2017. Comparative study of the scaling behavior of the Rényi entropy for He-like atoms. IOP Publishing. https://doi.org/10.1088/1742-6596/869/1/012011
Contextualizing geometric data analysis and related data analytics: A virtual microscope for big data analytics
Farid, Mohsen and Murtagh, Fionn 2017. Contextualizing geometric data analysis and related data analytics: A virtual microscope for big data analytics. Journal of Interdisciplinary Methodologies and Issues in Sciences. https://doi.org/10.18713/JIMIS-010917-3-1
Frontal view gait recognition with fusion of depth features from a time of flight camera
Afendi Tengku Mohd, Kurugollu, Fatih, Crookes, Danny, Bouridane, Ahmed and Farid, Mohsen 2018. Frontal view gait recognition with fusion of depth features from a time of flight camera. IEEE Transactions on Information Forensics and Security. https://doi.org/10.1109/TIFS.2018.2870594
Exploiting in-memory systems for gnomic data analysis.
Shah, Zeeshan Ali, El-Kalioby, Mohamed, Faquih, Tariq, Shokrof, Moustafa, Subhani, Shazia, Alnakhli, Yasser, Aljafar, Hussain, Anjum, Ashiq and Abouelhoda, Mohamed 2018. Exploiting in-memory systems for gnomic data analysis. Springer. https://doi.org/10.1007/978-3-319-78723-7_35
Cloud-based video analytics using convolutional neural networks.
Yaseen, M., Anjum, Ashiq, Farid, Mohsen and Antonopoulos, Nick 2018. Cloud-based video analytics using convolutional neural networks. Software Practice and Experience. https://doi.org/10.1002/spe.2636
Video authentication based on statistical local information
Al-Athamneh, Mohammad, Crookes, Danny and Farid, Mohsen 2016. Video authentication based on statistical local information. IEEE.
Digital video source identification based on green-channel photo response non-uniformity (G-PRNU)
Al-Athamneh, Mohammad, Kurugollu, Fatih, Crookes, Danny and Farid, Mohsen 2016. Digital video source identification based on green-channel photo response non-uniformity (G-PRNU). https://doi.org/10.5121/csit.2016.61105
The structure of argument: Semantic mapping of US supreme court cases
Murtagh, Fionn and Farid, Mohsen 2015. The structure of argument: Semantic mapping of US supreme court cases. Springer. https://doi.org/10.1007/978-3-319-17091-6_34