Data Management in Computational Systems Biology: Exploring Standards, Tools, Databases, and Packaging Best Practices.
Computational systems biology involves integrating heterogeneous datasets in order to generate models. These models can assist with understanding and prediction of biological phenomena. Generating datasets and integrating them into models involves a wide range of scientific expertise. As a result, these datasets are often collected by one set of researchers and exchanged with other researchers for constructing the models. For this process to run smoothly, the data and models must be FAIR: findable, accessible, interoperable, and reusable. In order for data and models to be FAIR, they must be structured in consistent and predictable ways, and described sufficiently for other researchers to understand them. Furthermore, these data and models must be shared with other researchers, with appropriately controlled sharing permissions, before and after publication. In this chapter we explore the different data and model standards that assist with structuring, describing, and sharing. We also highlight the popular standards and sharing databases within computational systems biology.
PubMed ID: 31602618
Publication type: Journal
Journal: Methods Mol Biol
Citation: Methods Mol Biol. 2019;2049:285-314. doi: 10.1007/978-1-4939-9736-7_17.
Date Published: 12th Oct 2019
Registered Mode: by PubMed ID
Professor of Computer Science, University of Manchester; Co-Director of the FAIRDOM Initiative and co-leader of the SEEK4Science Platform Development; Deputy Head of Node, ELIXIR-UK; Co-lead, ELIXIR Interoperability Backbone Platform; Lead, ISBE WP Data and Model Management; Data lead, SynBioChem Manchester Synthetic Biology Research Centre for Fine and Speciality Chemicals
Projects: FAIRDOM, Early Metabolic Injury (LiSyM-EMI - Pillar I), Chronic Liver Disease Progression (LiSyM-DP - Pillar II), Regeneration and Repair in Acute-on-Chronic Liver Failure (LiSyM-ACLF - Pillar III), LiSyM Core Infrastructure and Management (LiSyM-PD), Liver Function Diagnostics (LiSyM-LiFuDi - Pillar IV), Model Guided Pharmacotherapy In Chronic Liver Disease (LiSyM-MGP), Multi-Scale Models for Personalized Liver Function Tests (LiSyM-MM-PLF), The Hedgehog Signalling Pathway (LiSyM-JGMMS), Molecular Steatosis - Imaging & Modeling (LiSyM-MSIM), Kinetics on the move - Workshop 2016, Example use cases, FAIRDOM user meeting, MS_DILI, COMBINE Multicellular Modelling, FAIRDOM & LiSyM & de.NBI Data Structuring Training, EnzymeML, GMDS Project Group "FAIRe Dateninfrastrukturen für die Biomedizinische Informatik", FAIRDOM Community Workers, COVID-19 Disease Map, COVID-19 related studies and tools in Germany, nfdi4health - German National Research Data Infrastructure for Personal Health Data, ModeleXchange initiative
ORCID: https://orcid.org/0000-0002-8683-7084
Data management and standardization expert for systems biology and systems medicine, responsible for the data management user requirements and user contacts within the German LiSyM network (Liver Systems Medicine: http://lisym.org/) and associated with the FAIRDOM team. Involved in different standardization initiatives and committees, including COMBINE (http://co.mbine.org), ISO/TC 276 Biotechnology (https://www.iso.org/committee/4514241.html), European COST action CHARME (http://www.cost-charme.eu) and ...
Software Engineer and Architect working within the FAIRDOM team.
Leads the development of SEEK and RightField.
Projects: PSYSMO, MOSES, SysMO DB, SysMO-LAB, SulfoSys, SulfoSys - Biotec, Whole body modelling of glucose metabolism in malaria patients, FAIRDOM, Molecular Systems Biology, COMBINE Multicellular Modelling, HOTSOLUTE, Steroid biosynthesis, Yeast glycolytic oscillations, Computational pathway design for biotechnological applications, SCyCode The Autotrophy-Heterotrophy Switch in Cyanobacteria: Coherent Decision-Making at Multiple Regulatory Layers
Institutions: Manchester Centre for Integrative Systems Biology, University of Manchester, University of Stellenbosch, University of Manchester - Department of Computer Science, Stellenbosch University
Projects: SysMO DB, Whole body modelling of glucose metabolism in malaria patients, Manchester Institute for Biotechnology, FAIRDOM, ICYSB 2015 - International Practical Course in Systems Biology, GenoSysFat, DigiSal, FAIRDOM user meeting, FAIRDOM Templates
ORCID: https://orcid.org/0000-0003-4958-0184
Interested in systems + synthetic biology, biotechnology, mountaineering, swimming, running, and the occasional cup of tea. Once diagnosed as an ENFP.
Projects: de.NBI-SysBio, GenoSysFat, Kinetics on the move - Workshop 2016, Example use cases, COMBINE Multicellular Modelling, GMDS Project Group "FAIRe Dateninfrastrukturen für die Biomedizinische Informatik", COVID-19 related studies and tools in Germany, nfdi4health - German National Research Data Infrastructure for Personal Health Data
ORCID: https://orcid.org/0000-0002-5886-5563
I am a computer scientist by training with a specialisation in database and information systems. Since December 2018 I have been professor of Medical Informatics at the University Medicine in Greifswald, Germany, at the Institute of Community Medicine. My lab focuses on research data management in biomedicine, data integration across health care providers, and provenance of clinical research data items within clinical information systems. Furthermore, I am actively involved in COMBINE standardisation ...
Projects: de.NBI-SysBio, Kinetics on the move - Workshop 2016, Example use cases, SBEpo - Systems Biology of Erythropoietin, FAIRDOM & LiSyM & de.NBI Data Structuring Training, FAIRDOM, EnzymeML, GMDS Project Group "FAIRe Dateninfrastrukturen für die Biomedizinische Informatik", FAIRDOM Community Workers, MIX-UP, CEPLAS – Cluster of Excellence on Plant Sciences, COVID-19 Disease Map, ERNEST Mapping Group Pilot Study, NMTrypI - New Medicines for Trypanosomatidic Infections, Standardization of enzyme-catalyzed reaction measurement, Standardization of enzyme-catalyzed reaction modelling, ModeleXchange initiative, CoVIDD - Coronavirus interactions in drug discovery - optimization and implementation, Mass spectrometry proteomics for biomarker discovery
ORCID: https://orcid.org/0000-0002-9077-5664
The Disease Maps Project is designed as a large-scale community effort. It is a network of groups that work together in order to better understand disease mechanisms. The project exchanges best practices, shares information, and develops tools to make it easier for all the involved groups to achieve their goals.
Projects: COVID-19 Disease Map
Web page: https://disease-maps.org
This is the exchange platform of the COMBINE network.
COMBINE is an initiative to coordinate the development of various community standards and formats for computational models: BioPax, CellML, NeuroML, Synthetic Biology Open Language (SBOL), Systems Biology Graphical Notation (SBGN), Systems Biology Markup Language (SBML), Simulation Experiment Description Markup Language (SED-ML).
Web page: http://co.mbine.org
LiSyM (Liver Systems Medicine) is a research network of German centers and institutions, brought together by a 20 million euro funding program of the German Government, in which mathematicians, modelers, pharmacologists, molecular biologists and clinical scientists work together to develop a Systems Medicine approach to study early and advanced liver disease. The aim of this unique research program is to acquire and use new experimental data and data from existing databases to build ...
Projects: Early Metabolic Injury (LiSyM-EMI - Pillar I), Chronic Liver Disease Progression (LiSyM-DP - Pillar II), Regeneration and Repair in Acute-on-Chronic Liver Failure (LiSyM-ACLF - Pillar III), LiSyM Core Infrastructure and Management (LiSyM-PD), Liver Function Diagnostics (LiSyM-LiFuDi - Pillar IV), Model Guided Pharmacotherapy In Chronic Liver Disease (LiSyM-MGP), Multi-Scale Models for Personalized Liver Function Tests (LiSyM-MM-PLF), The Hedgehog Signalling Pathway (LiSyM-JGMMS), Molecular Steatosis - Imaging & Modeling (LiSyM-MSIM), FAIRDOM & LiSyM & de.NBI Data Structuring Training, New LiSyM project
Web page: http://www.lisym.org
FAIRDOM is establishing a support and service network for European Systems Biology. FAIRDOM is a joint action of ERA-Net ERASysAPP and European Research Infrastructure ISBE to establish a data and model management service facility for Systems Biology. Our prime mission is to support researchers, students, trainers, funders and publishers by enabling Systems Biology projects to make their Data, Operating procedures and Models, Findable, Accessible, Interoperable and Reusable (FAIR).
Web page: http://fair-dom.org
FAIRDOM will establish a support and service network for European Systems Biology. We will serve projects in standardising, managing and disseminating data and models in a FAIR manner: Findable, Accessible, Interoperable and Reusable.
Here we share resources and best practices to develop a disease map for COVID-19. The project is progressing as a broad community-driven effort. We aim to establish a knowledge repository on virus-host interaction mechanisms specific to SARS-CoV-2. The COVID-19 Disease Map is an assembly of molecular interaction diagrams established based on literature evidence.