pISA-tree - a data management framework for life science research projects using a standardised directory tree

Abstract

        We have developed pISA-tree, a straightforward and flexible data management solution for organisation of life science project-associated research data and metadata. It enables on the fly creation of enriched directory tree structure (
        p
        roject/
        I
        nvestigation/
        S
        tudy/
        A
        ssay), via a series of sequential batch files in a standardised manner based upon the ISA metadata framework. Metadata, according to the system-provided metadata templates, is generated in parallel at each level. The system supports reproducible research and is in accordance with the Open Science initiative and FAIR principles. Compared with similar frameworks, it does not require any systems administration and maintenance as it can be run on a personal computer or network drive. It is complemented with two R packages,
        pisar
        and
        seekr
        , where the former facilitates integration of the pISA-tree datasets into bioinformatic pipelines and the latter enables synchronisation with the FAIRDOMHub public repository using the SEEK API. Source code and detailed documentation of pISA-tree and its supporting R packages are available from
        https://github.com/NIB-SI/pISA-tree
        . We demonstrate the usability of pISA-tree with two examples of medium sized life science projects. Accordingly, it is suitable and also currently used to manage larger projects including several partners from different countries. Since pISA-tree was initiated by end user requirements with an emphasis on practicality, it will facilitate adoption of FAIR data management practices and open science principles.

pISA-tree - a data management framework for life science research projects using a standardised directory tree

Related items