Competency information
Details
Deploy the workflow, including maintenance and upgrading, ensuring compliance with quality assurance procedures, including version control.
Considerations
- How to develop a data and meta-data capture strategy for a genomics laboratory.
- How to perform a strategic analysis of the computational requirements of a genomics laboratory through the use of computational analysis and design tools, e.g. structured systems analysis and design method (SSADM).
- How to evaluate computing solutions for their fitness for purpose within the security and IT governance frameworks of the NHS.
- How to evaluate data integration strategies for their fitness for purpose within the security and IT governance frameworks of the NHS.
- How to create simple workflows capable of integrating and analysing clinical functional genomics data.
Computational infrastructure
- Data encryption and data encryption standards.
- Governance and security issues for large data in the NHS.
- Basic cloud computing architectures (software as service, compute as service, etc.).
- Public and private cloud architectures (including commercial systems such as Azure and EC2).
- A basic introduction to workflows in computer science.
- An introduction to workflow tools (Taverna, Galaxy, etc.).
Functional genomics and genomics data sets
- The concept of meta-data.
- The role of minimum information standards to allow effective sharing.
- Tools to capture minimal information data (Extensible Markup Language; XML).
- An introduction to ontologies.
- Community annotation through ontology.
- Interoperating with ontologies.
- Strategies for large-scale data integration and data mining.
- The pros and cons of data warehouses versus data integration over distributed heterogeneous data.
- Examples of ontology-driven data integration.
- Examples of data warehouses for genomic integration (Ensembl).
Workflows
- The basic theory of computational workflows.
- The architecture of workflow systems.
- Examples of workflows in genetics (Galaxy assembly of NGS data).
- Analysis of current literature and data integration and workflows in genetics and medicine.
Relevant learning outcomes
# | Outcome |
---|---|
# 1 | Outcome Identify a clinical and/or laboratory bioinformatics requirement and develop, validate and deploy a bespoke workflow for clinical or public health analysis. |