Module information
Module details
- Title
- IT for Advanced Bioinformatics Applications
- Type
- Specialist
- Module code
- SBI126
- Credits
- 10
- Requirement
- Compulsory
Aim of this module
The volume of data being generated by new functional genomics and NGS methodologies is unprecedented in medicine. The challenges of being able to capture and integrate this data effectively such that it can be used effectively require solutions beyond those that have typically been used in clinical medicine. The trainee will be introduced to modern computational methodologies for handling and integrating large data. This will involve them in developing a good understanding of data description standards (through ontologies) and data federation methodologies. Workflow systems will be introduced as tools for industrial-scale bioinformatics analyses, as well as a discussion of cloud-based computer solutions for extending the compute resource available within the NHS. A strong focus will be placed on the ethical and governance issues raised by using such technologies within an NHS setting. In this module trainees will be expected to use computational methodologies for handling and integrating large data in accordance with data description standards (through ontologies) and data federation standards. They will be expected to use and design dynamic systems as tools for industrial-scale bioinformatics analyses within the ethical and governance issues raised by using such technologies within an NHS setting from the perspective of the patient, the clinical department and the organisation.
Work-based content
Competencies
# | Learning outcome | Competency | Action |
---|---|---|---|
# 1 | Learning outcome 1 |
Competency
Identify a task capable of automated workflow analysis. |
Action View |
# 2 | Learning outcome 1 |
Competency
Carry out a feasibility study, discussing user requirements, and develop a detailed requirements specification with key stakeholders, and validate and gain authorisation. |
Action View |
# 3 | Learning outcome 1 |
Competency
Perform, evaluate and present an options appraisal of technology and resource options. |
Action View |
# 4 | Learning outcome 1 |
Competency
Develop, test and evaluate the workflow and perform user acceptance procedures. |
Action View |
# 5 | Learning outcome 1 |
Competency
Deploy the workflow, including maintenance and upgrading, ensuring compliance with quality assurance procedures, including version control. |
Action View |
# 6 | Learning outcome 1 |
Competency
Finalise workflow documentation and file in accordance with local standard operating procedures. |
Action View |
# 7 | Learning outcome 1 |
Competency
Train workflow users and assess the effectiveness of the training. |
Action View |
Assessments
You must complete:
- 2 case-based discussion(s)
- 2 of the following DOPS/ OCEs:
Demonstrate the data backup procedure in accordance with departmental protocols | DOPS |
Demonstrate techniques checksum etc for testing the integrity of data transfer | DOPS |
Organise and oversee deployment of a software update. | DOPS |
Create a secure backup of an NGS dataset ensuring data integrity | DOPS |
Organise and oversee deployment of software. | DOPS |
Present a strategy for an identifiable clinical bioinformatics requirement to a team of professionals | OCE |
Learning outcomes
- Identify a clinical and/or laboratory bioinformatics requirement and develop, validate and deploy a bespoke workflow for clinical or public health analysis.
Academic content (MSc in Clinical Science)
Important information
The academic parts of this module will be detailed and communicated to you by your university. Please contact them if you have questions regarding this module and its assessments. The module titles in your MSc may not be exactly identical to the work-based modules shown in the e-portfolio. Your modules will be aligned, however, to ensure that your academic and work-based learning are complimentary.
Learning outcomes
- Describe basic cloud computing infrastructure.
- Describe the philosophy behind minimum information standards used to capture functional genomics data.
- Describe international data repositories for genetic and functional genomics data.
- Discuss the basic principles of ontologies for describing metadata.
- Describe the use of ontologies for capturing disease phenotype information.
- Discuss strategies for genetic data analysis over large-scale heterogeneous data.
- Describe a range of modern computational workflow systems.
- Discuss the application of workflow systems to NGS analysis.
- Discuss issues of data quality in medicine.
- Discuss the importance of data quality for patient safety.
- Describe the ethical and governance regulations relating to data capture in the NHS.
- Describe the ethical and governance concerns regarding data integration in the NHS.
- Describe basic principles of data encryption and international data encryption standards in medicine.
- Discuss the importance of information governance for patient safety.
Indicative content
Computational infrastructure
- Data encryption and data encryption standards
- Governance and security issues for large data in the NHS
- Basic cloud computing architectures (software as service, compute as service, etc.).
- Public and private cloud architectures (including commercial systems such as Azure and EC2)
- A basic introduction to workflows in computer science.
- An introduction to workflow tools (Taverna, Galaxy, etc.)
Functional genomics and genomics data sets
- The concept of metadata
- The role of minimum information standards to allow effective sharing
- Tools to capture minimal information data (XML)
- An introduction to ontologies
- Community annotation through ontology
- Interoperating with ontologies
- Strategies for large-scale data integration
- The pros and cons of data warehouses versus data integration over distributed heterogeneous data
- Examples of ontology-driven data integration
- Examples of data warehouses for genomic integration (Ensembl)
Workflows
- The basic theory of computational workflows
- The architecture of workflow systems
- Examples of workflows in genetics (Galaxy assembly of NGS data)
- Analysis of current literature and data integration and workflows in genetics and medicine
Clinical experiences
Important information
Clinical experiential learning is the range of activities trainees may undertake in order to gain the experience and evidence to demonstrate their achievement of module competencies and assessments. The list is not definitive or mandatory, but training officers should ensure, as best training practice, that trainees gain as many of these clinical experiences as possible. They should be included in training plans, and once undertaken they should support the completion of module assessments and competencies within the e-portfolio.
Activities
- Work within clinical teams and clinical settings using clinical bioinformatics as part of routine service and service development, and critically evaluate the current and future role of clinical bioinformatics from the perspective of the patient, the department and the healthcare organisation.