Competency information

Details

Summarise data from a large or complex data set and present the results

Considerations

Database management and data mining

  • The relational model of data
  • Implementation of relational databases
  • Advanced SQL programming
  • Query optimisation
  • Concurrency control and transaction management
  • Database performance tuning
  • Distributed relational systems and data replication
  • Column store/data warehousing database engines
  • Document-oriented databases (eg Lucene)
  • Security considerations
  • Data mining
  • Large data set methodologies
  • Database standards and standards for interoperability and integration
  • Data analysis and presentation
  • The use of commonly available databases, spreadsheets and statistics packages (eg MS SQL, MySQL, SPSS, Excel, Minitab, SAS, R)

Relevant learning outcomes

#   Outcome
2   Implement SQL and data mining strategies on a large data set
3   Summarise and present data from large datasets
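
As a rough illustration of what outcomes 2 and 3 might look like in practice, the sketch below loads rows into an in-memory SQLite database and summarises them with a SQL GROUP BY query. It is only a minimal example under stated assumptions: the table name `measurements`, the column names `category` and `value`, the function `summarise_by_category` and the sample rows are all hypothetical and are not taken from the competency itself. Any of the packages listed above could be used instead.

```python
import sqlite3


def summarise_by_category(rows):
    """Return (category, count, mean, max) per category, ordered by category.

    `rows` is an iterable of (category, value) pairs. The table and column
    names below are hypothetical, chosen only for this illustration.
    """
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("CREATE TABLE measurements (category TEXT, value REAL)")
        # Parameterised inserts avoid building SQL strings by hand.
        conn.executemany("INSERT INTO measurements VALUES (?, ?)", rows)
        cursor = conn.execute(
            "SELECT category, COUNT(*), AVG(value), MAX(value) "
            "FROM measurements GROUP BY category ORDER BY category"
        )
        return cursor.fetchall()
    finally:
        conn.close()


# Hypothetical sample data, invented for illustration only.
SAMPLE_ROWS = [("A", 2.0), ("A", 4.0), ("B", 1.0), ("B", 3.0), ("B", 5.0)]

if __name__ == "__main__":
    # Present the summary as a simple aligned text table.
    print(f"{'category':<10}{'n':>4}{'mean':>8}{'max':>8}")
    for category, n, mean_value, max_value in summarise_by_category(SAMPLE_ROWS):
        print(f"{category:<10}{n:>4}{mean_value:>8.1f}{max_value:>8.1f}")
```

Doing the aggregation in SQL rather than in application code keeps the summarising step inside the database engine, which is usually where it belongs for a large data set. The printed table stands in for whatever presentation format the results actually require.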