GRA 4142 Data Management and Python Programming

GRA 4142 Data Management and Python Programming

Course code: 
GRA 4142
Department: 
Data Science and Analytics
Credits: 
6
Course coordinator: 
Jan Kudlicka
Course name in Norwegian: 
Data Management and Python Programming
Product category: 
Master
Portfolio: 
MSc in Business Analytics
Semester: 
2022 Autumn
Active status: 
Active
Level of study: 
Master
Teaching language: 
English
Course type: 
One semester
Introduction

According to Statista, the annual amount of data created, captured, copied, and consumed worldwide will reach 97 zettabytes in 2022. Using available data to gain insights and make correct decisions is becoming essential for almost any business in today’s world.

This course introduces two of the most popular and indispensable programming languages for data analysts:

  • Python (with focus on data cleaning, processing, analysis and visualization)
  • SQL

In addition, the course also covers the basics of data management with focus on relational databases.

Students cannot do both GRA 4151 and GRA 4142 since these courses are equivalent.

Learning outcomes - Knowledge

Upon completion of the course the student shall be able to:

  • understand, explain and use fundamental programming concepts, including:
    • syntax and semantics,
    • variables,
    • types,
    • basic data structures,
    • expressions and statements,
    • control flow (conditionals and loops),
    • functions and libraries,
    • input/output operations,
    • exceptions

    with focus on the Python programming language,

  • understand and explain principles of data modeling and relational databases,
  • understand, explain and use SQL statements and queries.
Learning outcomes - Skills

Upon completion of the course the student shall be able to:

  • use integrated development environments to create computer programs,
  • design, implement, run, test and debug programs in Python based on a given textual description of a problem,
  • process, analyze, summarize and visualize datasets using Python, NumPy, Matplotlib and Seaborn and other libraries,
  • read and understand Python source code implemented by others,
  • create a data model based on a given textual description of a problem,
  • implement this data model in a relational database using the SQL language,
  • query and modify relational databases using the SQL language,
  • create computer programs in Python that store, modify and query data stored in relation databases,
  • set up indexes to improve the performance of databases.
General Competence

Upon completion of the course the student shall have stronger competence in:

  • processing and analyzing data with help of computers,
  • using online resources as aids to solve problems,
  • reading and understanding technical documentation,
  • working in groups.
Course content
  • Introduction, installation of Python, Jupyter lab, IDEs.
  • Executing Python code.
  • Variables, basic types, user input and output.
  • Control flow (conditional execution, loops).
  • Organizing code (functions and libraries).
  • Data structures.
  • Strings, reading, writing and processing text files.
  • Vectors and matrices (NumPy), random numbers and the Monte Carlo method.
  • Processing and analyzing tabular data with Pandas (reading, cleaning, manipulating, grouping and aggregating data).
  • Plotting and visualization (Matplotlib, Seaborn).
  • Introduction to relational databases.
  • Structured Query Language (SQL).
  • The entity-relationship (ER) model and the relational model.
  • Programming with databases.
  • Indexes.
  • Transactions.
Teaching and learning activities
  • Organized classes combining classical lectures with discussing and solving practical problems. (Students are expected to prepare for these sessions by going through given Jupyter notebooks and other reading material and/or watching selected videos online.)
  • Homework exercises (ungraded, solved individually or in groups of 2-3 students).

Software tools: open-source software (more information will be given at the beginning of the course).

Software tools
Software defined under the section "Teaching and learning activities".
Additional information

Please note that while attendance is not compulsory, it is the student’s own responsibility to obtain any information provided in class.

The exam for this course has been changed. The course now has one exam instead of two. It is not possible to retake one of the old exam versions. If you want to improve your grade in the course you will have to retake the new 100% exam. The code for the new exam is GRA 41423.  

Qualifications

All courses in the Masters programme will assume that students have fulfilled the admission requirements for the programme. In addition, courses in second, third and/or fourth semester can have specific prerequisites and will assume that students have followed normal study progression. For double degree and exchange students, please note that equivalent courses are accepted.

Disclaimer

Deviations in teaching and exams may occur if external conditions or unforeseen events call for this.

Assessments
Assessments
Exam category: 
Submission
Form of assessment: 
Written submission
Weight: 
100
Grouping: 
Individual
Duration: 
30 Hour(s)
Exam code: 
GRA 41423
Grading scale: 
ECTS
Resit: 
Examination when next scheduled course
Type of Assessment: 
Ordinary examination
Total weight: 
100
Student workload
ActivityDurationComment
Teaching
36 Hour(s)
Group work / Assignments
100 Hour(s)
Prepare for teaching
12 Hour(s)
Examination
12 Hour(s)
Expected time: 8-16 hours.
Sum workload: 
160

A course of 1 ECTS credit corresponds to a workload of 26-30 hours. Therefore a course of 6 ECTS credits corresponds to a workload of at least 160 hours.