The course will teach you:
To understand and be familiar with:
- The differences between structured, semi-structured and unstructured data, and how these differences affect data processing.
- What datasets are and how they are used and modified for different purposes, such as data analysis.
- What dimensional data modeling is, and its basics.
To know, be familiar with and recognize:
- Some data format standards and their uses.
- Different categories of data (master data, transaction data, reference data, temporary data, metadata).
To do:
- Perform data processing, such as filtering, cleansing, validation, alignment, enrichment, and conversion from one format to another.
- Design a schema for the intended use.
- Convert data from one file format to another programmatically.
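As a rough illustration of the kind of task listed above, the following minimal sketch cleanses, validates, filters and enriches a small CSV dataset and converts it to JSON. The sample data, the column names, the `process` function and the 30-degree threshold are all invented for this example and are not part of the course material.

```python
import csv
import io
import json

# Hypothetical sample data; the column names are invented for illustration.
RAW_CSV = """id,name,temperature_c
1, Alice ,21.5
2,Bob,not_a_number
3,,19.0
4, Carol ,35.2
"""


def process(csv_text: str, max_temp: float = 30.0) -> str:
    """Read CSV text, clean and check each row, and return a JSON string."""
    records = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        # Cleansing: remove stray whitespace around the name.
        name = row["name"].strip()

        # Validation: skip rows with a missing name or a non-numeric reading.
        if not name:
            continue
        try:
            temp_c = float(row["temperature_c"])
        except ValueError:
            continue

        # Filtering: keep only readings at or below the (assumed) threshold.
        if temp_c > max_temp:
            continue

        # Enrichment: add a derived Fahrenheit value.
        records.append({
            "id": int(row["id"]),
            "name": name,
            "temperature_c": temp_c,
            "temperature_f": round(temp_c * 9 / 5 + 32, 1),
        })

    # Conversion: serialize the cleaned records from CSV rows to JSON.
    return json.dumps(records, indent=2)


if __name__ == "__main__":
    print(process(RAW_CSV))
```

Splitting the work into small, named steps like this also makes it easier to decide which stages a given set of requirements actually needs.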
In addition, the course develops:
- Problem-solving and decision-making skills: how to divide data (pre)processing into stages based on the requirements.
- Ethics, responsibility and sustainable development: how to implement data processing in an energy-efficient way, how to protect the privacy of users and other people, and how to recognize information that should be kept secret.
- Digital skills: programming.