The course will teach you:
To understand and be familiar with:
- The differences between structured, semi-structured and unstructured data, and how these differences affect data processing.
- What datasets are and how they are used and modified for different needs, such as data analysis.
- The privacy requirements for data processing.
- How to choose a visualization method based on the need and the data.
- How to modify data as the basis for visualization.
- Why reports are made, and how to plan the content of a report.
- What dimensional data modeling is, and its basics.
To be familiar with, know and recognize:
- Some data format standards and their uses.
- The different forms of data (master data, transaction data, reference data, temporary data, metadata).
- The basics of data visualization.
To do:
- Data processing, such as filtering, cleansing, validation, alignment, enrichment, and transformation from one format to another.
- Design a schema for the intended use.
- Convert data from one file format to another programmatically.
- Create simple data visualizations and reports.
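As a rough illustration of the hands-on skills listed above, the following minimal Python sketch cleans, filters and validates a small CSV sample and converts it to JSON. The sample data, the column names (name, city, age) and the function names csv_to_records and records_to_json are assumptions made up for this example; they are not taken from the course material, and real exercises may use different tools and formats.

```python
import csv
import io
import json

# Hypothetical sample input; the column names are assumptions for illustration.
RAW_CSV = """name,city,age
 Alice ,Helsinki,34
Bob,,41
Carol,Tampere,not_a_number
Dave,Oulu,29
"""


def csv_to_records(text):
    """Parse CSV text into cleaned, validated dictionaries."""
    records = []
    for row in csv.DictReader(io.StringIO(text)):
        # Cleansing: trim stray whitespace from every value.
        row = {key: (value or "").strip() for key, value in row.items()}
        # Filtering: skip rows that lack a required field.
        if not row["name"] or not row["city"]:
            continue
        # Validation: keep only rows whose age is a whole number.
        if not row["age"].isdigit():
            continue
        # Transformation: convert the age from text to an integer.
        row["age"] = int(row["age"])
        records.append(row)
    return records


def records_to_json(records):
    """Serialize the records as a JSON array (format conversion)."""
    return json.dumps(records, indent=2)


if __name__ == "__main__":
    print(records_to_json(csv_to_records(RAW_CSV)))
```

In this sketch, rows with a missing city or a non-numeric age are dropped rather than repaired. That is only one possible policy; depending on the requirements, such rows could instead be logged, corrected or enriched from another source.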
In addition, the course develops:
- Problem-solving and decision-making skills: how to divide data (pre)processing into stages based on the requirements.
- Ethics, responsibility and sustainable development: how to implement data processing in an energy-efficient way, how to protect the privacy of users and other people, and how to recognize information that should be kept secret.
- Digital skills: programming.