Course
Published
(Updated )
Helsingin yliopisto
Data Analysis with Python
From
5 cr
Online
Intermediate studies
The course uses practical approach to different phases of data analysis pipeline: data fetching and cleaning, reshaping, subsetting, grouping, and combining data; and using aggregation, machine learning and data visualization to extract knowledge from data.
- Libraries: Numpy, Pandas, Scikit-learn, (Matplotlib)
- Interactive study materials: Jupyter notebook
- Automatic checking of exercises: Test My Code framework
- Basics of Python language
- Numpy
- Creation and indexing of arrays
- Array concatenation and splitting
- Fast computation using universal functions
- Summary statistics
- Broadcasting
- Matrix operations and basic linear algebra
- Pandas
- Creating and indexing of Series and DataFrames
- Handling missing data
- Concatenation of Series and DataFrames
- Grouping and aggregating
- Merging DataFrames
- Gentle introduction to machine learning through Scikit-learn library
- Linear regression
- Naive Bayes classification
- Principal component analysis
- k-means clustering
- Project on applying the learned skills on an application field
Select how you would like to study:
Additional information and registration
Price and registration period
Check on the university's website
Additional information:
Fields
Information and communication technologies
Scope
5 cr
1 component
Code
BSCS2015
What you need to know before the course
What you will learn
Learning material
Additional information