This course will bring you to the forefront of Applied Machine Learning and Big Data Analysis

Applied Machine Learning and Big Data Analysis

Machine Learning is entering essentially all data-based fields, and Big Data is omnipresent from private industries to governmental organizations. It is a new approach to problem solving, and while the potential is often exaggerated, Machine Learning does indeed introduce new opportunities, but it also poses some very real challenges. The ability to analyze and combine large amounts of data from different sources has obvious applications. However, the lack of quality in the data combined with a high variance means that conventional analysis often fails, while Machine Learning algorithms are less affected, if trained and used correctly. 

This course will bring you to the forefront of the field of applied machine learning by introducing you to the newest tools and methods in large-scale data analysis based on cutting-edge research and the extensive experience.

Throughout the course, we will use examples of structured datasets in a commercial context, which will be used to demonstrate the different steps in Big Data Analysis. Participants will also have the chance to ask questions about specific data and challenges.

Core elements

    • Data cleaning and statistical methods: Detecting and correcting (or removing) corrupt or inaccurate records, and robust statistical methods for data with very large variance and cross checks.
    • Machine Learning algorithms: Introduction to a variety of methods, how they work behind the scenes, their strengths and weaknesses, and their applications.
    • Finding patterns and outliers in Big Data: Which methods can be used to identify sparse patterns in very large datasets, and how can we identify data that does not follow the general pattern of a dataset?
    • Collecting data from instruments and devices: How to collect, store, and analyze data from a multitude of sources (e.g. apparatus, IoT, etc.).
    • Systems for Big Data Analysis: Hadoop, PyDisco, etc., and hardware systems design for efficient analysis.
    • Selected machine learning algorithms for large-scale data: Random forests, (deep) neural networks, support vector machines, and large-scale exact nearest neighbour search.
    • Systems for Big Data Analysis: Common systems for BDA; Hadoop, PyDisco, etc., and hardware systems design for efficient BDA.

Tools/methods introduced

    • Selected machine learning algorithms for large-scale data: Random forests, support vector machines, and large-scale exact nearest neighbour search
    • Data curation: How to select data for long time curation, systems, techniques and standards for data curation

We will primarily be working with Python; however, all techniques that are covered are easily implemented with all standard data-analysis languages.

The course is strictly focused on Machine Learning and Big Data Analysis, so a prerequisite is that you have a background in statistics and/or conventional data analysis. This course assumes you have studied to at least Bachelor degree level and/or have several years of data analysis experience.

Share this page

Testimonials

"Best course I ever been to." 
Course participant, 2019

"Very good general overview of ML, I fell much more confident in applying the techniques in my projects."
Jan Dudek, Data Analyst, Ministry of Health of the Slovak Republic, 2019

"Variety of methids covered with examples from real world."
Course participant, 2019

"Really good teachers, very good at explaining and applying it to real data/problems."
Course participant, 2019

Course directors - Applied Machine Learning

Joachim Mathiesen

Associate Professor, Biocomplexity, Niels Bohr Institute, University of Copenhagen

Kenneth Skovhede

Assistant professor, Xray and Neutron Science, Niels Bohr Institute, University of Copenhagen

Course information - Applied Machine Learning

Duration: 5 days
Dates and time:

August 16-20, 2021, 9 am - 4.30 pm

Price: EUR 2,755 (DKK 20,500) excl. Danish VAT. The price includes tuition, course material and all meals during course hours.
Language: English
Location: South Campus, Faculty of Law, Njalsgade 76, DK-2300 Copenhagen S, Denmark
Registration deadline: August 1, 2021
Contact: Copenhagen Summer University
csu@adm.ku.dk
+45 3533 3423


Download the course description for Applied Machine Learning as PDF