Applied Machine Learning - Week 1
Monday the 25th - Friday the 29th of April 2022
Groups: For the final project you should find a group (administrated by Azzurra),
but we highly recommend that you also work/collaborate/discuss in a group for exercises.
Monday 25th of April (afternoon):
Lectures: Intro to course, outline, groups, and discussion of data and goals (TP, VR, RS, AD).
Introduction to Machine Learning (TP).
Course information can be found here:
ML2022_CourseInformation.pdf
     Recording of last year's
Lecture (467 MB) (for reference).
Exercise: Setup of infrastructure (Github, ERDA, Zoom, Slack). Test your Python setup with
ML_MethodsDemos.ipynb.
     Getting a feel for the
Curse of Dimensionality, making life in high dimensions a lonely one!
     Inspecting data and making a "human" decision tree for classification of b-quark jets in Aleph data:
     Code for initial analysis:
BjetSelection.ipynb (classifying b-jets using if-sentences!)
     Data:
READMEAlephBtagData.txt (2 kB),
AlephBtag_MC_small_v1.csv (2.8 MB), and
AlephBtag_MC_small_v2.csv (2.8 MB).
Wednesday 27th of April (morning):
Lectures:
Introduction to Tree-based algorithms (TP).
     Additional slides:
ML2022_Example_HousingPrices.pdf (2.8 MB)
     Recording of last year's
Lecture (314 MB).
Exercise: Exercise: Classification of b-quark jets in Aleph data with Tree based methods.
     Compare performance to your own Decision Tree and the Aleph NN.
     Additional (reference) data, on classifying stars, galaxies, and quasars:
Data_SDSS.txt (6.3 MB).
Wednesday 27th of April (afternoon):
Lectures:
Introduction to NeuralNet-based algorithms (TP).
     Additional slides:
ML2022_AppliedML_Top10.pdf (100 kB)
     Recording of last year's
Lecture (331 MB).
Exercise: Exercise: Classification of b-quark jets in Aleph data with Neural Net based methods.
     Compare performance to your tree based method(s) and the Aleph NN.
     Challenge: Given a "large" dataset on b-jets (
AlephBtag_MC.csv (168 MB)), see how performance improves with data size.
Example solutions from week 1:
The following are
example solutions and related code, which comes with absolutely no warrenty. However, you may let yourself be inspired by these solutions:
Solution example using LightGBM (tree based) and MLPClassifier (NN based).
Solution example using Tensor Flow and PyTorch (both NN based).
Last updated: 27th of of April 2022 by Troels Petersen.