Applied Statistics - Week 6

Monday the 2nd - Friday the 6th of January 2017

The following is a description of what we will go through during this week of the course. The chapter references and computer exercises are considered read, understood, and solved by the beginning of the following class, where I'll shortly go through the exercise solution.

General notes, links, and comments:
  • :

    Monday:
    We will have a short exercise about planning of experiments, using "A Lady tasting tea" as our source of inspiration, and then expanding from there. Part of class will also be dedicated to continue working on Project2.

    Reading:
  • Lady tasting tea (Wikipedia).
  • Short note on Lady tasting tea.
  • NOTE: You should by now have read curriculum (roughly Barlow chapters 1-8).
    Lecture(s):
  • There are no more formal lectures.
    Computer Exercise(s):
  • Weldon's Dices: WeldonsDices.py.
  • Planning an Experiment: PlanExp.py


    Tuesday:
    We will use two days for the last "major" theme in these course, which is MultiVariate Analysis (MVA), that is analysis of data with more than one (typically many) variables. To begin with, we will consider the relatively simple linear case, which is described by Fisher's Discriminant, but Friday move on to more complex sets of data, for which more advanced non-linear methods, such as Neural Networks (NN) and Boosted Decision Trees (BDT) are more useful.

    Reading:

  • Lecture(s):
  • AS2016_0103_MultiVariateAnalysis1.pdf
    Computer Exercise(s):
  • 2par_discriminant.py
  • fisher_discriminant.py and data

    Friday:
    Reading:

  • Lecture(s):
  • AS2016_0106_MultiVariateAnalysis2.pdf
    Computer Exercise(s):
  • MVA_train_discriminant.py and
    three TMVA files: TMVAGui.C, TMVAlogon.C, and tmvaglob.C
  • Data samples: DataSample.root, atlas_test_beam_data.root Higgs14TeV.root, ZZ14TeV.root
  • "Real life" TMVA setup example (with two simple variables): GenerateData.py, TrainClassifier.py, and UseClassifier.py.

    Finally, a link to an online course on Machine Learning (by Udacity), which I was recommended.

    Last updated: 7th of January 2017.