Applied Statistics - Week 6

The following is a description of what we will go through during this week of the course. The chapter references and computer exercises are considered read, understood, and solved by the beginning of the following class, where I'll shortly go through the exercise solution.

General notes, links, and comments:
  • :

    Monday:
    We will have a short exercise about planning of experiments, using "A Lady tasting tea" as our source of inspiration, and then expanding from there. Part of class will also be dedicated to continue working on Project2.

    Reading:
  • Lady tasting tea (Wikipedia).
  • Short note on Lady tasting tea.
  • NOTE: You should by now have read curriculum (roughly Barlow chapters 1-8).
    Lecture(s):
  • There are no more formal lectures.
    Computer Exercise(s):
  • Planning an Experiment: PlanExp.py


    Tuesday:
    We will use two days for the last "major" theme in these course, which is MultiVariate Analysis (MVA), that is analysis of data with more than one (typically many) variables. To begin with, we will consider the relatively simple linear case, which is described by Fisher's Discriminant, but Friday move on to more complex sets of data, for which more advanced non-linear methods, such as Neural Networks (NN) and Boosted Decision Trees (BDT) are more useful.

    Reading:

  • Lecture(s):
  • AS2016_0105_MultiVariateAnalysis1.pdf
    Computer Exercise(s):
  • 2par_discriminant.py along with an example solution.
  • fisher_discriminant.py and data along with an example solution.

    Friday:
    Reading:

  • Lecture(s):
  • AS2016_0108_MultiVariateAnalysis2.pdf
    Computer Exercise(s):
  • MVA_train_discriminant.py and
    three TMVA files: TMVAGui.C, TMVAlogon.C, and tmvaglob.C
  • Data samples: DataSample.root, atlas_test_beam_data.root Higgs14TeV.root, ZZ14TeV.root
  • "Real life" TMVA setup example (with two simple variables): GenerateData.py, TrainClassifier.py, and UseClassifier.py.
    Last updated: 8th of January 2016.