The course intends to provide the basic notions for data and signal analysis with emphasis on application to biology and medicine.
The course provides students with the essential tools and operational skills for quantitative analysis of data and signals of interest for medicine and biology, on a probabilistic perspective
By the end of the course the students will be able to:
1. Design and apply methods of analysis and modelling of data - including temporal data (signals) - of interest for medicine and biology
2. Identify the correct approach (model selection, model identification, data visualization) for a specific data analysis problem
3. Use MATLAB to correctly display and model biomedical data and signals
There are no formal prerequisites, but the course requires solid foundations in mathematical analysis and linear algebra.
The course is organised as a combination of lectures and lab activities.
Lectures will focus on theory and methods for data analysis.
Lab activities will focus on application to real data analysis problems in the context of bioengineering.
A. Data Analysis and Data Display (wk 1-3) Data types. Descriptive statistics. Analysis as modeling. Statistical data analysis. Regression. Visual display of information.
B.Probability density estimates (wk 4-6) Unsupervised learning. Gaussian model. Principal Component Analysis, Factor Analysis, Independent Component Analysis, Cluster analysis and the EM algorithm. Graphical models. Regression and factor analysis as graphical models.
C. Pattern analysis and decision theory (wk 7-8) Bayesian decision theory. Bayes classifiers. Logistic classifiers. Performance of a classifier: ROC curve. Generalized linear models. Introduction to neural networks. Model generalization and bias-variance trade-off.
D. Dynamic Models (wk 9-10). Temporal data (signals). Hidden Markov Models, Linear dynamical systems. Kalman filters.
E. Model selection (wk 11-12). Statistical Inference. Hypothesis testing. General Linear models and the analysis of variance. Mixed-effects models. Bayesian model selection.
Murphy, KP. Machine Learning: A Probabilistic Perspective. MIT Press, 2012.
Ricevimento: VITTORIO SANGUINETI. Appointment: Tel. 0103356487 or vittorio.sanguineti@unige.it
Ricevimento: For appointment contact: martina.brofiga@unige.it
VITTORIO SANGUINETI (President)
CECILIA DE VICARIIS
MARTINA BROFIGA (President Substitute)
https://corsi.unige.it/11159/p/studenti-orario
Written exam (weight 50%)
Project work (individuals or couples, weight 50%)
• Solution of a real problem of biomedical data analysis/processing, chosen from a list of proposed projects
• Development of software for calculation/analysis/processing
• Interactive application (MATLAB Livescript) reporting the results
• Fixed deadline for submission (early February)
Project work will be assessed in terms of:
1) Documentation (correctess, clarity, synthesis, terminology): 10 pts
2) Implementation (code structure and organization, efficiency): 10 pts
3) Data Visualization (technical quality of figures, adequacy of display technique, efficacy, clarity): 10 pts
4) Bonus (max 2 pts) if the report provides additional analysis (in addition to those required). Bonus is only awarded if maximum score is obtained in the other three criteria.