CODE 80563 ACADEMIC YEAR 2024/2025 CREDITS 9 cfu anno 1 BIOENGINEERING 11159 (LM-21) - GENOVA SCIENTIFIC DISCIPLINARY SECTOR ING-INF/06 LANGUAGE English TEACHING LOCATION GENOVA SEMESTER 1° Semester TEACHING MATERIALS AULAWEB OVERVIEW The course intends to provide the basic notions for data and signal analysis with emphasis on application to biology and medicine. AIMS AND CONTENT LEARNING OUTCOMES The course provides students with the essential tools and operational skills for quantitative analysis of data and signals of interest for medicine and biology, on a probabilistic perspective AIMS AND LEARNING OUTCOMES By the end of the course the students will be able to: 1. Design and apply methods of analysis and modelling of data - including temporal data (signals) - of interest for medicine and biology 2. Identify the correct approach (model selection, model identification, data visualization) for a specific data analysis problem 3. Use MATLAB to correctly display and model biomedical data and signals PREREQUISITES There are no formal prerequisites, but the course requires solid foundations in mathematical analysis and linear algebra. TEACHING METHODS The course is organised as a combination of lectures and lab activities. Lectures will focus on theory and methods for data analysis. Lab activities will focus on application to real data analysis problems in the context of bioengineering. SYLLABUS/CONTENT A. Data Analysis and Data Display (wk 1-3) Data types. Descriptive statistics. Analysis as modeling. Statistical data analysis. Regression. Visual display of information. B.Probability density estimates (wk 4-6) Unsupervised learning. Gaussian model. Principal Component Analysis, Factor Analysis, Independent Component Analysis, Cluster analysis and the EM algorithm. Graphical models. Regression and factor analysis as graphical models. C. Pattern analysis and decision theory (wk 7-8) Bayesian decision theory. Bayes classifiers. Logistic classifiers. Performance of a classifier: ROC curve. Generalized linear models. Introduction to neural networks. Model generalization and bias-variance trade-off. D. Dynamic Models (wk 9-10). Temporal data (signals). Hidden Markov Models, Linear dynamical systems. Kalman filters. E. Model selection (wk 11-12). Statistical Inference. Hypothesis testing. General Linear models and the analysis of variance. Mixed-effects models. Bayesian model selection. RECOMMENDED READING/BIBLIOGRAPHY Murphy, KP. Machine Learning: A Probabilistic Perspective. MIT Press, 2012. TEACHERS AND EXAM BOARD VITTORIO SANGUINETI Ricevimento: VITTORIO SANGUINETI. Appointment: Tel. 0103356487 or vittorio.sanguineti@unige.it MARTINA BROFIGA Ricevimento: For appointment contact: martina.brofiga@unige.it Exam Board VITTORIO SANGUINETI (President) CECILIA DE VICARIIS MARTINA BROFIGA (President Substitute) LESSONS LESSONS START https://corsi.unige.it/11159/p/studenti-orario Class schedule The timetable for this course is available here: Portale EasyAcademy EXAMS EXAM DESCRIPTION Written exam (weight 50%) Project work (individuals or couples, weight 50%) • Solution of a real problem of biomedical data analysis/processing, chosen from a list of proposed projects • Development of software for calculation/analysis/processing • Interactive application (MATLAB Livescript) reporting the results • Fixed deadline for submission (early February) ASSESSMENT METHODS Project work will be assessed in terms of: 1) Documentation (correctess, clarity, synthesis, terminology): 10 pts 2) Implementation (code structure and organization, efficiency): 10 pts 3) Data Visualization (technical quality of figures, adequacy of display technique, efficacy, clarity): 10 pts 4) Bonus (max 2 pts) if the report provides additional analysis (in addition to those required). Bonus is only awarded if maximum score is obtained in the other three criteria.