CODE 80563 ACADEMIC YEAR 2020/2021 CREDITS 9 cfu anno 1 BIOENGINEERING 11159 (LM-21) - GENOVA SCIENTIFIC DISCIPLINARY SECTOR ING-INF/06 LANGUAGE Italian (English on demand) TEACHING LOCATION GENOVA SEMESTER 1° Semester TEACHING MATERIALS AULAWEB OVERVIEW Provides the basic notions for data and signal analysis and practical applications to biomedical data and signals. AIMS AND CONTENT LEARNING OUTCOMES Analysis and visualization of data. Static graphical models (Bayesian networks, regression, factor analysis, decision theory and Gaussian mixtures). Dynamic graphic models (hidden markov models, linear dynamic models). Neural networks. Vector quantization and vector support machines. Bayesian approach to model comparison and hypothesis testing. AIMS AND LEARNING OUTCOMES The aim of this course is to provide students with the essential tools for quantitative analysis of data and signals of interest for medicine and biology. By the end of the course the students will be able to: 1. Design and apply methods of analysis and modelling of data - including temporal data (signals) - of interest for medicine and biology 2. Identify the correct approach (model selection, model identification, data visualization) for a specific data analysis problem 3. Use MATLAB to correctly and effectively visualizing biomedical data and signals and to implement models of them PREREQUISITES There are no formal prerequisites, but the course requires solid foundations in mathematical analysis and linear algebra. TEACHING METHODS The course is organised as a combination of lectures and lab activities. Lectures will focus on theory and methods for data analysis. Lab activities will focus on application to real data analysis problems in the context of bioengineering. In AY 2020-2021 the course will be taught in presence but all lectures and lab activities will be also offered at a distance, both online and offline. SYLLABUS/CONTENT A. Data Analysis and Data visualization: Types of data. Analysis as modeling. Statistical data analysis. The do’s and don’t’s of data visualization B. Probability density estimates: Unsupervised learning. Gaussian model. Principal Component Analysis, Factor Analysis, Independent Component Analysis, Cluster analysis and the EM algorithm. C. Pattern analysis and decision theory: Bayesian decision theory. Bayes classifiers. Logistic classifiers. Generalized linear models. Performance of a classifier: ROC curve. D. Multilayer neural networks: Perceptrons. Multilayer neural networks and the backpropagation algorithm. Design of a neural network model E. Graphical Models: Static data. A general framework for data modeling. The EM algorithm. Regression, decision theory, factor analysis as graphical models. Dynamic graphical models (overview): Temporal data (signals). Discrete vs continuous signals. Hidden Markov Models, Linear dynamical systems as dynamic graphical models F. Generalization: Regularization theory, Vector Quantization. Support Vector Machines. RECOMMENDED READING/BIBLIOGRAPHY Murphy, KP. Machine Learning: A Probabilistic Perspective. MIT Press, 2012. TEACHERS AND EXAM BOARD VITTORIO SANGUINETI Exam Board VITTORIO SANGUINETI (President) MAURO GIACOMINI MARCO MASSIMO FATO (President Substitute) LESSONS LESSONS START Mid-September 2020. Check the course calendar for class hours Class schedule The timetable for this course is available here: Portale EasyAcademy EXAMS EXAM DESCRIPTION Written exam (weight 50%) Project work (individuals or couples, weight 50%) • Solution of a real problem of biomedical data analysis/processing, chosen from a list of proposed projects • Development of software for calculation/analysis/processing • Interactive application (MATLAB Livescript) reporting the results • Fixed deadlines (early February, early July) ASSESSMENT METHODS Project work will be assessed in terms of: 1) Documentation (correctess, clarity, synthesis, terminology): 10 pts 2) Implementation (code structure and organization, efficiency): 10 pts 3) Data Visualization (technical quality of figures, adequacy of display technique, efficacy, clarity): 10 pts 4) Bonus (max 2 pts) if the report provides additional analysis (in addition to those required). Bonus is only awarded if maximum score is obtained in the other three criteria. Exam schedule Data appello Orario Luogo Degree type Note 13/01/2021 09:30 GENOVA Scritto 11/02/2021 09:30 GENOVA Scritto 10/06/2021 09:30 GENOVA Scritto 08/07/2021 09:30 GENOVA Scritto 09/09/2021 09:30 GENOVA Scritto