CODE 41601 ACADEMIC YEAR 2024/2025 CREDITS 9 cfu anno 1 ECONOMICS AND DATA SCIENCE 11267 (LM-56) - GENOVA SCIENTIFIC DISCIPLINARY SECTOR SECS-S/01 LANGUAGE English TEACHING LOCATION GENOVA SEMESTER 2° Semester PREREQUISITES Propedeuticità in ingresso Per sostenere l'esame di questo insegnamento è necessario aver sostenuto i seguenti esami: ECONOMICS AND DATA SCIENCE 11267 (coorte 2024/2025) SOFTWARE R 106839 2024 Propedeuticità in uscita Questo insegnamento è propedeutico per gli insegnamenti: Economics and Financial Institutions 8700 (coorte 2024/2025) FINANCIAL ECONOMETRICS 85554 TEACHING MATERIALS AULAWEB OVERVIEW The course aims at providing a thorough account of classical and modern statistical inference at an intermediate level, focusing both on the classical likelihood method and on simulation techniques. Then, the theory and applications of regression and classification models will be introduced, with special attention to socio-economic applications. AIMS AND CONTENT LEARNING OUTCOMES The course aims to provide a precise overview of statistical inference at an intermediate level. The first part concerns classical mathematical statistics, based on likelihood, with some hints on techniques based on simulation. The second part focuses on the main regression and classification techniques. The topics covered and the examples presented will allow students to autonomously. deal with data analysis problems in different contexts and to understand the fundamentals of statistical models currently used in economic applications. AIMS AND LEARNING OUTCOMES The course is divided into two parts: 1) Some topics in classical inferential Statistics: the main families of univariate and multivariate distributions, the likelihood and its properties, simulation and the bootstrap. 2) Statistical models for classification and regression. Multiple regression, generalized linear model theory, logistic regression and regression for count data, nonparametric techniques. Nonparametric techniques. Model selection techniques. All the topics will be accompanied by practical exercises in R, so that the student can also combine the understanding of the theory with the ability to apply correct statistical analyses in real contexts and to read correctly the output of the statistical procedures. Knowledge and understanding: Students will know the main techniques and the main tools for inferential statistics. They must be able to frame these tools in general terms (both theoretical and applied), and to analyze the underlying mathematical and statistical background. Ability to apply knowledge and understanding: Students will be able to identify, when faced with problems from different contexts, the correct analysis. Moreover, they will be able to evaluate the results obtained through statistical software. Making judgments: Students will have to become aware of the potential and limits of the statistical techniques, through the analysis of examples and case studies. Communication skills: Students must be able to use the correct technical statistical language for the communication of the results and for the description of the techniques. Learning skills: Students will develop adequate learning skills in order to continue with further studies about other aspects of the subject and different fields of application than those illustrated. Furthermore, they must also be able to use the R software in a general context. PREREQUISITES The typical skills of the introductory courses in Mathematics and Statistics for Economics and Business. TEACHING METHODS Lectures and computer lab tutorials with R. Discussion of case studies. 24 hours (approx 1/3 of the total) will be held in the computer lab. Students who have valid certification of physical or learning disabilities and who wish to discuss possible accommodations or other circumstances regarding lectures, coursework and exams, should speak both with the instructor and with Professor Serena Scotto (scotto@economia.unige.it), the Department’s disability liaison. SYLLABUS/CONTENT 0. Introduction and basic recalls on estimation and hypothesis testing. 1. Some families of discrete probability distributions. 2. Some families of continuous probability distributions. 3. Multivariate distributions. 4. Likelihood. Maximum likelihood estimation. Information. The exponential family. 5. Simulation and the bootstrap. 6. Multiple linear regression and k-NN techniques. 7. Theory of Generalized linear models. 8. Logistic regression 9. Regression for count data. 10. Model selection and regularization. RECOMMENDED READING/BIBLIOGRAPHY Evans, Rosenthal, Probability and Statistics. The Science of Uncertainty, Second edition. James G, Witten D, Hastie T and Tibshirani R, An Introduction to Statistical Learning. With Applications in R. Springer (available from the Authors’ webpage). Further readings will be taken from: Casella G and Berger RL, Statistical Inference. Duxbury Efron B and Hastie T, Computer Age Statistical Inference. Algorithms, Evidence, and Data Science. Stanford University (available from the Authors’ webpage). Additional course materials will be available on AulaWeb. TEACHERS AND EXAM BOARD FABIO RAPALLO Ricevimento: By appointment arranged by email fabio.rapallo@unige.it Exam Board FABIO RAPALLO (President) CORRADO LAGAZIO MARTA NAI RUSCONE LESSONS LESSONS START This class follows the Department calendar for the 2nd semester. Class schedule The timetable for this course is available here: Portale EasyAcademy EXAMS EXAM DESCRIPTION The exam is a written exam which consists of three parts: 1) a general essay question 2) one or more questions on specific topics 3) a comment on a R output. The complete exam rules will be available on the class Aulaweb page. For attending students, intermediate exams based on R exercises will be organized. Such intermediate exam will replace the complete exam. ASSESSMENT METHODS As far as possible, the questions of the written exam are chosen in order to cover all the topics of the course. The general question aims at assessing the degree of knowledge of the subject and the acquisition of the correct technical language, the specific questions are aimed at assessing the critical ability of the student, while the purpose of the comment to the output is to evaluate the application capabilities. Exam schedule Data appello Orario Luogo Degree type Note 20/12/2024 09:30 GENOVA Scritto 15/01/2025 09:30 GENOVA Scritto 30/01/2025 09:30 GENOVA Scritto 27/05/2025 09:30 GENOVA Scritto 19/06/2025 09:30 GENOVA Scritto 15/07/2025 09:30 GENOVA Scritto 01/09/2025 09:30 GENOVA Scritto Agenda 2030 - Sustainable Development Goals Quality education Partnerships for the goals