Text mining

Bernardo Magnini
Fondazione Bruno Kessler (FBK)


The information society needs both rapid and sophisticated technologies able to extract knowledge from huge repositories of unstructured data. In this context, there is a growing demand for applications that perform strong text mining processing, including improving web search engines, providing services for human-machine interaction systems, e-learning, information extraction and opinion mining, and much more. The course will give an overview of methodologies and tools in the field of Computational Linguistics, useful for operating actively in this sector starting from a background both in computer science and in human sciences. We will provide an introduction to recent approaches and technologies in text mining, with particular attention on vector-based representations of word meanings and machine learning algorithms.
·      Text mining: research filed and applications
·      Machine learning approach: methodology, task definition, training and test data, evaluation metrics
·      Distributional semantics and embeddings
·      Deep learning for text mining: neural networks and trasformers
·      A case study: sentiment analysis
Introductory readings
Pilehvar M.T. and Camacho-Collados J.: Embeddings in Natural language Processing, Synthesis Lectures on Human Language Technologies, Morgan&Claypool, 2021.

Course requirements 

Exam modality

Course material, enrollment and last-minute notifications
Made available by the teacher at this Moodle address

22 Feb 2022, 10:00-13:00
1 Mar 2022, 10:00-13:00

Rooms 1BC50 at the Dept. of Mathematics, via Trieste 63 Padova

<< Courses in 2021-2022