Distributed Information Systems :
This course introduces in detail several key technologies underlying today’s distributed information systems, including Web data management, information retrieval and data mining.
Course contents :
Web Information Management: Semi-structured data – graph data model, web ontologies, schema integration.
Information Search: Web search – vector space retrieval, inverted files, advanced retrieval models, word embeddings, web search.
Big Data Analytics: Data mining – associations rules, clustering, classification, model selection; Crowd-sourcing; Recommender systems – collaborative filtering and content-based recommendation.