Skip navigation.
Semantic Software Lab
Concordia University
Montréal, Canada

Semantic Computing Course

Printer-friendly versionPrinter-friendly versionPDF versionPDF version

The Semantic Computing course (SOEN 6211, formerly SOEN 691B) is offered at Concordia University, providing graduate students with a unique opportunity to study research and development of novel semantic software systems. The course is taught by Prof. René Witte and supported by team members from the Semantic Software Lab. Students from other universities in Québec can register for this course through CREPUQ.

1. Contents

This course provide an introduction to selected topics from Semantic Computing, including:
Open Linked Data CloudOpen Linked Data Cloud

Introduction to Semantic Computing
Introduction to Semantic Computing Paradigms and Applications; Processes, Tools, and Standards for Deriving Intelligence from (User-Generated) Content

Text Mining Systems
Foundations of Language Technology and Language Engineering; Text Mining Systems; Industry Standards; Design and Deployment of Text Mining Solutions; Information Extraction; Case Study: The General Architecture for Text Engineering (GATE)

Tagging and Tag Analysis
Social Tagging vs. Machine-Generated Tags; Folksonomies; Dynamic Navigation and Tag Clouds

Recommender Systems and Collaborative Filtering.
Theory and Implementation of Recommender Engines.

Analyzing the Blogosphere
Blog Tracking; Opinion Mining; Analysis of Microblogs (e.g., Twitter); Standards: RSS, ATOM; Services: Technorati, Bloglines

RDF and Linked Data
RDF; Triplestores; SPARQL; Microformats; GRDDL; Linked Open Data; Case Studies: DBpedia, Freebase, Wikidata, FOAF, GeoNames

Semantic Information Repositories and Information Retrieval
Digital Object Management; Semantic Digital Libraries; Semantic Metadata; Dublin Core; Information Retrieval; Lucene; Solr; FedoraCommons; GATE Mimir

Ontologies and the Semantic Web
Web Ontology Language (OWL); Ontology Editors; Ontology Learning and Ontology Population from Text

Semantic Desktops and Semantic Wikis
Integration of Semantic Technologies into Desktop Environments and Information Systems; Case Studies: Semantic MediaWiki, Semantic Assistants.

Scalable Semantic Systems Engineering
Clustering and Cloud Computing Paradigms; Apache Hadoop and MapReduce; Scalable Web Crawling using Nutch.

The course places a strong focus on the practical aspects of building semantic solutions. Students will have to complete a project on a selected course topic and present their work in class. Additionally, every student (including auditing students) will have to present a research paper during our weekly seminar.

2. Prerequisites

There are no formal prerequisites for this course. However, the course primarily targets students in a research option (PhD, MSc thesis) that have existing background knowledge in topics such as machine learning, natural language processing, intelligent systems, information systems, or the semantic web, for example through one of these courses offered at Concordia:

  • COMP 6321: Machine Learning
  • COMP 6531: Foundations of the Semantic Web
  • COMP 6751: Natural Language Analysis
  • COMP 6781: Statistical Natural Language Processing
  • COMP 6721: Introduction to Artificial Intelligence
  • COMP 6741: Intelligent Systems

Additionally, students taking this course are expected to have good knowledge of an object-oriented programming language (like Java or C++), a scripting language (such as Python or Groovy), as well as general software engineering and information system knowledge. The primary development environment for this course is Linux.

3. Relationship to other courses

A frequently asked question is how this course compares to COMP 6531: Foundations of the Semantic Web, taught by Prof. Haarslev. The two courses are largely complementary and can be taken in parallel; however, it is recommended to first complete SOEN 691B, then COMP 6531: The SOEN 691B course focuses on the technical aspects of several semantic computing techniques, in particular text mining and linked data, whereas COMP 6531 focuses on the formal concepts and design of OWL ontologies, including theoretical foundations in description logics and automated reasoning. Together with the NLP courses offered through the CLaC lab and related courses in intelligent systems and machine learning, Concordia students have an opportunity to develop comprehensive skills in semantic sciences that are unique across Canada.

4. Schedule

Due to Prof. Witte's sabbatical, the course will not be offered in the 2014/2015 academic year.