Sitemap

A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.

Pages

Posts

education

CSI4107: Information Retrieval and the Internet

Basic principles of Information Retrieval. Indexing methods. Query processing. Linguistic aspects of Information Retrieval. Agents and artificial intelligence approaches to Information Retrieval. Relation of Information Retrieval to the World Wide Web. Search engines. Servers and clients. Browser and server side programming for Information Retrieval.

CSI5180: Topics in Artificial Intelligence

Semantic web technologies (RDF, RDFS, OWL). Ontology and knowledge base development. Data integration and normalization. Ontology matching. Semantic Web access through SPARQL queries. Semantic Web expansion from unstructured data (text), including Named Entity Recognition, Entity Linking and Relation Extraction from textual data. Question Answering over Linked Data. Data availability, redundancy, contextualization and trust.

COMP5900: Advanced Machine Learning

Machine learning (ML) is the scientific study of algorithms and statistical models that computers use in order to perform a specific task effectively without using explicit instructions, relying on patterns and inference instead. It is seen as a subset of artificial intelligence. This course will cover advanced topics in machine learning such as deep learning, transfer learning, multiview learning, clustering and Interpretability of ML methods.

presentations

publications

Towards a Computational Approach to Conceptual Metaphor

Exploring a computational model of Lakoff’s conceptual metaphor theory.

Recommended citation: Lynch, B., Danovitch, J., & Davies, J. (2018). Towards a Computational Approach to Conceptual Metaphor. Poster session at CogSci 2019, Montreal, CA.

ComplexDataLab at W-NUT 2020 Task 2: Detecting Informative COVID-19 Tweets by Attending over Linked Documents

We present Gapformer, which effectively classifies content as informative or not. It reformulates the problem as graph classification, drawing on not only the tweet but connected webpages and entities.

Pelrine, Kellin, et al. ComplexDataLab at W-NUT 2020 Task 2: Detecting Informative COVID-19 Tweets by Attending over Linked Documents.Proceedings of the Sixth Workshop on Noisy User-generated Text (W-NUT 2020). 2020. https://www.aclweb.org/anthology/2020.wnut-1.63/

The Surprising Performance of Simple Baselines for Misinformation Detection

We examine the performance of a broad set of modern transformer-based language models and show that with basic fine-tuning, these models are competitive with and can even significantly outperform recently proposed state-of-the-art methods

Pelrine, Kellin, Jacob Danovitch, and Reihaneh Rabbany. The Surprising Performance of Simple Baselines for Misinformation Detection. arXiv preprint arXiv:2104.06952 (2021). https://arxiv.org/abs/2104.06952

talks