Version Française

CanLink : a linked data project for Canadian theses

Can Link - a linked data project for Canadian theses is a proof of concept project of the Digital Projects Working Group of the Canadian Linked Data Initiative - Digital Projects Working Group of the Canadian Linked Data Initiative. Its goal is to increase the discoverability of Canadian theses and dissertations by leveraging the power of linked data to surface unexpected connections and relationships. Participating institutions are: University of British Columbia, University of Alberta, Library and Archives Canada/Bibliothèque et Archives Canada, Queens University, University of Toronto, McGill University, Université de Montréal, Memorial University of Newfoundland.

Q: Who is involved in the Can Link project and how is it funded?

A: Can Link involves members of the Canadian Linked Data Initiative Digital Projects Working Group (University of British Columbia, University of Alberta, Library and Archives Canada/Bibliothèque et Archives Canada, Queens University, University of Toronto, McGill UniversityUniversité de Montréal, Robert Warren (CWRC/Muninn)). In addition, records have been provided by Memorial University of Newfoundland. The University of Alberta Libraries and the Young Canada Works program are funding a summer student position in 2017 to carry out the bulk of the technical work.

Q: What data is included in the project?

A: The data included are subsets of records for dissertations and theses that have a publicly available digital object. Library and Archives Canada/Bibliothèque et Archives Canada are a subset of those found in the Theses Canada Portal.

Q: Can I contribute records to the project?

A: Yes! The goal of the project was to start with a small set of records so we could refine and document our processes, and then ask others to join in! If you would like to submit records we’d love to hear from you!

Q: Who can I contact for further information?

A: For further information on the project or to get involved, please email Sharon Farnel (sharondotfarnel at ualbertadotca) or Rob Warren (rwarren at muninn-projectdotorg).

Q: What is the long term plan for the project?

A: This initial phase is to pilot and document processes and workflows, as well as potential infrastructure. We are working with the Canadian Linked Data Initiative and other research groups on a long-term, sustainable plan for the project to continue to grow and thrive. More details will be announced as they become finalized.

Q: Can I download the data?

A: Data dumps are available for download at http://canlink.library.ualberta.ca/downloads/, the code is available for download at https://github.com/cldi/CanLink and a full description at https://datahub.io/dataset/can-link.

Q: What can I do with the data?

A: The data is made available under a CC Attribution Non-Commercial license.

Q: What formats is the thesis data available as?

A: The data is available using linked open data principles using both HTTP content negotiation and appending extensions to the URI (.rdf, .nt, .ttl, .json-ld). Additionally, thesis data can be exported to bibtex and ris formats using both HTTP content negotiation and appending extensions (.bibtex and .ris).

Q: Are the thesis full-text available as part of the dataset?

A: Some thesis entries have a frbr:manifestation node that links to a pdf version.

Q: What ontology is used to encode the thesis data?

A: The ontology is composed from a number of other ontologies, including FRBR, the bibliography ontology, schema.org, CWRC Genres, Library of Congress Languages, Library of Congress Subject Headings, Bibliothèque nationale de France Subject Headings, Canadian Subject Headings, W3 Provenance Ontology, Dublin Core Terms, Library of Congress Relators, Friend of a Friend Ontology, and DOAP.

Documentation for the ontology is located here.