Quebec’s National Library Launches Initiative to Create Cultural Database for AI Training

Sophie Tremblay, Quebec Affairs Reporter
6 Min Read
⏱️ 4 min read

Quebec’s national library is embarking on an innovative project aimed at establishing a comprehensive database that encapsulates the province’s cultural and governmental content. This initiative seeks to enhance artificial intelligence systems’ comprehension of Quebec’s unique society, diverse cultures, and Indigenous languages. The Bibliothèque et Archives nationales du Québec (BAnQ) has officially entered the experimental phase of this ambitious databank, following an insightful feasibility study completed earlier this year.

Addressing the Data Deficit

The primary motivation behind this project is the recognition that many prominent generative AI systems often lack reliable information about Quebec’s societal fabric, economy, and cultural nuances. The limited availability of Quebec-centric data has hindered these systems from accurately representing the province. Valérie D’Amour, who spearheaded the feasibility study, emphasised the project’s exploratory nature. “All scenarios are a little bit on the table right now,” she remarked, stressing the importance of validating ideas in collaboration with cultural stakeholders and data proprietors.

BAnQ has clarified that the forthcoming platform will not serve as a public repository for creative outputs, ensuring that access to the data remains highly regulated. Marie Grégoire, BAnQ’s president and CEO, stated that the objective is to craft AI systems that authentically reflect Quebec’s rich cultural tapestry. “That means having Quebec references, whether in small models or large models, whether they come from research or from the business community,” she noted.

A Collaborative Approach

This initiative draws inspiration from similar ventures in other nations, such as Sweden, which has successfully compiled extensive collections of Nordic-language texts to aid in the development of generative AI models for Scandinavian languages. BAnQ plans to kick off this project with its own archival collections before branching out to include data from external sources.

The impetus for this undertaking can be traced back to a recommendation made in a 2024 report by Quebec’s innovation council. The report highlighted the significant deficit of data pertaining to Quebec within existing AI training datasets as a critical barrier to accurate representation.

Destiny Tchéhouali, co-holder of a research chair in French-language artificial intelligence and digital technologies at Université du Québec à Montréal, expressed concern over the underrepresentation of Quebec culture in current AI corpora. “We run the risk of reproducing linguistic and cultural biases,” he warned. He further highlighted the importance of including Indigenous perspectives in this endeavour to mitigate any potential biases.

Protecting Creators’ Rights

As BAnQ forges ahead with its plans, copyright concerns have surfaced as a pressing issue within the cultural sector. However, Grégoire contended that the proposed database could provide creators with enhanced protections compared to the current landscape. “Right now, it’s a bit like the Wild West,” she remarked, pointing out that data is often harvested indiscriminately.

The envisioned database could serve as a centralised gateway, facilitating fair compensation for creators whose works are utilised. Grégoire believes that by collaborating, cultural organisations can better advocate for creators, thereby ensuring the sustainability of the sector in the long run.

Nonetheless, some artists remain apprehensive about the implications of contributing their work to AI training systems. Maxime Harvey, a postdoctoral researcher at the National Institute of Scientific Research, voiced concerns shared by many in the creative community. “The main criticism we hear in the field is that, even if artists earn income from it, they are still feeding the beast that will eventually replace contracts they may lose because of AI,” he cautioned.

Project Timeline and Funding

The feasibility study outlines an ambitious timeline, with the platform expected to become operational by 2029, although D’Amour indicated that this timeline would be reassessed after the experimental phase concludes. The estimated budget for the next five years is approximately $10.5 million, covering both operational and capital expenditures. To date, BAnQ has secured $340,000 from the Quebec government to support the feasibility study, along with an additional $750,000 earmarked for the project’s initial 12-month experimentation phase.

Project Timeline and Funding

Why it Matters

This initiative is not merely a technological advancement; it represents a significant cultural commitment to preserving and promoting Quebec’s identity in the digital age. By creating a tailored databank, Quebec aims to empower its cultural narratives and ensure that both local and Indigenous voices are accurately represented in AI systems. As we stand on the cusp of an increasingly AI-driven future, initiatives like this are crucial in safeguarding cultural integrity while navigating the complexities of a rapidly evolving landscape.

Share This Article
Deep-dive reporting on Quebec society, politics, and culture.
Leave a Comment

Leave a Reply

Your email address will not be published. Required fields are marked *

© 2026 The Update Desk. All rights reserved.
Terms of Service Privacy Policy