Quebec’s National Library Launches Initiative to Enhance AI Understanding of Local Culture and Languages

Sophie Tremblay, Quebec Affairs Reporter
6 Min Read
⏱️ 4 min read

The Bibliothèque et Archives nationales du Québec (BAnQ) is embarking on an ambitious project to create a comprehensive database aimed at enriching the training of artificial intelligence systems with content reflecting Quebec’s unique society, culture, and Indigenous languages. Following the completion of a feasibility study earlier this year, BAnQ has entered the experimental phase of this initiative, which seeks to address the prevailing gap in reliable Quebec-specific data used by generative AI technologies.

Bridging the Data Gap

Currently, major AI systems often fall short in accurately representing Quebec’s cultural and social landscape, largely due to a scarcity of relevant data. “All scenarios are a little bit on the table right now,” explained Valérie D’Amour, the lead on the feasibility study. “We have a lot of ideas and we want to validate the possibilities with cultural stakeholders, as well as with data owners and providers, who will be involved in the discussions.”

This proactive approach aims not only to improve AI outputs but also to ensure a more nuanced understanding of Quebec’s diverse population, including the Indigenous communities that have historically been underrepresented in digital datasets.

Controlled Access and Cultural Representation

BAnQ has clarified that the new platform will not serve as a public distribution channel for creative works; access to data will be meticulously regulated. Marie Grégoire, BAnQ’s president and CEO, stated, “The goal is to ensure that AI systems better reflect Quebec society and culture.” She further emphasised the importance of including diverse references from both research and business sectors, whether in small or large AI models.

Controlled Access and Cultural Representation

Similar initiatives have gained traction in other regions, such as Sweden, where extensive collections of Nordic-language texts are being compiled to support the development of AI models. BAnQ intends to start with its own existing collections before expanding to other data sources.

Addressing Cultural Biases

The impetus for this initiative stems from a 2024 report by Quebec’s innovation council, which highlighted the “very small quantity of data on Quebec” available in existing AI training datasets. Destiny Tchéhouali, co-holder of a Quebec-based research chair focusing on French-language AI and digital technologies, voiced concerns about the cultural biases perpetuated by current AI models. He noted that the underrepresentation of Quebec culture in the AI landscape risks reinforcing linguistic and cultural stereotypes, particularly regarding Indigenous peoples.

Tchéhouali argues that the proposed database could serve as “strategic infrastructure,” establishing guidelines for the identification, cataloguing, and tracking of local content within today’s AI systems.

Protecting Creators in the Digital Age

As BAnQ develops this database, copyright concerns have surfaced in the cultural sector. However, Grégoire argues that the proposed platform could ultimately offer greater protection for creators than the current landscape, which she likened to the “Wild West.” The existing environment allows for data to be harvested without compensation, a practice she believes should change.

Protecting Creators in the Digital Age

The new database could act as a centralised gateway for compensating creators whose works are utilised, ensuring that the cultural sector remains sustainable for the long term. Nonetheless, some artists remain apprehensive about contributing to AI training systems, fearing that their work may facilitate the very technologies that could undermine their livelihoods. “The main criticism we hear in the field is that, even if artists earn income from it, they are still feeding the beast that will eventually be used to replace contracts they may lose because of AI,” remarked Maxime Harvey, a postdoctoral researcher involved in the discussion.

The feasibility study anticipates that the platform could be operational by 2029, although D’Amour indicated that this timeline is subject to reassessment as the experimental phase unfolds. The projected budget for the initiative is approximately $10.5 million over five years, which includes both operational and capital costs. BAnQ has already secured $340,000 from the Quebec government for the feasibility study, along with an additional $750,000 to support its 12-month experimental phase.

Why it Matters

This initiative is more than a technological upgrade; it is a critical step towards ensuring that Quebec’s diverse cultural narratives are preserved and adequately represented in the digital era. By creating a database that prioritises local content, BAnQ is not only addressing the data gap in AI but also empowering creators and safeguarding their rights. In an age where digital narratives shape perceptions and realities, this project has the potential to redefine how Quebec’s rich heritage is understood and celebrated in the global context.

Share This Article
Deep-dive reporting on Quebec society, politics, and culture.
Leave a Comment

Leave a Reply

Your email address will not be published. Required fields are marked *

© 2026 The Update Desk. All rights reserved.
Terms of Service Privacy Policy