Quebec’s Bibliothèque et Archives nationales du Québec (BAnQ) is embarking on an ambitious project to develop a comprehensive database aimed at enriching artificial intelligence systems with culturally relevant content. This initiative, which includes content in French and Indigenous languages, follows a feasibility study completed earlier this year and seeks to address the notable gap in AI training data pertaining to Quebec’s unique societal and cultural landscape.
Addressing the Knowledge Gap
The experimental phase of this cultural databank is designed to tackle the challenges posed by existing generative AI systems, which often lack accurate information about Quebec’s society, economy, and cultural nuances. Valérie D’Amour, who spearheaded the feasibility study, expressed optimism about the project’s potential, stating that “all scenarios are a little bit on the table right now.” She emphasised the importance of collaborating with cultural stakeholders and data owners to validate various possibilities.
Marie Grégoire, president and CEO of BAnQ, affirmed the project’s mission to ensure that AI technologies better represent Quebec’s multifaceted society. “That means having Quebec references, whether in small models or large models, whether they come from research or from the business community,” she explained.
A Strategic Infrastructure for Local Content
The proposed platform is not intended as a public distribution channel for creative works; rather, it will maintain strict control over data access. BAnQ plans to initiate the database with its own collections before potentially incorporating data from additional sources. This approach aligns with recommendations from Quebec’s innovation council, which highlighted the scarcity of Quebec-related data in AI training datasets as a critical issue.

Destiny Tchéhouali, a co-holder of a research chair focused on French-language AI at the Université du Québec à Montréal, noted the underrepresentation of Quebec culture in the current AI landscape. He warned that this lack of representation could perpetuate linguistic and cultural biases, particularly concerning Indigenous peoples. “We run the risk of reproducing these biases,” he cautioned. Tchéhouali believes that the proposed databank could serve as “strategic infrastructure,” establishing guidelines for the identification and cataloguing of local content within AI systems.
Navigating Copyright Concerns
As the project develops, copyright issues remain a significant concern for the cultural sector. However, Grégoire argued that the new database could offer creators enhanced protection compared to the existing framework. “Right now, it’s a bit like the Wild West,” she remarked, indicating that data is often harvested without compensation. She envisions the database as a centralised platform that could streamline the process of compensating creators whose works are utilised.
Despite these assurances, some artists express apprehension that their contributions to AI training could jeopardise their livelihoods. Maxime Harvey, a postdoctoral researcher at the National Institute of Scientific Research, articulated a common concern: “Even if artists earn income from it, they are still feeding the beast that will eventually be used to replace contracts they may lose because of AI.”
Future Prospects
The feasibility study outlines a timeline for the platform to become operational by 2029, although D’Amour noted that the schedule would be reassessed after the experimental phase. The projected budget for the next five years stands at approximately $10.5 million, covering both operating and capital expenses. To date, BAnQ has received $340,000 from the Quebec government for the feasibility study and an additional $750,000 to support the 12-month experimentation phase.

Why it Matters
This initiative holds significant implications for the cultural landscape of Quebec and the broader domain of AI technology. By creating a databank that accurately reflects Quebec’s diverse society and languages, BAnQ is not only safeguarding cultural heritage but also ensuring that future AI systems can operate with greater understanding and respect for local contexts. In an age where AI increasingly influences various sectors, this project could pave the way for more inclusive and representative technological advancements, ultimately enriching the cultural fabric of Quebec and beyond.