Quebec’s National Library to Launch Cultural Database for AI Development

Sophie Tremblay, Quebec Affairs Reporter
5 Min Read
⏱️ 4 min read

The Bibliothèque et Archives nationales du Québec (BAnQ) is set to embark on an ambitious project aimed at establishing a comprehensive database of cultural and governmental information. This initiative seeks to enhance the training of artificial intelligence systems, thereby fostering a deeper understanding of Quebec’s diverse society, rich culture, and Indigenous languages. Following a successful feasibility study earlier this year, BAnQ has entered the experimental phase of this culturally significant databank, which will primarily feature content in French and Indigenous languages.

Addressing Data Shortages

A growing concern among experts is that mainstream generative AI systems often struggle to accurately represent Quebec’s unique societal and cultural landscape due to a scarcity of relevant data. Valérie D’Amour, who led the feasibility study, emphasised that “all scenarios are a little bit on the table right now”, indicating the openness to explore various possibilities in collaboration with cultural stakeholders and data providers.

BAnQ aims to create a platform that will not function as a public repository for creative works; rather, it will have controlled access to ensure the integrity and protection of the data involved. Marie Grégoire, the president and CEO of BAnQ, articulated the objective: to ensure that AI systems accurately reflect the nuances of Quebec society. “That means including Quebec references, whether they originate from research or the business community,” she stated.

Learning from Global Examples

Similar initiatives have emerged globally, notably in Sweden, where extensive collections of Nordic-language texts have been compiled to aid the development of generative AI models for Scandinavian languages. BAnQ plans to initially focus on its own collections before expanding to include data from external sources, addressing a recommendation made in a 2024 report by Quebec’s innovation council. The report pointed out the “very small quantity of data on Quebec” available in existing AI training datasets as a significant barrier to effective representation.

Learning from Global Examples

Destiny Tchéhouali, co-holder of a research chair dedicated to French-language AI and digital technologies, highlighted that Quebec’s cultural presence in global AI datasets is disproportionately low. “We run the risk of reproducing linguistic and cultural biases,” he warned. This proposed database represents “strategic infrastructure” that could help standardise how local content is identified and catalogued within AI systems.

As BAnQ moves forward with this innovative database, copyright issues are emerging as a pivotal topic of discussion within the cultural sector. Grégoire believes that the platform could offer enhanced protection for creators compared to the current landscape, which she described as resembling “the Wild West” of data collection. “Data is being harvested for free, and that should not be the case,” she remarked, suggesting that the database could serve as a central hub for ensuring creators are fairly compensated for their contributions.

Despite the potential benefits, some artists remain apprehensive about the implications of sharing their work for AI training. Maxime Harvey, a postdoctoral researcher at the National Institute of Scientific Research, noted the prevalent concern that artists might inadvertently contribute to a system that could threaten their livelihoods. “Even if artists earn income from it, they are still feeding the beast that could ultimately replace contracts they may lose because of AI,” he cautioned.

A Future Timeline

The feasibility study envisions a timeline for the platform’s operational launch by 2029, although D’Amour indicated that this schedule will be reassessed following the experimental phase. The projected budget for the initiative spans five years, totalling nearly $10.5 million, covering both operational and capital expenses. BAnQ has already secured $340,000 from the Quebec government for the feasibility study and an additional $750,000 to support the forthcoming twelve-month experimentation phase.

A Future Timeline

Why it Matters

This initiative is not merely a technological endeavour; it represents a crucial step towards preserving and promoting Quebec’s cultural identity in an increasingly digital world. By developing a dedicated database that prioritises local content and Indigenous languages, BAnQ is positioning itself as a leader in the ethical integration of artificial intelligence into society. This project holds the potential to ensure that Quebec’s rich heritage is not only recognised but also celebrated within the evolving landscape of AI, safeguarding its uniqueness for future generations.

Share This Article
Deep-dive reporting on Quebec society, politics, and culture.
Leave a Comment

Leave a Reply

Your email address will not be published. Required fields are marked *

© 2026 The Update Desk. All rights reserved.
Terms of Service Privacy Policy