Quebec’s national library is embarking on an ambitious project to construct a comprehensive database aimed at enriching the cultural and governmental knowledge available to artificial intelligence systems. The Bibliothèque et Archives nationales du Québec (BAnQ) has initiated the experimental phase of this databank, focusing on French and Indigenous languages, following a successful feasibility study conducted earlier this year. This initiative seeks to address the challenges faced by major AI systems, which often lack accurate and reliable data pertaining to Quebec’s unique social and cultural fabric.
Addressing Cultural Gaps in AI
The move comes as a response to growing concerns that leading generative AI platforms struggle to accurately reflect Quebec’s society, economy, and cultural identity due to a scarcity of relevant data. “All scenarios are a little bit on the table right now,” stated Valérie D’Amour, who spearheaded the feasibility study. She emphasised the importance of engaging with cultural stakeholders and data providers to explore potential avenues for collaboration.
BAnQ has clarified that the forthcoming platform will not function as a public outlet for creative works and that strict controls will govern access to the data. Marie Grégoire, the president and CEO of BAnQ, articulated the project’s mission: “Our aim is to ensure that AI systems more accurately represent Quebec society and culture.”
Learning from Global Initiatives
Similar efforts are already underway in other parts of the world. For instance, Sweden has taken steps to compile large collections of Nordic-language texts to aid the development of generative AI models for Scandinavian languages. BAnQ intends to begin its initiative with its own archival collections before looking to incorporate data from external sources.

This initiative is rooted in a recommendation from a 2024 report by Quebec’s innovation council, which highlighted the inadequacy of available data on Quebec in existing AI training datasets. Destiny Tchéhouali, a co-holder of a research chair focused on French-language AI and digital technologies, pointed out that Quebec’s cultural contributions are often overshadowed in the current AI landscape. He warned of the inherent risks of perpetuating linguistic and cultural biases, particularly concerning Indigenous peoples.
Ensuring Creator Protection
As BAnQ develops its proposed database, concerns about copyright and creator rights remain a significant issue within the cultural sector. However, Grégoire believes that this new platform could offer enhanced protection for creators compared to the existing landscape. “Right now, it’s a bit like the Wild West,” she remarked. “Data is being harvested for free, and that should not be the case.” The proposed database aims to serve as a central hub, facilitating fair compensation for creators whose works are utilised in AI training.
Despite these assurances, some artists express apprehension about the potential consequences of contributing their work to AI systems. Maxime Harvey, a postdoctoral researcher at the National Institute of Scientific Research, echoed these concerns, noting that even if artists receive compensation, there is a fear that their contributions could eventually undermine their livelihoods by replacing traditional contracts with AI-generated solutions.
Project Timeline and Funding
According to the feasibility study, the platform is projected to become operational by 2029, although D’Amour indicated that this timeline would be reassessed following the experimental phase. The study outlines a budget of nearly $10.5 million over five years, covering both operational and capital costs. To support the feasibility study, BAnQ has secured $340,000 from the Quebec government, along with an additional $750,000 for the experimental phase slated to last 12 months.

Why it Matters
This initiative stands to reshape the relationship between cultural identity and technology in Quebec, offering a vital opportunity to enhance the representation of local voices within AI systems. As the digital landscape continues to evolve, ensuring that AI reflects the rich tapestry of Quebec’s society, including its languages and cultures, is crucial. By taking these steps, Quebec could set a precedent for other regions grappling with similar challenges, ultimately fostering a more inclusive and equitable technological future.