The Bibliothèque et Archives nationales du Québec (BAnQ) is embarking on an innovative project to create a comprehensive database that encompasses cultural and governmental content, aimed at enhancing the capabilities of artificial intelligence (AI) systems. This initiative aims to ensure that AI reflects the true essence of Quebec society, including its diverse cultures and Indigenous languages. Following a successful feasibility study earlier this year, BAnQ has officially entered the experimental phase of this ambitious project.
Addressing Data Gaps in AI
The necessity for this databank arises from ongoing concerns over the inadequacy of existing AI models in representing Quebec’s unique societal landscape. Valérie D’Amour, who spearheaded the feasibility study, articulated the significance of this venture, stating, “All scenarios are a little bit on the table right now. We have a lot of ideas and we want to validate the possibilities with cultural stakeholders, as well as with data owners and providers, who will be involved in the discussions.”
The project aims to bridge the gap created by the scarcity of Quebec-centric data in AI training sets. Marie Grégoire, BAnQ’s president and chief executive officer, emphasised the importance of integrating local references into AI systems to ensure they accurately reflect the province’s social and cultural dynamics.
Building on Local Foundations
BAnQ plans to commence this initiative with its own archival collections before considering additional data sources. This approach follows a recommendation from Quebec’s innovation council, which highlighted the limited quantity of Quebec data available for AI training. Destiny Tchéhouali, co-holder of a research chair focused on French-language AI, noted that Quebec’s cultural representation is insufficient in current AI models, warning that this could perpetuate linguistic and cultural biases, particularly concerning Indigenous perspectives.

Tchéhouali described the proposed database as “strategic infrastructure” that would provide a framework for identifying, cataloguing, and managing local content within AI systems. This infrastructure aims to ensure that cultural nuances are preserved and represented accurately.
Copyright Concerns and Artist Compensation
While the development of this database presents numerous opportunities, it also raises significant questions surrounding copyright and the protection of creators’ rights. Grégoire acknowledged these concerns, arguing that the platform could ultimately provide better safeguards for creators than the current landscape, which she likened to “the Wild West.” She asserted that the database could function as a centralised hub, allowing for fair compensation to artists whose works would contribute to AI training.
Despite the potential benefits, some artists remain apprehensive about the implications of their contributions to AI. Maxime Harvey, a postdoctoral researcher and member of Tchéhouali’s research team, expressed concerns that artists might inadvertently undermine their own job security by feeding material into AI systems that could eventually replace them.
Funding and Future Prospects
The feasibility study outlines a budget of nearly $10.5 million over five years to support the initiative through to 2030. So far, BAnQ has secured $340,000 from the Quebec government for the feasibility study and an additional $750,000 to kick-start the experimental phase. While the platform is projected to be operational by 2029, D’Amour noted that the timeline would be reassessed based on the outcomes of the initial experimental phase.

Why it Matters
This initiative from BAnQ is a pivotal step towards ensuring that Quebec’s rich cultural tapestry is accurately represented in the digital age. By creating a dedicated database for local content, the project not only aims to enhance the performance of AI systems but also seeks to empower creators by offering them greater control and compensation over their works. As AI increasingly influences various aspects of life, ensuring that it reflects the diverse voices and cultures of Quebec is vital for fostering inclusivity and authenticity in the digital realm.