Niveau : M.Sc.A.

Domaines de recherche : Automatisation, Intégration technologique, Intelligence artificielle.

AI-based Data Retrieval System from Construction Documentation

Abstract

This research designs and validates an AI-based search and data retrieval system for construction documents (contracts, technical specifications, standards). Construction documentation is lengthy, heterogeneous, discipline-specific, and often multilingual, so traditional keyword search fails when users cannot guess the exact phrasing. At the same time, standalone generative AI is unreliable for professional settings due to confidentiality needs and the risk of ungrounded answers. The project develops a construction-tailored retrieval-augmented generation (RAG) pipeline: building a representative, compliant corpus; preparing documents with structure-aware chunking and metadata for filtering, provenance, and traceability; and using hybrid retrieval that combines lexical and semantic methods. Answering is strictly evidence-grounded, with mandatory citations, uncertainty detection, and abstention when evidence is insufficient. The system will be implemented under realistic deployment constraints (access control and incremental updates) and evaluated with IR metrics (Recall@k, Precision@k, MRR, nDCG), plus faithfulness, citation relevance, abstention quality, latency, and indexing efficiency in industrial case studies.

Domaines de recherche : Automatisation, Intégration technologique, Intelligence artificielle.

Mots-clés : Automatisation, Gestion de l’information, Intégration des données, Multidisciplinarité, Traitement du langage naturel.

Résultats du projet

This research is expected to deliver a validated, deployable AI-based search and data retrieval system that enables natural-language querying over heterogeneous construction documents and returns evidence-grounded answers with explicit citations. The main expected results are: 1. Retrieval performance improvements: Compared to baseline keyword search, the system should demonstrate higher recall and ranking quality on representative construction queries, measured through standard information retrieval metrics (Recall@k, Precision@k, MRR, nDCG@k). Gains are expected particularly for queries where users lack exact terminology or where relevant information is distributed across long, structured documents. 2. Reliable, auditable answering: The proposed RAG pipeline should produce answers that are demonstrably faithful to retrieved evidence, with mandatory citations pointing to exact document locations (e.g., section/clause/table references). Answer quality will be assessed via accuracy, faithfulness, and citation relevance; the system should minimize unsupported claims by enforcing strict grounding. 3. Uncertainty and abstention behavior: The system should reduce harmful or misleading outputs by detecting insufficient evidence and abstaining or requesting clarification. Performance will be quantified through abstention precision/recall and expert review of borderline cases, supporting safer professional usage. 4. Operational readiness under real constraints: The implementation is expected to include confidentiality-aware ingestion, access control mechanisms, and automated incremental updates to support evolving project documentation. Operational results will include acceptable latency for interactive use, efficient indexing and update times, and robust logging for traceability and governance. 5. Industrial validation: Through case studies with corporate partners, the system is expected to demonstrate measurable productivity gains (reduced search time, fewer missed requirements, improved compliance checks) and increased user trust due to transparent citations and consistent behavior. Overall, the project aims to show that construction-specific RAG can outperform traditional search while remaining safer and more trustworthy than unconstrained generative AI.

Contributions du projet

Academically, this work will define and test a structure-aware retrieval pipeline for construction documents, comparing chunking, embeddings, and hybrid retrieval choices, and measuring their impact on retrieval quality, citation correctness, and abstention reliability. Industrially, it will deliver a practical, secure search-and-answer system that fits real project constraints: confidential data handling, role-based access control, and automatic incremental updates when new document versions arrive. Users will be able to ask questions in natural language and receive short, evidence-based answers with clear citations to the exact section/clause (and table rows when relevant). When evidence is missing or ambiguous, the system will refuse or ask for clarification instead of guessing. Validation with corporate case studies is expected to show less time spent searching, fewer missed requirements, stronger compliance checks, and improved trust because results are auditable and repeatable.

Design for adaptability to contribute to the circular economy in construction

Baienat, S., Iordanova, I., Helal, B., & Midoune, N. (2026). Design for adaptability to contribute to the circular economy in construction. Engineering, Construction and Architectural Management, 1-24.

BIM-Based Live Sensor Data Visualization Using Virtual Reality for Monitoring Indoor Conditions

Worawan, N. and Motamedi, A. (2019). BIM-Based Live Sensor Data Visualization Using Virtual Reality for Monitoring Indoor Conditions, 24th Annual Conference of the Association for Computer-Aided Architectural Design Research in Asia (CAADRIA 2019), vol 2, pp. 191-200, Wellington, New Zealand.

Live Data Visualization of IoT Sensors Using Augmented Reality (AR) and BIM

Worawan, N. and Motamedi, A. (2019). Live Data Visualization of IoT Sensors Using Augmented Reality (AR) and BIM, 36th International Symposium on Automation and Robotics in Construction (ISARC), Banf, Canada.

Process Re-engineering in Owner Organizations to Improve BIM-based Project Delivery Using Requirements Management Platform

Motamedi, A., Vaudou, S., Leygonie, R., Forgues, D. (2019). Process Re-engineering in Owner Organizations to Improve BIM-based Project Delivery Using Requirements Management Platform, 4th International Conference on Civil and Building Engineering Informatics (ICCBEI), Sendai, Japan, pp. 227-234, ISBN978-4-600-00276-3

Design and Implementation of Procedures and Automated Tools for FM-BIM Quality Management

Leygonie R., Motamedi A. and Iordanova I. (2020). Design and Implementation of Procedures and Automated Tools for FM-BIM Quality Management, CSCE2020.

Mask R-CNN Deep Learning-based Approach to Detect Construction Machinery on Jobsites

Raoofi H. and Motamedi A. (2020). Mask R-CNN Deep Learning-based Approach to Detect Construction Machinery on Jobsites, 37th International Symposium on Automation and Robotics in Construction (ISARC2020), Kitakyshu, Japan, pp. 1122-1127. (ISBN 978-952-94-3634-7) https://doi.org/10.22260/ISARC2020/0154

Équipe de recherche

L’équipe chargée de ce projet :

Équipe

L’équipe chargée de ce projet

Partenaires

Ce projet a été supporté par :

Recherches similaires

Explorez plus en profondeur notre recherche en explorant ces études et ressources connexes :

Vers un approvisionnement vert au Canada: articulation entre les devis de performance et les passeports numériques de produits pour la décarbonation du secteur de la construction.

Cycle de vie, Intégration des données, Durabilité, Mesures de performance

Niveau : Ph.D.

Année de publication : 2026

Développement d'un cadre pour la mise en œuvre de la réalité réduite basée sur le BIM dans l'industrie AEC-FM

BIM, Intégration des données, Jumeaux numériques, Vision par ordinateur

Niveau : Ph.D.

Année de publication : 2023

Faire le lien entre la numérisation et la cocréation de valeur dans les projets de transport

BIM, Collaboration, Gestion de l’information, Transformation numérique

Niveau : Ph.D.

Année de publication : 2025

Les applications potentielles du Traitement du Langage Naturel (NLP) dans l’industrie de la construction

Intégration des données, Transformation numérique, Vision par ordinateur, Traitement du langage naturel

Niveau : M.Ing.

Année de publication : 2023

Un cadre de vision par ordinateur pour la surveillance des déchets de construction dans les bennes statiques

Automatisation, Vision par ordinateur, Préfabrication

Niveau : M.Sc.A.

Année de publication : 2023

Towards value-driven Asset Management through dynamic Information Management

BIM, Gestion de l’information, Construction Lean, Valeur

Niveau : Ph.D.

Année de publication : 2027

AI-based Data Retrieval System from Construction Documentation

Résultats du projet

Contributions du projet

Équipe de recherche

Équipe

Partenaires

Recherches similaires

Vers un approvisionnement vert au Canada: articulation entre les devis de performance et les passeports numériques de produits pour la décarbonation du secteur de la construction.

Développement d'un cadre pour la mise en œuvre de la réalité réduite basée sur le BIM dans l'industrie AEC-FM

Faire le lien entre la numérisation et la cocréation de valeur dans les projets de transport

Les applications potentielles du Traitement du Langage Naturel (NLP) dans l’industrie de la construction

Un cadre de vision par ordinateur pour la surveillance des déchets de construction dans les bennes statiques

Towards value-driven Asset Management through dynamic Information Management

Le GRIDD fait partie de l'École de Téchnologie Supérieure, une constituante du réseau de I'Université du Québec

Copyright © GRIDD 2025

À propos

Social

Ressources

Le GRIDD fait partie de l'École de Téchnologie Supérieure, une constituante du réseau de I'Université du Québec

Copyright © GRIDD 2023

AI-based Data Retrieval System from Construction Documentation

Résultats du projet

Contributions du projet

Équipe de recherche

Équipe

Partenaires

Recherches similaires

Vers un approvisionnement vert au Canada: articulation entre les devis de performance et les passeports numériques de produits pour la décarbonation du secteur de la construction.

Développement d'un cadre pour la mise en œuvre de la réalité réduite basée sur le BIM dans l'industrie AEC-FM

Faire le lien entre la numérisation et la cocréation de valeur dans les projets de transport

Les applications potentielles du Traitement du Langage Naturel (NLP) dans l’industrie de la construction

Un cadre de vision par ordinateur pour la surveillance des déchets de construction dans les bennes statiques

Towards value-driven Asset Management through dynamic Information Management

Le GRIDD fait partie de l'École de Téchnologie Supérieure, une constituante du réseau de I'Université du Québec

Copyright © GRIDD 2025

À propos

Social

Ressources

Le GRIDD fait partie de l'École de Téchnologie Supérieure, une constituante du réseau de I'Université du QuébecCopyright © GRIDD 2023

Le GRIDD fait partie de l'École de Téchnologie Supérieure, une constituante du réseau de I'Université du Québec

Copyright © GRIDD 2023