Archie: Transforming AEC Compliance Workflows with Multimodal Graph-RAG
Technology
10
min read
May 19, 2025
In today's hyper-connected world, businesses across all sectors are grappling with an ever-increasing deluge of information. We're creating data at an unprecedented rate, but our ability to effectively manage, access, and utilize this knowledge often lags behind. Consider this: studies show that, on average, employees can spend up to 30% of their work time simply searching for information. Worse still, a staggering 35-50% of a company's information is often not centrally indexed, making it virtually invisible when needed. For large enterprises, this inefficiency translates to an average spend of $2.5 million per year just on locating and retrieving information.
This isn't just about wasted time; it's about missed opportunities, stifled innovation, and the cognitive overload placed on valuable employees. The traditional ways of managing knowledge – scattered documents, siloed databases, and reliance on individual memory – are clearly no longer fit for purpose.
Particularities of the AEC industry
If this information challenge is significant for general businesses, it's amplified tenfold in the Architecture, Engineering, and Construction (AEC) industry. AEC professionals navigate a uniquely complex information landscape characterized by:
Fragmented sources of knowledge: Information is often spread across countless documents, emails, and proprietary systems.
Non-machine-readable formats: Critical data is locked away in PDFs, scans, and various proprietary software formats.
Inefficient search experiences: Keyword searches often return a flood of irrelevant documents, requiring hours of manual sifting.
Prevalence of multi-modal knowledge: Significant information is conveyed through visual mediums like tables, diagrams, technical drawings, photographs, and even video, which traditional text-based systems struggle to interpret, index, and integrate.
Cognitive overload: The sheer volume and complexity of information can be overwhelming.
Reliance on personal knowledge transfer: Critical expertise often resides with individuals, creating bottlenecks and risks if that person leaves.
Commercial standards licensing: Access to crucial standards can be costly and impede easy collaboration across teams.
This confluence of factors means that AEC projects are constantly battling against the friction of inefficient information access. It's a daily struggle that impacts timelines, budgets, and the mental bandwidth of skilled professionals, diverting their focus from core design and problem-solving tasks to the frustrating hunt for the right piece of data.
One of the most critical and information-intensive areas in AEC is compliance. Professionals must adhere to a vast array of:
Building codes
Policies & regulations
Zoning laws
Environmental Health & Safety (EHS) standards
EPDs & Sustainability requirements
Industry Standards
Company-specific standards
Take the Australian National Construction Code (NCC) as an example. It comprises 3 volumes, spanning over 832,000 words and 2,116 pages. Manually navigating this behemoth to find a specific clause or understand an intricate requirement is a Herculean task, prone to error and consuming invaluable project time. Similar complexities exist with building codes globally. The consequences of misinterpretation or oversight can be severe, leading to costly rework, project delays, legal liabilities, and compromised safety.
AI Knowledge Management & RAG to the Rescue
This is where AI Knowledge Management (AI KM) steps in as a transformative solution. It leverages artificial intelligence to efficiently capture, process, organize, and retrieve information. It's about moving beyond simple keyword searches to systems that understand context and deliver precise answers, not just lists of documents.
A key technology powering many modern AI KM systems is Retrieval-Augmented Generation (RAG). In simple terms, RAG combines the strengths of large language models (LLMs) with information retrieval. When a user asks a question, the RAG system first retrieves relevant information from a trusted, curated knowledge base (like the NCC). Then, it uses the LLM to generate a coherent, context-aware answer based solely on that retrieved information. This grounding in factual source material is crucial, especially in high-stakes fields like AEC compliance.
At AECFoundry, we recognized the acute need for a specialized AI KM solution in the AEC sector. That's why we developed Archie - an AI-powered chat assistant designed to make navigating complex building codes and compliance documents effortless.
With Archie, AEC professionals can ask questions in natural language, just like they would to a human expert. For example, instead of manually searching through hundreds of pages, a user can simply ask: "Maximum slope of access ramps in schools" Archie, using the knowledge from the relevant codes (like the Australian NCC, UK Building Regulations, or Italian NTC), will swiftly provide a precise answer.

Archie doesn't just give you an answer; it provides direct references to the source material within the code, such as "NCC 2022 Volume One - Building Code of Australia". This transparency is vital. For RAG systems like Archie, the ability to cite sources and allow users to verify the information is paramount for building trust and ensuring reliability. With a single click on a reference, users are taken directly to the exact page and section within the official document (e.g., "Section D Access and egress" in the NCC), allowing them to see the context and confirm the information for themselves. This traceability is a cornerstone of Archie's design, ensuring professionals can confidently rely on the outputs for their critical work.
Archie Under the Hood: The Power of Graph-RAG
What makes Archie particularly powerful is its Graph-RAG approach. Let's break down how it works, referencing the core concepts from our system architecture:

Data Ingestion & Knowledge Graph Creation:
We start with Compliance Knowledge – the actual documents like building codes.
Using Natural Language Processing (NLP) and LLM-based techniques, we perform Knowledge Graph Construction. This is where the magic happens. Instead of just storing text, we identify key entities (like "fire resistance level," "stairway," "riser height"), relative code sections, and the relationships between them.
Multimodal Embeddings convert text, tables and parts of images into numerical representations (vectors) that capture their semantic meaning. It is extremely important to capture the knowledge across multiple modalities because in building codes the crucial information is frequently stored in tables and diagrams. This is a crucial detail since standard text-based RAG misses visual information stored in architectural and engineering drawings.
The result is a rich Knowledge Graph – a network of interconnected nodes (concepts) and edges (relationships), complete with their multimodal embedding vectors.
Querying, Retrieval & Response Generation:
A User submits a Query, which can be text-based (like "What are the fire resistance level requirements for a Type A construction in a multi-story residential building according to the NCC?") or, in future enhancements, involve images (like a floorplan).
The query undergoes Language Processing & Understanding (or Image Processing for visual queries) where its embedding is created.
The system then intelligently searches the Knowledge Graph to find the most Relevant Sub-graphs – portions of the graph that contain the information needed to answer the query. This is far more sophisticated than simple keyword matching and standard vector search; it leverages vector similarity of embeddings and the graph's structure at the same time.
If the initial query isn't precise enough, or if it's an image query, Query Reformulation might occur to optimize the search.
Finally, the retrieved, highly relevant information from the graph is fed as Context to a Large Language Model, which generates a clear, concise Response for the user.
While standard RAG applications are powerful, using a Knowledge Graph as the retrieval backbone offers significant advantages:
Deeper Contextual Understanding: Graphs explicitly map relationships between concepts, allowing the AI to understand nuances and context far better than systems relying solely on semantic similarity of isolated text chunks. It is especially useful for AEC since useful parts about particular compliance checks can be scattered among different sections and even documents.
Enhanced Precision: By navigating these relationships, Archie can pinpoint highly specific information, reducing ambiguity and delivering more accurate answers.
Improved Explainability: It's easier to trace why a particular piece of information was deemed relevant by following the connections in the graph.
Scalability and Maintainability: Knowledge graphs can be more easily updated and expanded as codes change or new documents are added, ensuring the knowledge base remains current.
The Value Archie Brings Today and the Glimpse into Tomorrow
Using an AI KM tool like Archie brings immediate benefits:
Drastic Time Savings: Get accurate answers in seconds, not hours or days.
Improved Accuracy & Compliance: Reduce the risk of misinterpreting complex codes, leading to safer, more compliant designs.
Enhanced Accessibility: Democratize access to complex knowledge, empowering every team member.
Consistent Information: Ensure everyone is working from the same, up-to-date understanding of compliance documents.
To better understand how professionals are leveraging Archie, we've analyzed the topics of user interactions, where each chat session is classified by its primary subject. The accompanying chart, "Frequency Plot of LLM Classifications," visualizes these findings. As you can see, topics like "Fire" safety regulations, "Stair design," and "Ramps" are frequently queried, reflecting core compliance concerns. The "Others" category, while prominent, simply represents the long tail of many diverse, individual topics that don't fall into the more common groupings, underscoring the wide range of specific challenges users face. Analyzing this kind of usage data is incredibly valuable for refining and enhancing RAG applications like Archie. The sheer diversity of topics, from "Sanitary facilities" and "Energy efficiency" to "Structural engineering" and "Ventilation," clearly demonstrates the broad spectrum of information needs within the AEC industry. This varied range of inquiries illustrates the utility of AI KM systems in addressing information requests across a multitude of distinct professional domains.

Further insights into user engagement can be drawn from the "Average Messages per Chat by Month" graph. This chart tracks the typical length of a conversation users have with Archie. Notably, the average number of messages per chat consistently hovers around or above four messages. This consistent level of interaction is a strong indicator of user engagement. It suggests that users are not just asking a single question and leaving; rather, they are often delving deeper, asking follow-up questions, or refining their initial queries, indicating that they find value in continuing the dialogue with Archie beyond the first response. Furthermore, this consistency around four messages per chat could also indicate that this is approximately the interaction length users often need to fully explore a topic or obtain the specific level of detail they require, potentially using follow-up messages to clarify points or request more in-depth information on particular aspects of the initial response.

But we're not stopping here. Our roadmap for Archie includes:
Expanding Our Building Codes Library: Continuously adding more national and international codes based on demand, ensuring global applicability.
Company-Specific Knowledge Integration: Enabling companies to securely integrate their own internal standards, project documents, and even proprietary design guides into Archie, creating a truly customized knowledge hub.
Advanced Image Recognition for Compliance: Imagine uploading a technical drawing, and Archie automatically checks it against relevant building codes for compliance. This is where our multimodal capabilities, including understanding images of floorplans and technical drawings, will truly shine, proactively flagging potential issues before they become costly problems. Moreover, we have already built a prototype:
The Future Will Be Intelligently Managed
The AEC industry stands on the cusp of a knowledge revolution. The days of wrestling with unwieldy documents and relying on fragmented information are numbered. AI Knowledge Management, particularly sophisticated approaches like Graph-RAG, offers a clear path towards greater efficiency, accuracy, and innovation.
Archie is at the forefront of this transformation, specifically built to address the critical compliance challenges faced by AEC professionals. By turning complex codes into an interactive, intelligent resource, we're empowering you to spend less time searching and more time designing and building the future.
Ready to experience the power of AI-driven compliance?