← Back to Home

Architecture Design Planning AI Assistant System

Architecture Design Planning AI Assistant

Author: Hanyang Yin

Role: Product Design / System Architecture / Core Developer

Keywords: LLM · RAG · OCR · PPT Generation · AWS · Docker · AI Agent


Project Overview

This project is an intelligent assistance system designed for the early-stage architectural planning process. By integrating large language models, multi-source knowledge retrieval, automated content generation, and image generation technologies, the system provides end-to-end support from requirement analysis and concept development to final deliverable production.

The system primarily targets architects, architecture students, and planning professionals, aiming to lower the entry barrier for planning tasks while improving efficiency and professional quality.

Core Goal: Enable non-expert users to construct professional-level architectural planning proposals.

System Architecture

System Architecture
Figure 1: Overall system architecture

The system adopts a layered architecture consisting of frontend interaction, middleware orchestration, intelligent engines, and multi-source databases.

Main components include:


🔹 Module 1: Multi-Knowledge-Base Intelligent QA System (RAG-based Chat System)

Description

This module integrates the Gemini large language model with RAG technology to provide high-precision question-answering services specialized for the architectural domain.

Connected knowledge bases include:

System Flow


User Query
↓
Intent Recognition
↓
Knowledge Retrieval
↓
RAG Fusion
↓
LLM Generation
↓
Final Answer

Interface Example

Chat UI
Figure 2: RAG-based QA interface

Technical Highlights

📄 Detailed RAG Module Documentation


🔹 Module 2: Architectural Planning Proposal Generator (PPT Generator)

Description

This module assists users in building complete architectural planning proposals from scratch and outputs editable PPT documents.

Core Concept: Interactive Guidance + Case-Based Reference + Intelligent Generation

Workflow

Process Diagram

PPT Flow
Figure 3: Planning generation interface

Editing Interface

PPT Editor
Figure 4: Online PPT editor

Prompt Design Strategy

📄 Detailed PPT Module Documentation


🔹 Module 3: Architectural Visualization Generation System (Image Generation)

Description

This module integrates multiple image generation APIs to produce architectural visualizations based on user input.

Currently supported:

Future directions:

Example Outputs

Image Generation
Figure 5: Generated architectural visualizations

Key Module: CAD-Based Building Code Intelligent Parsing System (Independent Development)

Background

Traditional building code retrieval relies on manual page searching, resulting in low efficiency and high error rates.

This module enables:

CAD Screenshot → Automatic Code Page Localization → Instant PDF Retrieval

System Flow


PDF Preprocess → OCR Indexing → Page Database
↑
CAD Screenshot → OCR + LLM → Code Extraction
↓
Page Matching
↓
PDF Return

Interface Example

CAD Query
Figure 6: CAD parsing interface

Core Technologies

Engineering Implementation


Role and Contributions

In this project, I primarily focused on product planning and system architecture design, while also developing several core technical modules.

Main Responsibilities

Key Contributions


Technology Stack

Category Technology
LLM Gemini / Claude-opus-4.7
RAG LanceDB / Meilisearch
OCR PaddleOCR
Backend Python / Node.js / FastAPI
Frontend React
Cloud AWS EC2
Deploy Docker
DB MySQL / S3

Technical Challenges and Solutions

1. Unstable RAG Accuracy

Issue: High retrieval noise

Solution: Multi-stage filtering and re-ranking

2. OCR Recognition Errors

Issue: Complex CAD text structures

Solution: OCR + LLM-based correction fusion

3. Incomplete User Requirements

Issue: Missing critical information

Solution: Multi-turn guided dialogue design


Future Work


```