Extraction et Inférence de Schéma DB/ERD

VérifiéPrudence

Se connecte aux bases MySQL, MSSQL et Oracle pour extraire les métadonnées de schéma et inférer les relations entre tables via contraintes FK, motifs de nommage (_id, _cd, _code, _no) et correspondance de colonnes. Exporte le schéma inféré en diagrammes ERD DBML ou Mermaid, utile pour la rétro-ingénierie de bases existantes ou la documentation de modèles de données.

Spar Skills Guide Bot
DeveloppementIntermédiaire
5002/06/2026
Claude Code
#database#erd#schema#python#mysql

Recommandé pour

Notre avis

Un worker Python qui extrait les métadonnées d'une base de données, infère les relations entre tables et exporte des diagrammes ERD au format DBML ou Mermaid.

Points forts

  • Supporte MySQL, MSSQL et Oracle via des connecteurs dédiés
  • Inférence de relations basée sur les clés étrangères, les conventions de nommage et les commentaires
  • Export vers DBML et Mermaid pour visualisation immédiate
  • API REST claire avec 7 endpoints bien définis

Limites

  • Seul MySQL a été testé en conditions réelles
  • Pas de tests unitaires ni d'intégration automatisés
  • Les performances sur les grands schémas (200+ tables) restent à valider
Quand l'utiliser

Lorsque vous avez besoin d'analyser rapidement la structure d'une base de données et d'en générer un diagramme ERD pour documentation ou refactoring.

Quand l'éviter

Pour des bases avec des relations sémantiques complexes non exprimées par des clés étrangères ou un nommage standard (ex: dépendances implicites dans des colonnes JSON).

Analyse de sécurité

Prudence
Score qualité80/100

The SKILL.md is a project documentation file describing a backend worker that starts a web service and interacts with databases. While not inherently destructive, automated execution of a server could expose internal resources and database access without proper authentication checks, warranting caution.

Points d'attention
  • Skill describes starting a FastAPI server on port 8000, which could expose internal services if run automatically by an AI agent.
  • Connects to production databases (MySQL, MSSQL, Oracle) using potentially stored credentials, increasing risk if mishandled.
  • No explicit dangerous commands, but running an unhardened server introduces network surfaces.

Exemples

Extract metadata from a MySQL database
Connect to my MySQL database at localhost:3306, schema 'sales', and extract all table metadata including columns, types, comments, and primary keys.
Infer relationships and generate ERD diagram
From the extracted metadata of the 'sales' database, infer all relationships (foreign keys, _id patterns, etc.) and build an ERD graph, then export it as Mermaid code.
Test database connection and export DBML
Test connection to my MySQL database 'inventory', then extract its schema, infer relations, and export the ERD as DBML.

ERDAI DB/ERD Skill (Python Worker)

참조 문서

  • agent/db-connection.md
  • agent/erd-engine.md
  • agent/backend.md

현재 구현 상태

✅ 완료

| 항목 | 파일 | |---|---| | FastAPI 앱 구조 | app/main.py | | 전체 7개 엔드포인트 | app/routers/worker.py | | MySQL 커넥터 | app/services/connectors/mysql_connector.py | | MSSQL 커넥터 (pymssql) | app/services/connectors/mssql_connector.py | | Oracle 커넥터 (oracledb) | app/services/connectors/oracle_connector.py | | 커넥터 팩토리 | app/services/connectors/factory.py | | 메타데이터 추출 서비스 | app/services/metadata_service.py | | 관계 추론 서비스 | app/services/inference_service.py | | DBML/Mermaid export | app/services/export_service.py | | Pydantic 모델 | app/models/connection.py, metadata.py, erd.py | | requirements.txt | pymysql/pymssql/oracledb/pydantic>=2.10.0 |

엔드포인트 목록

GET  /worker/health
POST /worker/test-connection       # TestConnectionRequest → TestConnectionResponse
POST /worker/extract-metadata      # ExtractMetadataRequest → SchemaMetadata
POST /worker/infer-relations       # InferRelationsRequest → list[InferredRelation]
POST /worker/build-erd             # BuildErdRequest → ErdGraph
POST /worker/export/dbml           # BuildErdRequest → PlainText
POST /worker/export/mermaid        # BuildErdRequest → PlainText

관계 추론 로직 (inference_service.py)

  1. 실제 FK — fk_refs에서 confidence=FK (score=1.0)
  2. _id 패턴 — 테이블명 매칭 시 HIGH (0.8~0.9), 코멘트 일치 +0.1
  3. _cd/_code 패턴 — MEDIUM (0.6~0.7)
  4. _no 패턴 — LOW (0.45~0.55)
  5. PK 컬럼명 직접 일치 — MEDIUM (0.55)

❌ 남은 작업

  1. MSSQL/Oracle 실제 연결 테스트 — MySQL만 검증됨
  2. 도메인 자동 분류 — 테이블 prefix 기반 (예: mail_ → MAIL)
    • 현재 Worker에서 domain 필드 반환하지 않음 (erdBuilder에서 'ETC' 기본값)
    • inference_service.py 또는 build-erd 엔드포인트에서 prefix 분류 추가 필요
  3. 대형 스키마 성능 — 200+ 테이블 처리 시 타임아웃 테스트
  4. pytest 테스트tests/ 디렉터리 없음

기동 방법

cd worker
python -m uvicorn app.main:app --port 8000 --reload
# python main.py 로 실행 금지 (모듈 경로 문제)

Python 버전 주의

  • Python 3.14 이상: pydantic>=2.10.0 필수 (cp314 wheel 있음)
  • pydantic 2.9.x: Rust build 필요 → 설치 실패
Skills similaires