RAG Implementation

Master Retrieval-Augmented Generation (RAG) to build LLM applications that provide grounded responses using external knowledge sources. Implement Q&A systems, reduce hallucinations, and integrate LLMs with knowledge bases.

Sby Skills Guide Bot
Data & AIAdvanced0 views0 installs2/28/2026
Claude CodeCursorWindsurf
ragvector-databasesembeddingssemantic-searchllm-applications

version: 4.1.0-fractal name: rag-implementation description: Build Retrieval-Augmented Generation (RAG) systems for LLM applications with vector databases and semantic search. Use when implementing knowledge-grounded AI, building document Q&A systems, or integrating LLMs with external knowledge bases.

RAG Implementation

Master Retrieval-Augmented Generation (RAG) to build LLM applications that provide accurate, grounded responses using external knowledge sources.

Use this skill when

  • Building Q&A systems over proprietary documents
  • Creating chatbots with current, factual information
  • Implementing semantic search with natural language queries
  • Reducing hallucinations with grounded responses
  • Enabling LLMs to access domain-specific knowledge
  • Building documentation assistants
  • Creating research tools with source citation

Do not use this skill when

  • You only need purely generative writing without retrieval
  • The dataset is too small to justify embeddings
  • You cannot store or process the source data safely

Instructions

  1. Define the corpus, update cadence, and evaluation targets.
  2. Choose embedding models and vector store based on scale.
  3. Build ingestion, chunking, and retrieval with reranking.
  4. Evaluate with grounded QA metrics and monitor drift.

Safety

  • Redact sensitive data and enforce access controls.
  • Avoid exposing source documents in responses when restricted.

Core Components

🧠 Knowledge Modules (Fractal Skills)

1. 1. Vector Databases

2. 2. Embeddings

3. 3. Retrieval Strategies

4. 4. Reranking

5. Pattern 1: Hybrid Search

6. Pattern 2: Multi-Query Retrieval

7. Pattern 3: Contextual Compression

8. Pattern 4: Parent Document Retriever

9. Recursive Character Text Splitter

10. Token-Based Splitting

11. Semantic Chunking

12. Markdown Header Splitter

13. Pinecone

14. Weaviate

15. Chroma (Local)

16. 1. Metadata Filtering

17. 2. Maximal Marginal Relevance

18. 3. Reranking with Cross-Encoder

19. Contextual Prompt

20. With Citations

21. With Confidence

Related skills