Skip to content

Document Chunking

Splitting documents into context-aware chunks for RAG pipelines.

Overview

Document chunking is essential for effective Retrieval-Augmented Generation (RAG) workflows.

Features

  • Intelligent chunk boundaries
  • Configurable chunk sizes
  • Overlap management
  • Metadata preservation

Usage

from ingenious.chunk import DocumentChunker

chunker = DocumentChunker()
chunks = chunker.chunk_document(document)

For more details, see the Development Guide.