AWS Documentation References
Official AWS documentation supporting the NorthBuilt RAG System architecture.
Table of Contents
Overview
Amazon S3 Vectors
Core Documentation
Configuration and Limits
Integration
Key Limits (as of 2025)
Blog Posts and Announcements
Amazon Bedrock Knowledge Bases
Core Documentation
Data Source Configuration
Chunking Strategies
Query and Retrieval
API Reference
Product Resources
Metadata Filtering
Documentation
Supported Filter Operators
Blog Posts
Multi-Tenancy
AWS Blog Posts
Patterns Comparison
Reranking
Documentation
Available Models
Blog Posts
Amazon Titan Embeddings
Documentation
Model Specifications
Blog Posts
Claude on Amazon Bedrock
Documentation
Models Used
AWS Cognito
Documentation
Additional Resources
AWS Samples and Examples
Architecture Guidance
How We Use These References
Overview
This page provides direct links to official AWS documentation that supports and validates our architectural decisions. These references are authoritative sources for understanding the capabilities, limitations, and best practices of the AWS services we use.
Amazon S3 Vectors
S3 Vectors is the vector storage backend for our Bedrock Knowledge Base, providing purpose-built, cost-optimized storage for semantic search.
Core Documentation
Configuration and Limits
Integration
Key Limits (as of 2025)
Blog Posts and Announcements
Amazon Bedrock Knowledge Bases
Bedrock Knowledge Bases provides the managed RAG orchestration layer, handling document ingestion, chunking, embedding, and retrieval.
Core Documentation
Data Source Configuration
Chunking Strategies
Chunking Strategy Options:
Fixed-size : Token count with overlap percentage
Hierarchical : Parent-child chunk relationships
Semantic : NLP-based meaning boundaries
No chunking : Treat document as single chunk
Query and Retrieval
API Reference
Product Resources
Metadata filtering enables multi-tenant isolation and targeted retrieval in our RAG system.
Documentation
Supported Filter Operators
Operator
Description
S3 Vectors Support
equals
Exact match
Yes
notEquals
Exclusion
Yes
greaterThan
Numeric comparison
Yes
lessThan
Numeric comparison
Yes
in
Match any in list
Yes
notIn
Exclude any in list
Yes
andAll
Logical AND
Yes
orAll
Logical OR
Yes
startsWith
Prefix match
No
stringContains
Substring match
No
listContains
List membership
Yes
Important : When using S3 Vectors as the vector store, startsWith and stringContains operators are not supported. See RetrievalFilter documentation .
Blog Posts
Multi-Tenancy
Multi-tenancy patterns enable serving multiple clients from a single Knowledge Base while maintaining data isolation.
AWS Blog Posts
Patterns Comparison
Pattern
Description
Use Case
Pool (our approach)
Single KB with metadata filtering
Cost-effective, simpler management
Silo
Separate KB per tenant
Maximum isolation, per-tenant encryption
Hybrid
Shared infrastructure, separate data sources
Balance of isolation and efficiency
Reranking
Reranking improves retrieval relevance by re-scoring results based on semantic similarity to the query.
Documentation
Available Models
Model
Availability
Notes
Cohere Rerank 3.5
us-east-1, us-west-2, ca-central-1, eu-central-1, ap-northeast-1
Used in our system
Amazon Rerank 1.0
Not us-east-1
Not available in our region
Blog Posts
Amazon Titan Embeddings
Titan Embeddings V2 generates the vector representations for semantic search.
Documentation
Model Specifications
Specification
Value
Model ID
amazon.titan-embed-text-v2:0
Max input
8,192 tokens
Default dimensions
1,024
Available dimensions
256, 512, 1,024
Dimension retention (512)
~99% accuracy
Dimension retention (256)
~97% accuracy
Blog Posts
Claude on Amazon Bedrock
Claude Sonnet 4.5 powers our response generation and query understanding.
Documentation
Models Used
Model
Purpose
Model ID
Claude Sonnet 4.5
Response generation
us.anthropic.claude-sonnet-4-5-20250929-v1:0
Claude Haiku
Query understanding (cost-efficient)
anthropic.claude-3-haiku-20240307-v1:0
AWS Cognito
Cognito provides authentication with Google OAuth integration.
Documentation
Additional Resources
AWS Samples and Examples
Architecture Guidance
How We Use These References
Throughout our documentation, we link to these AWS sources to:
Validate architectural decisions - Our choices are backed by official AWS guidance
Explain limitations - Service constraints are documented by AWS
Provide deeper context - Readers can explore official docs for more detail
Stay current - AWS documentation reflects the latest service capabilities
When you see a claim in our documentation, look for the accompanying AWS reference link to verify and learn more.
Last updated: 2026-01-01