Core Concepts
This section covers the foundational building blocks of Laurus. Understanding these concepts will help you design effective schemas and configure your search engine.
Topics
Schema & Fields
How to define the structure of your documents. Covers:
SchemaandSchemaBuilder- Lexical field types (Text, Integer, Float, Boolean, DateTime, Geo, Bytes)
- Vector field types (Flat, HNSW, IVF)
DocumentandDocumentBuilderDataValue— the unified value type
Text Analysis
How text is processed before indexing. Covers:
- The
Analyzertrait and the analysis pipeline - Built-in analyzers (Standard, Japanese, Keyword, Pipeline)
PerFieldAnalyzer— different analyzers for different fields- Tokenizers and token filters
Embeddings
How text and images are converted to vectors. Covers:
- The
Embeddertrait - Built-in embedders (Candle BERT, OpenAI, CLIP, Precomputed)
PerFieldEmbedder— different embedders for different fields
Storage
Where index data is stored. Covers:
- The
Storagetrait - Storage backends (Memory, File, Mmap)
PrefixedStoragefor component isolation- Choosing the right backend for your use case