Semantic Search
Use semantic search when exact keyword matching is too brittle for filing research. This surface is built for concept retrieval across filing sections and related-filing discovery, not just literal text lookup.
Pick the right mode
| Mode | Best for | Why |
|---|---|---|
keyword | Exact lookups such as AAPL 10-K 2024 or a known phrase | Uses traditional keyword retrieval and works best when precision depends on literal terms |
semantic | Concept-based discovery such as supply chain stress, AI capex, or pricing pressure | Uses vector similarity to retrieve filings and sections that discuss the idea even when they use different wording |
hybrid | Most real research workflows | Runs keyword and vector retrieval together, merges them, and reranks the combined results for the strongest overall quality |
Start with this workflow
Start with `hybrid`
Use
hybrid first unless you know you need strict keyword matching. It is the best default when you want both literal matches and concept recall.Add narrow filters second
Once the result set looks right, add filters like
ticker, form, filing_year, or limit to focus the search on the issuer or filing family you care about.Common research plays
Find a concept across filings
hybrid is usually the right starting point.
Narrow the search to a filing family
Expand from one filing to related filings
What powers this surface
Semantic search is backed by Voyage AIvoyage-4-large embeddings, Pinecone vector retrieval, and Voyage rerank-2 reranking. voyage-4-large is the primary finance retrieval model for this surface. In hybrid mode, Datastream combines keyword retrieval with vector retrieval, merges the candidate set with Reciprocal Rank Fusion, and reranks the final list for better research quality.
Cost note
- Pay As You Go price:
$0.04per call undersemantic_search - plan discounts apply on the same meter family
- Voyage 4 uses a shared embedding space, which lets Datastream pair a higher-quality index path with a cheaper query path when cost efficiency matters
Read next
GET /v1/search/semantic
Full API reference for semantic and hybrid section search.
GET /v1/filings/{filingId}/similar
Find filings with semantically similar content to a source filing.
Plans and pricing
Review the
semantic_search meter family and launch pricing posture.