You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- Add links where some were missing
- Less bold
- Native notes instead of blockquotes
- Fix bogus SQL queries
- More cross-linking
- Muted teaser texts at top of pages
- Less mixed case
Copy file name to clipboardExpand all lines: docs/start/query/search/fulltext.md
+7-3Lines changed: 7 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,11 +1,15 @@
1
1
(start-fulltext)=
2
2
# Full-text search
3
3
4
-
Unlike exact-match filters, full-text search allows **fuzzy, linguistic matching** on human language text. It tokenizes input, analyzes language, and searches for **tokens, stems, synonyms**, etc.
4
+
:::{div} sd-text-muted
5
+
CrateDB enables real-time full-text search at scale.
6
+
:::
5
7
6
-
CrateDB supports powerful**full-text search**capabilities directly via the `FULLTEXT` index and the `MATCH()` SQL predicate. This allows you to **combine unstructured search with structured filtering and aggregations**—all in one query, with no need for external search systems like Elasticsearch.
8
+
Unlike exact-match filters,**full-text search** allows **fuzzy, linguistic matching** on human language text. It tokenizes input, analyzes language, and searches for **tokens, stems, synonyms**, etc.
7
9
8
-
Whether you're working with log messages, customer feedback, machine-generated data, or IoT event streams, CrateDB enables **real-time full-text search at scale**.
10
+
CrateDB supports powerful full-text search capabilities directly via the `FULLTEXT` index and the `MATCH()` SQL predicate. This allows you to **combine unstructured search with structured filtering and aggregations**—all in one query, with no need for external search systems like Elasticsearch.
11
+
12
+
CrateDB supports you whether you are working with log messages, customer feedback, machine-generated data, or IoT event streams.
Copy file name to clipboardExpand all lines: docs/start/query/search/geo.md
+7-5Lines changed: 7 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,15 +1,17 @@
1
1
(start-geospatial)=
2
2
# Geospatial search
3
3
4
-
CrateDB enables geospatial search using **Lucene’s Prefix Tree** and **BKD Tree** indexing structures. With CrateDB, you can:
4
+
:::{div} sd-text-muted
5
+
Query geospatial data through SQL, combining ease of use with advanced capabilities.
6
+
:::
7
+
8
+
CrateDB enables geospatial search using **Lucene’s prefix tree** and **BKD tree** indexing structures. With CrateDB, you can:
5
9
6
10
* Store and index geographic **points** and **shapes**
7
11
* Perform spatial queries using **bounding boxes**, **circles**, **donut shapes**, and more
8
12
* Filter, sort, or boost results by **distance**, **area**, or **spatial relationship**
9
13
10
-
You interact with geospatial data through SQL, combining ease of use with advanced capabilities.
11
-
12
-
See the Data Modelling (!!! add link) section for details of Data Types and how to insert data.
14
+
See the {ref}`data-modelling` section for details of data types and how to insert data.
13
15
14
16
## Querying Geospatial Data
15
17
@@ -70,7 +72,7 @@ You can choose and configure the indexing method when defining your table schema
70
72
71
73
While CrateDB can perform **exact computations** on complex geometries (e.g. large polygons, geometry collections), these can be computationally expensive. Choose your index strategy carefully based on your query patterns.
72
74
73
-
For full details, refer to the Geo Shape Column Definition section (!!! add link) in the reference.
75
+
For full details, refer to the Geo Shape column definition section in the reference documentation.
Copy file name to clipboardExpand all lines: docs/start/query/search/hybrid.md
+19-6Lines changed: 19 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,22 +1,27 @@
1
1
(start-hybrid)=
2
2
# Hybrid search
3
3
4
+
:::{div} sd-text-muted
5
+
Combine vector similarity (kNN) and term-based full-text (BM25)
6
+
searches in a single SQL query.
7
+
:::
8
+
4
9
While **vector search** provides powerful semantic retrieval based on machine learning models, it's not always optimal, especially when models are not fine-tuned for a specific domain. On the other hand, **traditional full-text search** (e.g., BM25 scoring) offers high precision on exact or keyword-based queries, with strong performance out of the box. **Hybrid search** blends these approaches, combining semantic understanding with keyword relevance to deliver more accurate, robust, and context-aware search results.
5
10
6
-
Hybrid search is particularly effective for **Knowledge bases, Product or document search, Multilingual content search, FAQ bots and semantic assistants**, and **AI-powered search experiences.** It allows applications to go beyond keyword matching, incorporating vector similarity while still respecting domain-specific terms.
11
+
Hybrid search is particularly effective for **knowledge bases, product or document search, multilingual content search, FAQ bots and semantic assistants**, and **AI-powered search experiences.** It allows applications to go beyond keyword matching, incorporating vector similarity while still respecting domain-specific terms.
7
12
8
-
CrateDB supports **hybrid search** by combining **vector similarity search** (kNN) and **term-based full-text search** (BM25) in a single SQL query. CrateDB lets you implement hybrid search natively in SQL using **Common Table Expressions (CTEs)** and **scoring fusion techniques**, such as:
13
+
CrateDB supports **hybrid search** by combining **vector similarity search** (kNN) and **term-based full-text search** (BM25) in a single SQL query. CrateDB lets you implement hybrid search natively in SQL using **common table expressions (CTEs)** and **scoring fusion techniques**, such as:
Copy file name to clipboardExpand all lines: docs/start/query/search/vector.md
+12-7Lines changed: 12 additions & 7 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,18 +1,23 @@
1
1
(start-vector)=
2
2
# Vector search
3
3
4
-
Vector search retrieves the most semantically similar items to a query vector using **Approximate Nearest Neighbor (ANN)** algorithms (e.g., HNSW via Lucene). 
4
+
:::{div} sd-text-muted
5
+
Store, manage, and search vector embeddings at scale.
6
+
:::
7
+
8
+
Vector search retrieves the most semantically similar items to a query vector using **approximate nearest neighbor (ANN)** algorithms (e.g., HNSW via Lucene). 
5
9
6
10
CrateDB supports **native vector search**, enabling you to perform **similarity-based retrieval** directly in SQL, without needing a separate vector database or search engine.
7
11
8
-
Whether you're powering **semantic search**, **recommendation engines**, **anomaly detection**, or **AI-enhanced applications**, CrateDB lets you store, manage, and search vector embeddings at scale**right alongside your structured, JSON, and full-text data.**
12
+
Whether you're powering **semantic search**, **recommendation engines**, **anomaly detection**, or **AI-enhanced applications**, CrateDB lets you managevector data**right alongside your structured, JSON, and full-text data.**
9
13
10
14
## Why CrateDB for Vector Search?
11
15
12
-
| FLOAT\_VECTOR | Store embeddings up to 2048 dimensions |
0 commit comments