docs/user_guides/fs/feature_group/data_types.md
Lines changed: 13 additions & 1 deletion
@@ -139,10 +139,22 @@ The byte size of each column is determined by its data type and calculated as fo
 | VARCHAR(LENGTH) | LENGTH * 4 |
 | VARCHAR(LENGTH) charset latin1; | LENGTH * 1 |
 | TEXT | 256 |
-| VARBINARY(LENGTH) | LENGTH / 1.4|
+| VARBINARY(LENGTH) | LENGTH |
 | BLOB | 256 |
 | other | 8 |
 
+!!! note "VARCHAR / VARBINARY overhead"
+
+    For VARCHAR and VARBINARY data types, an additional 1 byte is required if the size is less than 256 bytes. If the size is 256 bytes or greater, 2 additional bytes are required.
+
+    Memory allocation is performed in groups of 4 bytes. For example, a VARBINARY(100) requires 104 bytes of memory:
+
+    - 100 bytes for the data itself
+    - 1 byte of overhead
+    - Total = 101 bytes
+
+    Since memory is allocated in 4-byte groups, storing 101 bytes requires 26 groups (26 × 4 = 104 bytes) of allocated memory.
+
 
 #### Pre-insert schema validation for online feature groups
 For online enabled feature groups, the dataframe to be ingested needs to adhere to the online schema definitions. The input dataframe is validated for schema checks accordingly.
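The sizing rule described in the VARCHAR / VARBINARY overhead note can be checked with a small calculation. The sketch below is illustrative only: the function names `varchar_payload_bytes` and `allocated_bytes` are hypothetical and not part of any Hopsworks API, and it assumes that "size" in the note means the per-column byte size from the table and that rounding to 4-byte groups happens after the overhead is added, as in the VARBINARY(100) example.

```python
def varchar_payload_bytes(length: int, latin1: bool = False) -> int:
    """Byte size of VARCHAR(length) per the table: LENGTH * 4, or LENGTH * 1 for latin1."""
    return length * (1 if latin1 else 4)


def allocated_bytes(payload_bytes: int) -> int:
    """Estimate allocated memory for one VARCHAR/VARBINARY column (hypothetical helper).

    Assumption: overhead is 1 byte below 256 bytes of payload, 2 bytes otherwise,
    and the total is rounded up to the next multiple of 4.
    """
    if payload_bytes < 0:
        raise ValueError("payload_bytes must be non-negative")
    overhead = 1 if payload_bytes < 256 else 2
    total = payload_bytes + overhead
    # Ceiling division by 4, then back to bytes.
    return -(-total // 4) * 4


# VARBINARY(100): 100 + 1 = 101 bytes, rounded up to 26 groups of 4.
print(allocated_bytes(100))  # 104
# VARCHAR(100) with the default 4 bytes per character: 400 + 2 = 402, rounded up.
print(allocated_bytes(varchar_payload_bytes(100)))  # 404
```

Under these assumptions, a VARBINARY(255) column lands exactly on a 4-byte boundary (256 bytes), while VARBINARY(256) crosses into the 2-byte overhead case and is estimated at 260 bytes.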