Major refactoring to simplify architecture and improve serverless compatibility: #10
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
• Consolidate UC connections from 2 to 1 (skyflow_conn with flexible base path)
• Migrate tokenization notebook to UC HTTP connections for full Unity Catalog parity
• Remove unused SKYFLOW_ACCOUNT_ID configuration from all components
• Eliminate X-SKYFLOW-ACCOUNT-ID headers (not required by Skyflow APIs)
• Increase tokenization timeout from 5 to 15 minutes for large datasets
• Preserve 3-tier chunking strategy while switching to SQL http_request()
• Use dbutils.secrets.get() + UC connections hybrid for serverless compatibility
This achieves complete UC parity between tokenization and detokenization while
maintaining performance optimizations and fail-fast error handling.