
Add ModelColumnStats extraction cell for column-level storage statistics #1

Open
imtkain wants to merge 2 commits into BeSmarterWithData:main from imtkain:claude/check-governance-identifiers-qpvnO


imtkain commented Jan 29, 2026

  • Add new cell (In[5]) to extract column storage stats via DMV queries
  • Extract cardinality, data size, dictionary size, segments, temperature
  • Iterate over all datasets in target workspaces using shared config
  • Include unique ColumnStatsId (SHA-256 hash of composite key)
  • Include WorkspaceName, DatasetName, DatasetId for full context
  • Single write operation to ModelColumnStats lakehouse table
  • Graceful error handling for permission/timeout issues
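
For reviewers, here is a minimal sketch of how a ColumnStatsId like the one described above could be derived. The description only states that it is a SHA-256 hash of a composite key. The function name, the key parts, and the `|` delimiter below are assumptions for illustration, not the notebook's actual code.

```python
import hashlib


def column_stats_id(workspace_id: str, dataset_id: str,
                    table_name: str, column_name: str) -> str:
    """Hypothetical helper: SHA-256 hex digest of a delimited composite key."""
    # Assumption: these four parts make up the composite key. The PR text
    # does not list the actual fields or the delimiter.
    composite_key = "|".join([workspace_id, dataset_id, table_name, column_name])
    # Encode explicitly so the digest does not depend on platform defaults.
    return hashlib.sha256(composite_key.encode("utf-8")).hexdigest()
```

Joining the parts with a delimiter keeps keys such as `("ab", "c")` and `("a", "bc")` from hashing to the same value, and the same inputs always produce the same ID, which keeps repeated extraction runs comparable.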

https://claude.ai/code/session_0126nBTQ4hko9p4vsWSfBCfU
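
The iteration, error handling, and single-write points in this description could look roughly like the sketch below. It assumes each dataset is a dict carrying WorkspaceName, DatasetName, and DatasetId, and that `fetch_stats` is a caller-supplied function standing in for the DMV query. Both names and the choice of exception types are hypothetical, not taken from the notebook.

```python
from typing import Callable, Iterable


def collect_column_stats(
    datasets: Iterable[dict],
    fetch_stats: Callable[[dict], Iterable[dict]],
) -> tuple[list[dict], list[tuple[str, str]]]:
    """Gather per-column rows for every dataset, skipping datasets that fail."""
    rows: list[dict] = []
    failures: list[tuple[str, str]] = []
    for ds in datasets:
        try:
            # Build this dataset's rows first so a mid-query failure
            # does not leave partial results in the combined list.
            dataset_rows = [{**ds, **stat} for stat in fetch_stats(ds)]
        except (PermissionError, TimeoutError) as exc:
            # Assumption: permission and timeout problems surface as these
            # built-in exceptions. A real client may raise its own types.
            failures.append((ds.get("DatasetName", ""), str(exc)))
            continue
        rows.extend(dataset_rows)
    return rows, failures
```

Returning all rows at once lets the caller perform one write to the ModelColumnStats table after the loop, and the failures list records which datasets were skipped and why.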

claude and others added 2 commits January 29, 2026 11:27

2 participants