Posts
All the articles I've posted.
-
How VertiPaq Sorts a Table — A Visual Walkthrough
A visual companion to the row-ordering article: step through the greedy bucket-splitting algorithm that VertiPaq appears to use to maximise RLE compression.
-
Parsing Power Pivot Data Models from Excel XLSX Files
Excel workbooks with Power Pivot models use the same Analysis Services and VertiPaq storage concepts as PBIX files. This article shows how pbixray reads them directly from Python.
-
PBIX Parsing in Your Browser: Introducing PBIX.info
PBIX.info is a browser-based PBIX inspector built on WebAssembly, with local-first parsing, interactive diagrams, and shareable collaboration links.
-
How VertiPaq Sorts Rows to Maximize RLE Compression
How VertiPaq appears to use a greedy row-reordering algorithm to maximise RLE compression across all columns in a segment.
-
Reconstructing Column Data from .idf and .idfmeta
How VertiPaq stores column segments in .idf and .idfmeta files, and how those files combine to reconstruct imported table data from a PBIX file.
-
VertiPaq Dictionaries and Hash Indexes
How VertiPaq stores dictionaries and hash indexes inside a PBIX file, and how those structures help turn internal IDs back into readable values.
-
Parsing PBIX Files with Python (pbixray)
Extract tables, measures, M queries, and report metadata from any PBIX file using the open-source Python library pbixray, with no Power BI required.
-
Inside metadata.sqlitedb: Tables, Columns, Measures & Relationships
How metadata.sqlitedb inside a PBIX file exposes the semantic model as an embedded SQLite database, mirroring the objects exposed by the TMSCHEMA DMVs.