Home - MiguelElGallo/iparq GitHub Wiki
Welcome to the iparq wiki

iparq is a Python CLI for inspecting Apache Parquet files and surfacing the details that are usually hidden in metadata: compression codecs, bloom filters, encryption status, min/max statistics, compression ratios, and more.
It was born from a simple curiosity: which Parquet versions and compression methods do everyday tools actually produce? iparq exists to make those answers easy to discover from the command line. It is built with Typer, Pydantic, PyArrow, and Rich, is MIT licensed, and is maintained by MiguelElGallo.
You can learn more on PyPI or in the GitHub repository. iparq can be installed with uvx, pip, uv, or Homebrew.
Quick Start
Run iparq without installing it globally:
uvx --refresh iparq inspect yourparquet.parquet
Key Features
- Bloom filter detection
- Encryption detection
- Statistics exactness reporting (PyArrow 22+)
- Compression ratio reporting
- JSON output for automation
- Multi-file and glob support
Wiki Pages
- Installation - All installation methods
- Usage Guide - Comprehensive usage examples
- Architecture - Code structure and design
- Understanding Parquet Metadata - What iparq shows and why it matters
- Contributing - Development setup and how to contribute