Working with parquet files - HelmutZechmann/dataEngineeringCookbook GitHub Wiki
For working with parquet files you can use the parquet-tools from the parquet project.
If you want to use them locally, make sure to enable the local profile during the build:
cd parquet-tools && mvn clean package -Plocal
Supported commands are
head
-n : Display the first n recordscat
: Cat the whole filedump
: Dump file contentsmeta
: Show metadata information for the fileschema
: Display the file schema