Reports

You can now do this using directly pyarrow :

import pyarrow.parquet as pq
 
# global file informations (number of columns, rows, groups,...)
metadata = pq.read_metadata(my_file_path)
print(metadata)
 
# detail of indexes and columns names and types
schema = pq.read_schema(my_file_path)
print(schema)

More information at https://arrow.apache.org/docs/python/parquet.html#inspecting-the-parquet-file-metadata

79448840