79448840

Date: 2025-02-18 16:00:07
Score: 1.5
Natty:
Report link

You can now do this using directly pyarrow :

import pyarrow.parquet as pq
 
# global file informations (number of columns, rows, groups,...)
metadata = pq.read_metadata(my_file_path)
print(metadata)
 
# detail of indexes and columns names and types
schema = pq.read_schema(my_file_path)
print(schema)

More information at https://arrow.apache.org/docs/python/parquet.html#inspecting-the-parquet-file-metadata

Reasons:
  • Probably link only (1):
  • Has code block (-0.5):
  • Low reputation (1):
Posted by: Klamelia