Note that this function is now deprecated. Please use
read_parquet_schema()
for files, and infer_parquet_schema()
for
data frames.
Usage
parquet_column_types(x, options = parquet_options())
Arguments
- x
Path to a Parquet file, or a data frame.
- options
Nanoparquet options, see
parquet_options()
.
Value
Data frame with columns:
file_name
: file name.name
: column name.type
: (low level) Parquet data type.r_type
: the R type that corresponds to the Parquet type. Might beNA
ifread_parquet()
cannot read this column. See nanoparquet-types for the type mapping rules.repetition_type
: whether the column inREQUIRED
(cannot beNA
) orOPTIONAL
(may beNA
).REPEATED
columns are not currently supported by nanoparquet.logical_type
: Parquet logical type in a list column. An element has at least an entry calledtype
, and potentially additional entries, e.g.bit_width
,is_signed
, etc.
Details
This function works two ways. It can map the R types of a data frame to
Parquet types, to see how write_parquet()
would write out the data
frame. It can also map the types of a Parquet file to R types, to see
how read_parquet()
would read the file into R.
See also
read_parquet_metadata()
to read more metadata,
read_parquet_info()
for a very short summary.
read_parquet_schema()
for the complete Parquet schema.
read_parquet()
, write_parquet()
, nanoparquet-types.