Answers for "read parquet from s3 and convert to dataframe"

0

pandas dataframe to parquet s3

import awswrangler as wr
wr.pandas.to_parquet(
    dataframe=df,
    path="s3://my-bucket/key/my-file.parquet"
)
Posted by: Guest on June-21-2020
0

read parquet from s3 and convert to dataframe

import pyarrow.parquet as pq
import s3fs

dataset = pq.ParquetDataset('s3://<s3_path_to_folder_or_file>', 
filesystem=s3fs.S3FileSystem(), filters=[('colA', '=', 'some_value'), ('colB', '>=', some_number)])
table = dataset.read()
df = table.to_pandas()
Posted by: Guest on March-19-2021

Browse Popular Code Answers by Language