S3 Select

S3 Select is an S3 feature that allows you to operate on JSON, CSV, and Parquet files in a row-based manner using SQL syntax. T4 features experimental support for S3 Select queries as part of the Bucket interface:

$ python
>>> import t4
>>> b = t4.Bucket("s3://alpha-quilt-example")
>>> b.select("foo/bar.csv", "SELECT * FROM S3Object LIMIT 5")
<<< <pandas DataFrame object at ...>

Last updated