Guide

What is a Parquet file?

A plain-English guide to the Apache Parquet format — what .parquet files are, why they're so small and fast, and how to open one without any code.

Runs in your browser · nothing is uploaded · no signup

A Parquet file (extension .parquet) is an open, binary file format for storing tabular data — rows and columns — built for analytics. It was created in the Apache Hadoop ecosystem and is now one of the most common formats for data engineering, data science, and data lakes. Unlike CSV, you can't read a Parquet file in a text editor: it's compressed and stored column-by-column. The easiest way to look inside one is to open it in the web viewer.

Columnar storage, explained

Most formats (like CSV) store data row by row: all the fields of row 1, then all of row 2, and so on. Parquet stores it column by column: all the values of the first column together, then the next column, and so on.

That one change has big consequences. Because values in a column are all the same type and often similar, they compress extremely well. And when a query only needs three columns out of fifty, the engine reads just those three and skips the rest — which is why analytics over Parquet is so much faster than over CSV.

What makes Parquet useful

  • Small. Columnar layout plus compression (Snappy, ZSTD, and others) makes Parquet far smaller than the equivalent CSV or JSON.
  • Fast. Column pruning and built-in statistics let engines skip data they don't need.
  • Typed. Each column has a real data type and a schema, so numbers, dates, and booleans are preserved exactly.
  • Portable and open. It's an open standard read by Spark, pandas, Polars, BigQuery, Snowflake, and many more.
  • Self-describing. The schema and column statistics live inside the file's footer.

Parquet vs CSV vs JSON

  • CSV — plain text, universal, human-readable, but large, untyped, and slow to scan. Good for sharing small tables.
  • JSON — flexible and great for nested data and APIs, but verbose and untyped for tabular use.
  • Parquet — compact, typed, and fast for analytics, but binary and not human-readable.

A common workflow is to keep working data in Parquet and convert to CSV or JSON only when a tool or a colleague needs it. You can do all of those conversions here: Parquet to CSV, Parquet to JSON, CSV to Parquet, and more on the converters page.

How do I open a Parquet file?

You don't need Python or pandas just to look inside one. Open the online Parquet viewer, drag your .parquet file in, and it loads instantly — read locally in your browser. You can browse the rows, inspect the schema, filter and sort, run SQL, and export to another format. The file's contents never leave your device.

Frequently asked questions

How do I open a .parquet file?

Open the web viewer and drag the file in — it opens in your browser with nothing to install. You can also use the macOS app, or libraries like pandas and Polars.

Is Parquet better than CSV?

For analytics, usually yes: it's smaller, typed, and much faster to query. CSV still wins for simple sharing and human readability. They serve different jobs.

Can I edit a Parquet file?

Parquet files are typically written once and read many times rather than edited in place. To change one, you read it, transform the data, and write a new file — for example by exporting from the viewer.

Is Parquet only for big data?

No. It's an excellent format for small tables too, thanks to compression and typing — but its advantages grow with size.

Open your file in seconds — right in the browser

No upload, no signup, no install. Your data is read locally on your device and never sent to us.