Free online tool
In development

Convert JSON to Parquet

Convert JSON arrays or NDJSON (newline-delimited JSON) into typed, compressed .parquet — preserving nested objects, lists, and inferred types. Online tool in development; reliable methods that work today are below.

Why convert JSON to Parquet?

JSON is everywhere — APIs, logs, MongoDB exports, webhook archives. It's a great wire format and a poor analytics format:

Convert JSON to Parquet today (without this tool)

DuckDB — NDJSON / JSON Lines

duckdb -c "COPY (SELECT * FROM read_json_auto('data.ndjson'))
           TO 'data.parquet'
           (FORMAT 'parquet', COMPRESSION 'zstd')"

Python (pandas)

Works for both JSON arrays and NDJSON — pass lines=True for line-delimited:

import pandas as pd

# JSON array
df = pd.read_json("data.json")
# Or NDJSON
df = pd.read_json("data.ndjson", lines=True)

df.to_parquet("data.parquet", compression="snappy")

PyArrow — preserves nested structures

import pyarrow.json as pj
import pyarrow.parquet as pq

table = pj.read_json("data.ndjson")
pq.write_table(table, "data.parquet", compression="zstd")

Things to watch out for

Related tools