Documentation
Docs for Meridian's datasets and the open tools that build them.
Meridian is an open data commons for analysts: selected public datasets you can search and query in the browser, plus the open-source tools every dataset is built with.
Datasets
The toolchain
The datasets and the tools are one project with one goal: type every column, verify every relationship, and make every pipeline reproducible.
Finetype
Semantic type inference for messy data: 245 types across 65+ locales, written in pure Rust, with a DuckDB extension.
Dovetail
Discovers how unfamiliar data loads and how its tables relate, then compiles those findings to runnable SQL.
Arcform
Local-first, asset-aware data pipelines: YAML steps, DuckDB engine, one Rust binary.