Working with Parquet in TypeScript
Apache Parquet file handling with high-performance columnar data processing and compression.
Installation
Section titled “Installation”npm install fairspecGetting Started
Section titled “Getting Started”The Parquet plugin provides:
loadParquetTable- Load Parquet files into tablessaveParquetTable- Save tables to Parquet filesParquetPlugin- Plugin for framework integration
For example:
import { loadParquetTable } from "fairspec"
const table = await loadParquetTable({ data: "table.parquet" })// Efficient columnar format with compressionBasic Usage
Section titled “Basic Usage”Loading Parquet Files
Section titled “Loading Parquet Files”import { loadParquetTable } from "fairspec"
// Load from local fileconst table = await loadParquetTable({ data: "data.parquet" })
// Load from remote URLconst table = await loadParquetTable({ data: "https://example.com/data.parquet"})
// Load multiple files (concatenated)const table = await loadParquetTable({ data: ["file1.parquet", "file2.parquet"]})Saving Parquet Files
Section titled “Saving Parquet Files”import { saveParquetTable } from "fairspec"
// Save with default optionsawait saveParquetTable(table, { path: "output.parquet" })
// Save with explicit formatawait saveParquetTable(table, { path: "output.parquet", format: { name: "parquet" }})Advanced Features
Section titled “Advanced Features”Remote File Loading
Section titled “Remote File Loading”// Load from URLconst table = await loadParquetTable({ data: "https://example.com/data.parquet"})
// Load multiple remote filesconst table = await loadParquetTable({ data: [ "https://api.example.com/data-2023.parquet", "https://api.example.com/data-2024.parquet" ]})