An independent workshop for open-source data tooling.
No SaaS. No seats. Just software you can read, run, and own - from visual data pipelines to file-format libraries.
Flagship
File formats
Reverse-engineering and documenting proprietary binary formats - open specs and clean-room readers for closed file types.
OpenYXDB
Maintained fork of Alteryx's YXDB library. Cross-platform C++ core with Python bindings via nanobind.
OpenQVD
Clean-room Rust reader/writer for Qlik QVD files. Full spec, Python bindings (PyArrow, Polars, Pandas), CLI.
OpenSQLAnywhere
Pure-Rust reader and open specification for SAP SQL Anywhere page-store files. CRC verification, slotted-page parsing, AP deobfuscation primitive.
OpenQBW
Pure-Rust reader for Intuit QuickBooks Desktop .QBW company files. Migrate to CSV, SQLite, or IIF without QuickBooks installed. Validated on 13,375 invoices, $399.9M total.
SigilYX
Standalone Rust reader and writer for Alteryx YXDB files, with Python bindings for Polars, PyArrow, and Pandas. Full E1 read/write; experimental E2 read.
Bioinformatics
BioLance
Fast, columnar multi-sample variant store on LanceDB. Ingests VCF/BCF and ClinVar - query by region or gene, no Spark, no cloud.
OpenKSpace
Rust library and CLI for Cartesian MRI k-space reconstruction from
ISMRMRD .h5 files.
DICOM Atlas
Open registry of public and private DICOM tags compiled from vendor conformance statements plus PS3.6, pydicom, and GDCM.
Proteomics
Clean-room specs and readers for closed mass-spectrometry formats, reverse-engineered without vendor SDKs.
OpenTFRaw
Pure-Rust parser for Thermo Fisher RAW files. Covers format versions 8-66, from the LCQ Classic through the Orbitrap Astral.
OpenTimsTDF
Pure-Rust parser for Bruker timsTOF TDF bundles. SQLite metadata + binary frames, both LZF and zstd codecs, full calibration.
OpenWRaw
Pure-Rust parser for Waters MassLynx RAW directories. Header, index, function inventory, externs, chromatograms, and frame data.
OpenProteo
Unified Rust and Python stack over all three parsers. Single API, mzML 1.1.0 conversion, Arrow output - no vendor SDKs.
OpenProteoCore
The canonical PSI-MS / Arrow vocabulary crate underlying OpenProteo. Lightweight, dependency-free, designed to be shared across the mass-spec ecosystem.
ProLance
Columnar, memory-mapped mass-spec store built on Lance. Ingests via OpenProteo or mzML; query from Rust, Python, or the CLI.