Show HN: Xorq – open compute catalog for AI

✨ Xorq is an opinionated framework for cataloging, sharing, and shipping multi-engine compute as diffable artifacts for your data in flight. ✨

Xorq helps teams build declarative, reusable ML pipelines across Python and SQL engines like DuckDB, Snowflake, and DataFusion. It offers:

🧠 Multi-engine, declarative expressions using pandas-style syntax and Ibis.

📦 Expression Format for Python in YAML, enabling repeatable compute.

⚡ Portable UDFs and UDAFs with automatic serialization.

🔁 Shift-left with caching using expr hash for naming things.

🔍 Column-level lineage and observability out of the box.
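
To make the "caching using expr hash for naming things" idea concrete, here is a minimal sketch of content-addressed caching in plain Python. It is not Xorq's implementation or API: the dict-based expression, `expr_hash`, and `HashNamedCache` are hypothetical names invented for illustration, and it assumes an expression can be serialized to a canonical form before hashing.

```python
import hashlib
import json


def expr_hash(expr: dict) -> str:
    """Return a short, deterministic name for an expression (hypothetical helper).

    The expression is serialized with sorted keys so that two logically
    identical expressions hash to the same value regardless of key order.
    """
    canonical = json.dumps(expr, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()[:12]


class HashNamedCache:
    """In-memory cache keyed by expression hash (illustrative only)."""

    def __init__(self):
        self._store = {}
        self.computations = 0  # counts how often the compute function actually ran

    def get_or_compute(self, expr: dict, compute):
        key = expr_hash(expr)
        if key not in self._store:
            # Cache miss: run the computation once and store it under the hash.
            self._store[key] = compute(expr)
            self.computations += 1
        return self._store[key]


# A toy "expression": the same logical query written with different key order.
expr_a = {"op": "filter", "table": "penguins", "predicate": "bill_length_mm > 40"}
expr_b = {"predicate": "bill_length_mm > 40", "table": "penguins", "op": "filter"}

cache = HashNamedCache()
result_a = cache.get_or_compute(expr_a, lambda e: f"result of {e['op']} on {e['table']}")
result_b = cache.get_or_compute(expr_b, lambda e: f"result of {e['op']} on {e['table']}")
```

Because the name is derived from the expression itself, the second call reuses the first result instead of recomputing, and any change to the expression produces a new name rather than silently overwriting the old entry.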

🔧 Quickstart

pip install xorq[examples]
xorq init -t penguins

Then follow the Quickstart Tutorial for a full walk-through using the Penguins dataset.