✨ Xorq is an opinionated framework for cataloging, sharing, and shipping multi-engine compute as diffable artifacts for your data in flight. ✨
Xorq helps teams build declarative, reusable ML pipelines across Python and SQL engines like DuckDB, Snowflake, and DataFusion. It offers:
🧠 Multi-engine, declarative expressions using pandas-style syntax and Ibis.
using pandas-style syntax and Ibis. 📦 Expression Format for Python in YAML, enabling repeatable compute.
for Python in YAML, enabling repeatable compute. ⚡ Portable UDFs and UDAFs with automatic serialization.
with automatic serialization. 🔁 Shift-left with caching using expr hash for naming things.
using expr hash for naming things. 🔍 Column-level lineage and observability out of the box.
🔧 Quickstart
pip install xorq[examples] xorq init -t penguins
Then follow the Quickstart Tutorial for a full walk-through using the Penguins dataset.
... continue reading