Show HN: Xorq – open-source Python-first Pandas-style pipelines

Published on: 2025-05-26 12:27:31

xorq: Multi-engine ML pipelines made simple

xorq is a deferred computational framework that brings the replicability and performance of declarative pipelines to the Python ML ecosystem. It enables us to write pandas-style transformations that never run out of memory, automatically cache intermediate results, and seamlessly move between SQL engines and Python UDFs, all while maintaining replicability. xorq is built on top of Ibis and DataFusion.

Feature: Description

Declarative expressions: Express and execute complex data processing logic via declarative functions. Define transformations as Ibis expressions so that you are not tied to a specific execution engine.

Multi-engine: Create unified ML workflows that leverage the strengths of different data engines in a single pipeline. xorq orchestrates data movement between engines (e.g., Snowflake for initial extraction, DuckDB for transformations, and Python for ML model training).

Built-in caching: xorq automatically caches intermediate pipe ...
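
To make the ideas of deferred execution and cached intermediate results concrete, here is a minimal toy sketch in plain Python. It is not xorq's or Ibis's API: the `Expr` and `Executor` classes, their methods, and the sample data are all hypothetical names invented for illustration. The sketch only shows the general pattern of building an expression tree first and running it later, with one intermediate step cached so that two downstream pipelines reuse it.

```python
from typing import Callable, Dict, List, Optional

Row = Dict[str, float]


class Expr:
    """A deferred step (hypothetical): records what to do, does no work itself."""

    def __init__(self, op: str, parent: Optional["Expr"] = None, arg: object = None):
        self.op = op
        self.parent = parent
        self.arg = arg

    def filter(self, predicate: Callable[[Row], bool]) -> "Expr":
        return Expr("filter", self, predicate)

    def mutate(self, name: str, fn: Callable[[Row], float]) -> "Expr":
        return Expr("mutate", self, (name, fn))

    def cache(self) -> "Expr":
        # Marks this point in the pipeline as worth remembering.
        return Expr("cache", self)


class Executor:
    """Walks an expression tree on demand and remembers cached nodes."""

    def __init__(self) -> None:
        self._cache: Dict[Expr, List[Row]] = {}
        self.evaluations = 0  # counts filter/mutate steps that actually ran

    def execute(self, expr: Expr) -> List[Row]:
        if expr.op == "source":
            return list(expr.arg)
        if expr.op == "cache":
            # Compute the parent once, then serve later requests from memory.
            if expr not in self._cache:
                self._cache[expr] = self.execute(expr.parent)
            return self._cache[expr]
        rows = self.execute(expr.parent)
        self.evaluations += 1
        if expr.op == "filter":
            return [r for r in rows if expr.arg(r)]
        if expr.op == "mutate":
            name, fn = expr.arg
            return [{**r, name: fn(r)} for r in rows]
        raise ValueError(f"unknown operation: {expr.op}")


# Building the pipelines does no work; only execute() does.
source = Expr("source", arg=[{"price": 5.0}, {"price": 20.0}, {"price": 40.0}])
expensive = source.filter(lambda r: r["price"] > 10).cache()
with_tax = expensive.mutate("with_tax", lambda r: r["price"] * 1.25)
discounted = expensive.mutate("discounted", lambda r: r["price"] * 0.5)

executor = Executor()
steps_before_execute = executor.evaluations
result_tax = executor.execute(with_tax)
result_discounted = executor.execute(discounted)
print(result_tax)
print(result_discounted)
print("steps run:", executor.evaluations)
```

Both downstream pipelines share the cached `expensive` step, so the filter runs only once even though two results are computed. A real framework would additionally persist the cache and dispatch each step to a chosen engine, which this toy version does not attempt.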