Installation

Requirements

  • Python 3.10 or later
  • A SQL database for the metadata store (SQLite for local development, PostgreSQL for production)

Install

The package is published as datapipe-core on PyPI:

pip install datapipe-core

For local development with SQLite, add the sqlite extra. The SQLite library that ships with Python is older than the version datapipe can use, so the extra installs pysqlite3-binary, which provides a compatible build:

pip install "datapipe-core[sqlite]"
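To see which SQLite build your interpreter has, and whether the module provided by pysqlite3-binary is importable, a small check along these lines may help. It is only a sketch: the helper name sqlite_report is hypothetical, and because the minimum SQLite version datapipe needs is not stated here, the script reports versions without judging them.

```python
import importlib.util
import sqlite3


def sqlite_report() -> dict:
    """Collect what this interpreter can see of SQLite (hypothetical helper)."""
    return {
        # Version of the SQLite library linked into the standard-library module.
        "stdlib_sqlite": sqlite3.sqlite_version,
        # True if the pysqlite3 module (installed by pysqlite3-binary) can be imported.
        "pysqlite3_available": importlib.util.find_spec("pysqlite3") is not None,
    }


if __name__ == "__main__":
    for key, value in sqlite_report().items():
        print(f"{key}: {value}")
```

If pysqlite3_available is False after installing the sqlite extra, the package was probably installed into a different Python environment than the one running the script.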

Optional extras

Extra     Installs
sqlite    pysqlite3-binary (required for SQLite support)
redis     redis client
elastic   elasticsearch client
qdrant    qdrant-client
milvus    pymilvus
ray       ray[default] (for parallel execution across steps)
gcsfs     gcsfs (for Google Cloud Storage file backends)
s3fs      s3fs (for S3 file backends)
excel     xlrd, openpyxl (for Excel file backends)
gcp       OpenTelemetry GCP trace exporter
pyarrow   Parquet file backend support
neo4j     Neo4j graph store backend

Multiple extras can be combined:

pip install "datapipe-core[sqlite,redis]"

Verify

datapipe --help

This should print the datapipe CLI help. If the command is not found, check that the Python environment where you installed the package is active.
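If the command is still not found, the following sketch can help tell apart "package not installed in this environment" from "command not on PATH". It uses only the Python standard library; the function name check_install is hypothetical, and the distribution and command names are taken from the instructions above.

```python
import shutil
from importlib import metadata


def check_install(dist_name: str = "datapipe-core", command: str = "datapipe") -> dict:
    """Report the installed distribution version and the command location (hypothetical helper)."""
    try:
        # Looks up the distribution in the environment of the running interpreter.
        version = metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        version = None
    # shutil.which returns None when the executable is not on PATH.
    return {"version": version, "command_path": shutil.which(command)}


if __name__ == "__main__":
    print(check_install())
```

A version with no command_path suggests the environment's scripts directory is missing from PATH; no version at all suggests the package was installed into a different environment.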