Analytics
Prism is the easiest way to create data pipelines in Python.
Data projects often require multiple steps that need to be executed in sequence (think extract-transform-load, data wrangling, etc.). With Prism, users can break down their project into modular tasks, manage dependencies, and execute complex computations in sequence.
Here are some of Prism's main features:
Prism can be installed via pip
. Prism requires Python >= 3.7.
pip install --upgrade pip
pip install prism-ds
Start your first Prism project with the prism init
command:
$ prism init --project-name my_first_project
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Running with Prism v0.3.0...
Creating template project at ./my_first_project...
______
____ __ \_____(_)________ _______
_____ /_/ / ___/ / ___/ __ `__ \ ____
____ / ____/ / / (__ ) / / / / / _____
___/_/ /_/ /_/____/_/ /_/ /_/ ___
Welcome to Prism, the easiest way to create clean, modular data pipelines
using Python!
To get started, navigate to your newly created project "my_first_project" and try
running the following commands:
> python main.py
> prism run
> prism graph
Consult the documentation here for more information on how to get started.
docs.runprism.com
Happy building!
Done!
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Run your project by navigating to your project directory and running prism run
:
$ cd my_first_project
$ prism run
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
[HH:MM:SS] INFO | Running with Prism v0.3.0...
[HH:MM:SS] INFO | Creating run magnetic-pony-BBDYfwdDzH for client my_first_project-1.0...
[HH:MM:SS] INFO | Found 2 task(s) in 2 module(s) in job magnetic-pony-BBDYfwdDzH...
[HH:MM:SS] INFO | Parsing task dependencies............................................... [RUN]
[HH:MM:SS] INFO | FINISHED parsing task dependencies...................................... [DONE in 0.01s]
────────────────────────────────────────────── Tasks ──────────────────────────────────────────────
[HH:MM:SS] INFO | 1 of 2 RUNNING TASK example-decorated-task.............................. [RUN]
[HH:MM:SS] INFO | 1 of 2 FINISHED TASK example-decorated-task............................. [DONE in 0.02s]
[HH:MM:SS] INFO | 2 of 2 RUNNING TASK example-class-task.................................. [RUN]
[HH:MM:SS] INFO | 2 of 2 FINISHED TASK example-class-task................................. [DONE in 0.02s]
Done!
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
To get started with Prism projects, check out our documentation. Some sections of interest include:
In addition, check out some example projects.
Prism integrates with a wide variety of third-party developer tools There are two kinds of integrations that Prism supports: adapters, and agents.
Adapters allow users to connect to data warehouses or analytics engines. Prism currently supports the following adapters:
Adapter | Command |
---|---|
Google BigQuery | pip install "prism-ds[bigquery]" |
Postgres | pip install "prism-ds[postgres]" |
Presto | pip install "prism-ds[presto]" |
Redshift | pip install "prism-ds[redshift]" |
Snowflake | pip install "prism-ds[snowflake]" |
Trino | pip install "prism-ds[trino]" |
We're always looking to improve our product. Here's what we're working on at the moment:
Let us know if you'd like to see another feature!