Robin Sparkless

PySpark-style DataFrames in Rust—no JVM. A DataFrame library that mirrors PySpark's API and semantics while using Polars as the execution engine. The same engine powers Sparkless v4 for Python: a drop-in PySpark replacement with no JVM and no Polars Python at runtime.

What is Robin Sparkless?

Robin Sparkless provides a PySpark-like API in Rust so you can write familiar DataFrame code without the JVM. It is designed to power Sparkless—the Python PySpark drop-in replacement—as its execution backend via PyO3.

| Feature | Description |
| --- | --- |
| Core | SparkSession, DataFrame; lazy by default. ExprIr (engine-agnostic): root col/lit_*/gt/… → filter_expr_ir, collect_rows, agg_expr_ir. Column/Expr (Polars): prelude/functions → filter, with_column, full expression set. Plus groupBy, joins |
| Engine | Polars for fast, native execution |
| Optional | SQL (spark.sql, temp views, global temp views, saveAsTable in-memory or warehouse), Delta Lake (read_delta / write_delta) |
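
The Core feature pairs an engine-agnostic expression tree with lazy execution. The sketch below is a toy illustration of that idea only: `ToyExpr`, `ToyLazyFrame`, `row`, and `demo_filtered_ages` are hypothetical names, and the helper names `col`, `lit_i64`, `gt`, `filter_expr_ir`, and `collect_rows` are borrowed from the feature list for readability. The signatures are assumptions and do not reflect the crate's real API, and nothing here uses Polars.

```rust
use std::collections::HashMap;

/// Hypothetical expression tree; NOT the crate's real ExprIr type.
#[derive(Debug, Clone)]
enum ToyExpr {
    Col(String),
    LitI64(i64),
    Gt(Box<ToyExpr>, Box<ToyExpr>),
}

// Builder helpers; names mirror the feature list, signatures are assumed.
fn col(name: &str) -> ToyExpr {
    ToyExpr::Col(name.to_string())
}
fn lit_i64(value: i64) -> ToyExpr {
    ToyExpr::LitI64(value)
}
fn gt(left: ToyExpr, right: ToyExpr) -> ToyExpr {
    ToyExpr::Gt(Box::new(left), Box::new(right))
}

type Row = HashMap<String, i64>;

/// Toy lazy frame: filters are only recorded until `collect_rows` runs them.
struct ToyLazyFrame {
    rows: Vec<Row>,
    filters: Vec<ToyExpr>,
}

impl ToyLazyFrame {
    fn new(rows: Vec<Row>) -> Self {
        Self { rows, filters: Vec::new() }
    }

    /// Records the predicate; no row is inspected here.
    fn filter_expr_ir(mut self, predicate: ToyExpr) -> Self {
        self.filters.push(predicate);
        self
    }

    /// Evaluates every recorded predicate and returns the matching rows in order.
    fn collect_rows(&self) -> Vec<Row> {
        self.rows
            .iter()
            .filter(|row| self.filters.iter().all(|f| eval_bool(f, row)))
            .cloned()
            .collect()
    }
}

/// Evaluates integer-valued nodes; a comparison node has no integer value.
fn eval_i64(expr: &ToyExpr, row: &Row) -> Option<i64> {
    match expr {
        ToyExpr::Col(name) => row.get(name).copied(),
        ToyExpr::LitI64(value) => Some(*value),
        ToyExpr::Gt(..) => None,
    }
}

/// Evaluates boolean-valued nodes; missing columns count as "no match".
fn eval_bool(expr: &ToyExpr, row: &Row) -> bool {
    match expr {
        ToyExpr::Gt(left, right) => match (eval_i64(left, row), eval_i64(right, row)) {
            (Some(a), Some(b)) => a > b,
            _ => false,
        },
        _ => false,
    }
}

/// Builds a single-column row for the demo.
fn row(age: i64) -> Row {
    HashMap::from([("age".to_string(), age)])
}

/// Filters three rows with `age > 18` and returns the surviving ages.
fn demo_filtered_ages() -> Vec<i64> {
    let frame = ToyLazyFrame::new(vec![row(34), row(17), row(51)])
        .filter_expr_ir(gt(col("age"), lit_i64(18)));
    frame.collect_rows().iter().map(|r| r["age"]).collect()
}
```

Because the expression is plain data rather than a Polars object, a backend could in principle translate it to any engine at collect time, which is the point of keeping the IR engine-agnostic. Consult the Rust API documentation for the actual types and method signatures.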

Documentation

Rust

Python (Sparkless v4) — mirrors Sparkless doc structure

For the full list of documents, see the Doc index in the navigation.

Rust API

License

MIT