Add streamxl — Rust-powered streaming XLSX reader for Python#1

Open
Mullassery wants to merge 1 commit into rp-libs:main from Mullassery:add-streamxl
Conversation

@Mullassery

streamxl

https://github.com/Mullassery/StreamXL

A Python library that reads .xlsx files row-by-row using a Rust engine exposed via PyO3 + maturin.

Why it belongs here:

  • Pure Rust core using zip + quick-xml crates
  • PyO3 0.23 bindings, built with maturin
  • 4–5× faster than openpyxl across all tested file sizes (~27,000 rows/sec on Apple Silicon)
  • Streams row-by-row — peak memory stays proportional to the shared string table, not file size

Author: Georgi Mullassery
License: MIT
Install: pip install streamxl / uv add streamxl

Added to the Data Processing section in alphabetical order.

@gemini-code-assist (bot) left a comment

Code Review

This pull request adds the streamxl library to the list of high-performance data processing and serialization libraries in the README.md. The reviewer suggested removing the author's name from the description to maintain consistency with the other entries in the list.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

Comment thread README.md
- [polars](https://github.com/pola-rs/polars) - DataFrame library with a Pandas-like API.
- [pydantic-core](https://github.com/pydantic/pydantic-core) - Core validation logic for Pydantic v2.
- [rustworkx](https://github.com/Qiskit/rustworkx) - High-performance Python graph library implemented in Rust.
- [streamxl](https://github.com/Mullassery/StreamXL) - Streaming XLSX reader for Python powered by a Rust engine (PyO3 + quick-xml). Reads large Excel files row-by-row without loading the full file into memory; 4–5× faster than openpyxl. By Georgi Mullassery.

medium

To maintain consistency with the other entries in this list, please remove the author's name ("By Georgi Mullassery.") from the description. None of the other listed projects include author attribution in their descriptions.

Suggested change
Before:
- [streamxl](https://github.com/Mullassery/StreamXL) - Streaming XLSX reader for Python powered by a Rust engine (PyO3 + quick-xml). Reads large Excel files row-by-row without loading the full file into memory; 4–5× faster than openpyxl. By Georgi Mullassery.
After:
- [streamxl](https://github.com/Mullassery/StreamXL) - Streaming XLSX reader for Python powered by a Rust engine (PyO3 + quick-xml). Reads large Excel files row-by-row without loading the full file into memory; 4–5× faster than openpyxl.
