Add AudiencePro, statguard, and StreamXL #2
…L, data quality, and Excel streaming
Code Review
This pull request adds three new libraries (statguard, StreamXL, and AudiencePro) to the README.md file. The review feedback points out stylistic inconsistencies regarding the use of em-dashes in the descriptions of statguard and AudiencePro, suggesting alternative phrasing to align with the rest of the document.
- [polars](https://github.com/pola-rs/polars) - DataFrame library with a Pandas-like API.
- [pydantic-core](https://github.com/pydantic/pydantic-core) - Core validation logic for Pydantic v2.
- [rustworkx](https://github.com/Qiskit/rustworkx) - High-performance Python graph library implemented in Rust.
- [statguard](https://github.com/Mullassery/statguard) - Declarative data quality and validation library — schema checks, drift detection (PSI + KS), anomaly detection, and native Delta Lake/Iceberg support. 13–25× faster than pandera and Great Expectations.
The use of the em-dash (—) with spaces is inconsistent with the rest of the README.md file, which uses standard hyphens, colons, or prepositions for punctuation. Consider rephrasing to use a preposition like 'supporting' to maintain stylistic consistency across the list.
- [statguard](https://github.com/Mullassery/statguard) - Declarative data quality and validation library — schema checks, drift detection (PSI + KS), anomaly detection, and native Delta Lake/Iceberg support. 13–25× faster than pandera and Great Expectations.
- [statguard](https://github.com/Mullassery/statguard) - Declarative data quality and validation library supporting schema checks, drift detection (PSI + KS), anomaly detection, and native Delta Lake/Iceberg support. 13–25× faster than pandera and Great Expectations.
Tools for machine learning, NLP, and AI applications.
- [AudiencePro](https://github.com/Mullassery/AudiencePro) - Python library for customer segmentation — RFM analysis, KMeans/K-Prototypes clustering, drift detection, and streaming updates at 10–25× the speed of scikit-learn + pandas.
The use of the em-dash (—) with spaces is inconsistent with the rest of the README.md file, which uses standard hyphens, colons, or prepositions for punctuation. Consider rephrasing to use a preposition like 'featuring' to maintain stylistic consistency across the list.
- [AudiencePro](https://github.com/Mullassery/AudiencePro) - Python library for customer segmentation — RFM analysis, KMeans/K-Prototypes clustering, drift detection, and streaming updates at 10–25× the speed of scikit-learn + pandas.
- [AudiencePro](https://github.com/Mullassery/AudiencePro) - Python library for customer segmentation featuring RFM analysis, KMeans/K-Prototypes clustering, drift detection, and streaming updates at 10–25× the speed of scikit-learn + pandas.
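Both review comments flag the same pattern: an em-dash with a space on each side inside a list entry. A check for it could be sketched roughly as below. The function name `find_spaced_em_dashes` and the sample entries are hypothetical and are not part of this PR or of any tooling in the repository; the sketch only assumes that list entries start with `- `.

```python
import re

# An em-dash (U+2014) with a space on each side, the pattern the review flags.
# The en-dash (U+2013) used in numeric ranges is deliberately not matched.
SPACED_EM_DASH = re.compile(" \u2014 ")


def find_spaced_em_dashes(text):
    """Return (line_number, line) pairs for list entries containing a spaced em-dash."""
    hits = []
    for number, line in enumerate(text.splitlines(), start=1):
        # Only look at Markdown list entries, i.e. lines that start with "- ".
        if line.lstrip().startswith("- ") and SPACED_EM_DASH.search(line):
            hits.append((number, line))
    return hits


# Hypothetical sample: one entry in the existing style, one with a spaced em-dash.
sample = "\n".join([
    "- [polars](https://github.com/pola-rs/polars) - DataFrame library with a Pandas-like API.",
    "- [statguard](https://github.com/Mullassery/statguard) - Data quality library \u2014 schema checks.",
])

for number, _ in find_spaced_em_dashes(sample):
    print(f"line {number}: spaced em-dash in list entry")
```

Run against the full README.md text, a check like this would list the entries to rephrase before merging; whether the project wants such a check automated is a separate decision.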
New libraries
Three Rust-powered Python libraries for data engineering and ML:
Data Processing
Machine Learning & AI
All three use PyO3 bindings, are pip/uv installable, and support Python 3.8+.