[CANCELLED] eval(omlx): oMLX parallel-inference evaluation (abandoned) by FluffyAIcode · Pull Request #151 · FluffyAIcode/Kakeya-LLM-Inference-engine

FluffyAIcode · 2026-06-18T07:19:34Z

CANCELLED — please close this PR. Per the user's request, the oMLX evaluation is abandoned and the head branch (AgentMemory/omlx-parallel-inference-eval-2815) has been deleted, so there is nothing to merge. GitHub did not auto-close it; one click on Close pull request will finish it off.

(Original scope: a read-only omlx-env-probe found oMLX was not installed on the Mac runner, and a parallel-inference bench was prepared but never run. All of it lived only on the now-deleted branch.)

… capture its launch CLI Prereq for evaluating whether oMLX (jundot/omlx) continuous-batching can do the Gemma-4 parallel inference vllm-mlx could not. Read-only: detects CLI/app bundle/brew/pip and dumps --help/serve|launch help; no server, no model load. Co-authored-by: FluffyAIcode <FluffyAIcode@users.noreply.github.com>

Drives an already-running oMLX OpenAI server (OMLX_BASE_URL/OMLX_MODEL) with N unique-needle requests serially then concurrently; reports errors, per-request correctness (no cross-request contamination), and wall speedup — the exact Gemma-4 parallel case vllm-mlx crashed on (shared_kv TypeError). Stdlib-only (urllib+threads). Ready to run once oMLX is installed + serving on the Mac. Co-authored-by: FluffyAIcode <FluffyAIcode@users.noreply.github.com>

cursoragent and others added 2 commits June 18, 2026 07:13

github-actions Bot added the needs-mac-m4 label Jun 18, 2026

cursor Bot deleted the AgentMemory/omlx-parallel-inference-eval-2815 branch June 18, 2026 07:23

cursor Bot changed the title ~~eval(omlx): probe + parallel-inference bench to test oMLX continuous batching on Gemma-4~~ [CANCELLED] eval(omlx): oMLX parallel-inference evaluation (abandoned) Jun 18, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CANCELLED] eval(omlx): oMLX parallel-inference evaluation (abandoned)#151

[CANCELLED] eval(omlx): oMLX parallel-inference evaluation (abandoned)#151
FluffyAIcode wants to merge 2 commits into
mainfrom
AgentMemory/omlx-parallel-inference-eval-2815

FluffyAIcode commented Jun 18, 2026 •

edited by cursor Bot

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

FluffyAIcode commented Jun 18, 2026 • edited by cursor Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

FluffyAIcode commented Jun 18, 2026 •

edited by cursor Bot

Loading