MetaBox: Benchmarking Platform for Meta-Black-Box Optimization
Updated Oct 10, 2025 - Python
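LLM evaluation platform: three-way backend support (local/API/HuggingFace/OpenCompass), with data production (Self-Instruct/Evol-Instruct), long-tail scenario generation, weakness mining, regression analysis, contamination detection, and bad-case attribution. Includes an extensible benchmark system and LLM-as-Judge automatic scoring.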