Add LAQ/LSQ support and recipes#1884
Conversation
Signed-off-by: realAsma <akuriparambi@nvidia.com>
|
Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually. Contributors can view more details about this message here. |
|
Important Review skippedDraft detected. Please check the settings in the CodeRabbit UI or the ⚙️ Run configurationConfiguration used: Path: .coderabbit.yaml Review profile: CHILL Plan: Enterprise Run ID: You can disable this status message by setting the Use the checkbox below for a quick retry:
✨ Finishing Touches🧪 Generate unit tests (beta)
Comment |
Codecov Report❌ Patch coverage is Additional details and impacted files@@ Coverage Diff @@
## main #1884 +/- ##
==========================================
- Coverage 61.16% 61.13% -0.04%
==========================================
Files 515 515
Lines 57216 57474 +258
==========================================
+ Hits 34995 35135 +140
- Misses 22221 22339 +118
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Harness. 🚀 New features to boost your workflow:
|
Summary
main$importrecipes and setquantize_pre_scale: falseqad_laq_scales.yamlfor scale-only QADValidation
pre-commit run --files ...git diff --checkpytest_pwd tests/unit/recipe/test_laq_recipes.py tests/unit/torch/quantization/test_laq.py(37 passed)Parity
Compared the new recipes against the original LAQ folder reference at
/home/scratch.akuriparambi_coreai/Model-Optimizer-LAQ. The two new recipes resolve to the same core LAQ algorithm and weight/input/KV/lm_head quantizer settings as the old LAQ recipes, with the new explicitquantize_pre_scale: falsesetting.