Skip to content

Llama-2 proj weight #15

@shanhx2000

Description

@shanhx2000

In greedyopt.py, why k, v, weight ratio is set to 1/8? For Llama2-7B, shouldn't it be 1? (4096x4096)

"Llama-2-7B": {
'q': 1, 'k': 1/8, 'v': 1/8, 'o': 1,
'gate': 2.6875, 'up': 2.6875, 'down': 2.6875
},

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions