Investigate why Unispac/Llama2-7B-Chat-Augmented is so slow #24

Open

Labels

opened

Unispac/Llama2-7B-Chat-Augmented is very slow in generation with generate_ragged_batched - would be nice to understand why.

Metadata

Assignees

No one assigned

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests