
Cache: add new flag to distinguish models that support Cache but not static cache #30800

Merged
merged 4 commits into huggingface:main from jamba_cache on May 16, 2024

Conversation

@gante (Member) commented on May 14, 2024

What does this PR do?

See title :)

Models like Jamba (which will only ever support HybridMambaAttentionDynamicCache) or Mistral (which currently only supports dynamic caches, such as SinkCache) need this distinction in their support flags in order to run the appropriate checks and tests.
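As a rough illustration of the distinction (a minimal sketch; the flag name `_supports_static_cache` and the trimmed class bodies are assumptions, not copied from this PR's diff):

```python
# Class-level capability flags, as plain Python attributes on the model
# classes. `_supports_cache_class` already exists in transformers; the
# second flag is assumed to be the one this PR introduces.

class JambaPreTrainedModel:
    # Jamba accepts Cache objects (its own HybridMambaAttentionDynamicCache)...
    _supports_cache_class = True
    # ...but can never use StaticCache, so the new flag stays False.
    _supports_static_cache = False


class LlamaPreTrainedModel:
    # Llama supports both dynamic cache classes and the static cache.
    _supports_cache_class = True
    _supports_static_cache = True
```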

Without this flag, we are stuck with the following bad practices (a sketch of the check that replaces the second one follows this list):

  1. manually skipping tests
  2. skipping cache-support checks in generate
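A minimal sketch of the generate-side check that the flag enables (assumed logic, consistent with the "generate exception" commit listed below; the helper name, error message, and call site are hypothetical):

```python
from typing import Optional


def _check_cache_support(model, cache_implementation: Optional[str]) -> None:
    # Hypothetical helper: fail fast with a clear error instead of
    # crashing deep inside the forward pass or silently misbehaving.
    if cache_implementation == "static" and not getattr(
        model, "_supports_static_cache", False
    ):
        raise ValueError(
            f"{model.__class__.__name__} does not support a static cache. "
            "It only supports dynamic cache classes."
        )
```

With such a check in place, calling generate(..., cache_implementation="static") on a Jamba model raises immediately, and the corresponding tests no longer need manual skips.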

@gante requested a review from zucchini-nlp on May 14, 2024 12:05
@HuggingFaceDocBuilderDev commented

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@gante changed the title from "Jamba: set _supports_cache_class to False, jamba has its own cache type" to "Cache: add new flag to distinguish models that support Cache but not static cache" on May 14, 2024
@gante requested a review from ArthurZucker on May 14, 2024 12:45
@zucchini-nlp (Member) left a comment

Thanks for fixing the tests!

@ArthurZucker (Collaborator) left a comment

Alright, I'm not a super fan of increasing the number of things a model supports by adding more attributes, but it's fine for now.

@gante (Member, Author) commented on May 16, 2024

@ArthurZucker we can remove this attribute when we update all Llama-copied models to support the static cache :D

@gante merged commit 9d889f8 into huggingface:main on May 16, 2024
23 checks passed
@gante deleted the jamba_cache branch on May 16, 2024 11:08
itazap pushed a commit that referenced this pull request May 24, 2024
… cache (#30800)

* jamba cache

* new flag

* generate exception