update model_max_length for a valuable signal on whether this is Schnell or Dev model

Currently we have to check the model name or its guidance embedding configurations, but both of these are editable by continued finetuning. The sequence length cannot be changed through fine-tuning, it requires continued pretraining and corrected attn_mask handling during SDPA.

This is a humble request that should improve the utility of Schnell with less work for downstream adaptations.

Files changed (1) hide show

tokenizer_2/tokenizer_config.json +1 -1

tokenizer_2/tokenizer_config.json CHANGED Viewed

@@ -932,7 +932,7 @@
   "eos_token": "</s>",
   "extra_ids": 100,
   "legacy": true,
-  "model_max_length": 512,
   "pad_token": "<pad>",
   "sp_model_kwargs": {},
   "tokenizer_class": "T5Tokenizer",

   "eos_token": "</s>",
   "extra_ids": 100,
   "legacy": true,
+  "model_max_length": 256,
   "pad_token": "<pad>",
   "sp_model_kwargs": {},
   "tokenizer_class": "T5Tokenizer",