Support intern-s1 #14875

RunningLeon · 2025-07-25T11:33:20Z

convert_hf_to_gguf.py

ngxson · 2025-07-25T15:23:12Z

convert_hf_to_gguf.py

+            name = name.replace(r".lambda_2", r".ls2")
+            name = name.replace(r".layernorm_before.", r".norm1.")
+            name = name.replace(r".layernorm_after.", r".norm2.")
+        return name


Use self.map_tensor_name(name) with proper mapping

this name remapping is only for intern-s1, so I believe this is better to put the logic in a function and mapping it the internvlchat model weight names.

Please at least add the vision_tower/vision_model mappings to gguf-py/gguf/tensor_mapping.py.

ok. will update later.

convert_hf_to_gguf.py

CISC · 2025-07-29T09:22:11Z

The Python Type-Check CI needs to be resolved.

RunningLeon · 2025-07-30T06:16:05Z

The Python Type-Check CI needs to be resolved.

@CISC hi, could you tell how to fix this error? Seems not reasonable to me

/home/runner/work/llama.cpp/llama.cpp/convert_hf_to_gguf.py:3219:23 - error: Object of type "None" is not subscriptable (reportOptionalSubscript)
Error: Object of type "None" is not subscriptable (reportOptionalSubscript)
/home/runner/work/llama.cpp/llama.cpp/convert_hf_to_gguf.py:3220:13 - error: Object of type "None" is not subscriptable (reportOptionalSubscript)
Error: Object of type "None" is not subscriptable (reportOptionalSubscript)
/home/runner/work/llama.cpp/llama.cpp/convert_hf_to_gguf.py:3220:49 - error: Object of type "None" is not subscriptable (reportOptionalSubscript)
Error: Object of type "None" is not subscriptable (reportOptionalSubscript)
/home/runner/work/llama.cpp/llama.cpp/convert_hf_to_gguf.py:32[21](https://github.com/ggml-org/llama.cpp/actions/runs/16612224904/job/46997396567?pr=14875#step:5:22):23 - error: Object of type "None" is not subscriptable (reportOptionalSubscript)
Error: Object of type "None" is not subscriptable (reportOptionalSubscript)
/home/runner/work/llama.cpp/llama.cpp/convert_hf_to_gguf.py:3[22](https://github.com/ggml-org/llama.cpp/actions/runs/16612224904/job/46997396567?pr=14875#step:5:23)2:13 - error: Object of type "None" is not subscriptable (reportOptionalSubscript)
Error: Object of type "None" is not subscriptable (reportOptionalSubscript)
/home/runner/work/llama.cpp/llama.cpp/convert_hf_to_gguf.py:3222:49 - error: Object of type "None" is not subscriptable (reportOptionalSubscript)
Error: Object of type "None" is not subscriptable (reportOptionalSubscript)
6 errors, 0 warnings, 14 informations
Error: 6 errors

CISC · 2025-07-30T06:40:02Z

The Python Type-Check CI needs to be resolved.

@CISC hi, could you tell how to fix this error? Seems not reasonable to me

Running pyright locally helps, the line numbers are wrong for some reason, this is the actual erroneous codeblock:

llama.cpp/convert_hf_to_gguf.py

Lines 3002 to 3005 in 5eba3e3

    
           if isinstance(self.hparams_vision['image_size'], list): 
        
               self.hparams_vision['image_size'] = self.hparams_vision['image_size'][0] 
        
           if isinstance(self.hparams_vision['patch_size'], list): 
        
               self.hparams_vision['patch_size'] = self.hparams_vision['patch_size'][0]

CISC · 2025-07-30T06:44:02Z

convert_hf_to_gguf.py

@@ -2998,7 +2999,12 @@ def modify_tensors(self, data_torch: Tensor, name: str, bid: int | None) -> Iter
 @ModelBase.register("InternVisionModel")
 class InternVisionModel(MmprojModel):
    def set_gguf_parameters(self):
+        if isinstance(self.hparams_vision['image_size'], list):


Suggested change

if isinstance(self.hparams_vision['image_size'], list):

assert self.hparams_vision is not None

if isinstance(self.hparams_vision['image_size'], list):

ngxson · 2025-07-30T09:10:47Z

convert_hf_to_gguf.py

+        names_map = {
+            "model.multi_modal_projector.layer_norm.bias": "mlp1.0.bias",
+            "model.multi_modal_projector.layer_norm.weight": "mlp1.0.weight",
+            "model.multi_modal_projector.linear_1.bias": "mlp1.1.bias",
+            "model.multi_modal_projector.linear_1.weight": "mlp1.1.weight",
+            "model.multi_modal_projector.linear_2.bias": "mlp1.3.bias",
+            "model.multi_modal_projector.linear_2.weight": "mlp1.3.weight",
+        }


Hmm ok I think the mapping of 6 tensors can't be added to tensor_mapping.py, as it will mess up conversion for other models. So it's ok to keep these 6 tensors here for now.

But one thing I'm not sure, why mapped name you are using is mlp1.%d.%s? I think it should be mm.model.mlp.%d.%s to match the original InternVL model

it just maps interns1 weight name to internvl weight name

llama.cpp/gguf-py/gguf/tensor_mapping.py

Line 1078 in 00131d6

"mlp1.{bid}", # InternVL

ngxson · 2025-07-30T09:13:37Z

gguf-py/gguf/tensor_mapping.py

@@ -1190,6 +1205,7 @@ class TensorNameMap:

        MODEL_TENSOR.V_MM_INP_NORM: (
            "multi_modal_projector.norm",
+            "model.multi_modal_projector.layer_norm", # Intern-S1


This should be removed as the norm is already mapped to mlp.0. This is an artifact from InternVL: https://huggingface.co/OpenGVLab/InternVL3-8B-Instruct/blob/a34d3e4e129a5856abfd6aa6de79776484caa14e/modeling_internvl_chat.py#L79

Suggested change

"model.multi_modal_projector.layer_norm", # Intern-S1

RunningLeon added 2 commits July 16, 2025 20:55

support internvl

7cf5c4c

support interns1

859796e

github-actions bot added the python python script changes label Jul 25, 2025

CISC reviewed Jul 25, 2025

View reviewed changes

convert_hf_to_gguf.py Outdated Show resolved Hide resolved

convert_hf_to_gguf.py Outdated Show resolved Hide resolved

convert_hf_to_gguf.py Outdated Show resolved Hide resolved

convert_hf_to_gguf.py Outdated Show resolved Hide resolved

ngxson requested changes Jul 25, 2025

View reviewed changes

resolve comments

483ffef

put interns1 in tensor mapping

5eba3e3

CISC reviewed Jul 30, 2025

View reviewed changes

ngxson reviewed Jul 30, 2025

View reviewed changes

resolve comment

c71543c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Support intern-s1 #14875

Support intern-s1 #14875

RunningLeon commented Jul 25, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ngxson Jul 25, 2025

Uh oh!

RunningLeon Jul 28, 2025

Uh oh!

CISC Jul 29, 2025

Uh oh!

RunningLeon Jul 30, 2025

Uh oh!

Uh oh!

CISC commented Jul 29, 2025

Uh oh!

RunningLeon commented Jul 30, 2025

Uh oh!

CISC commented Jul 30, 2025

Uh oh!

CISC Jul 30, 2025

Uh oh!

ngxson Jul 30, 2025 •

edited

Loading

Uh oh!

RunningLeon Jul 30, 2025

Uh oh!

ngxson Jul 30, 2025

Uh oh!

Uh oh!

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

	if isinstance(self.hparams_vision['image_size'], list):
	assert self.hparams_vision is not None
	if isinstance(self.hparams_vision['image_size'], list):

Support intern-s1 #14875

Are you sure you want to change the base?

Support intern-s1 #14875

Conversation

RunningLeon commented Jul 25, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ngxson Jul 25, 2025

Choose a reason for hiding this comment

Uh oh!

RunningLeon Jul 28, 2025

Choose a reason for hiding this comment

Uh oh!

CISC Jul 29, 2025

Choose a reason for hiding this comment

Uh oh!

RunningLeon Jul 30, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

CISC commented Jul 29, 2025

Uh oh!

RunningLeon commented Jul 30, 2025

Uh oh!

CISC commented Jul 30, 2025

Uh oh!

CISC Jul 30, 2025

Choose a reason for hiding this comment

Uh oh!

ngxson Jul 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

RunningLeon Jul 30, 2025

Choose a reason for hiding this comment

Uh oh!

ngxson Jul 30, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

ngxson Jul 30, 2025 •

edited

Loading