Parsing Non‑Standard Function Calls from GigaChat3‑10B‑A1.8B Model Responses

by gdagil - opened 10 days ago

10 days ago

I am trying to use the OpenAI‑compatible API (the v1/chat/completions endpoint) with the model ai-sage/GigaChat3-10B-A1.8B hosted on Hugging Face. In the request I include a tools section (function‑calling) according to the OpenAI specification:

{
  "model": "ai-sage/GigaChat3-10B-A1.8B",
  "messages": [
    {"role": "user", "content": "Save to my personal memory: prefers short answers"}
  ],
  "tools": [
    {
      "type": "function",
      "function": {
        "name": "manage_user_memory",
        "description": "...",
        "parameters": {
          "type": "object",
          "properties": {
            "content": {"anyOf":[{"type":"string"},{"type":"null"}],"default":null},
            "action": {"type":"string","enum":["create","update","delete"],"default":"create"},
            "id": {"anyOf":[{"type":"string","format":"uuid"},{"type":"null"}],"default":null}
          }
        }
      }
    }
  ]
}

The server’s response looks like this:

{
  "id": "...",
  "model": "ai-sage/GigaChat3-10B-A1.8B",
  "choices": [
    {
      "message": {
        "role": "assistant",
        "content": "<|message_sep|>\n\nfunction call<|role_sep|>\n{\"name\": \"manage_user_memory\", \"arguments\": {\"action\": \"create\", \"content\": \"Prefers short answers\"}}",
        "function_call": null,
        "tool_calls": []
      },
      "finish_reason": "stop"
    }
  ]
}

The model does produce a function call, but it does not populate the official function_call or tool_calls fields defined by OpenAI. Instead, it embeds a serialized function call as a plain string inside the content field. Because of this, client libraries such as openai, LangChain, or vLLM/sglang cannot automatically detect and handle the tool call; it has to be parsed manually

gdagil

10 days ago

Could anyone share a reliable code snippet (Python, JavaScript, or another language) that extracts the function name and arguments from the content field returned by the model ai-sage/GigaChat3-10B-A1.8B? The response embeds a serialized function call inside a plain string, e.g.:

{
  "role": "assistant",
  "content": "<|message_sep|>\n\nfunction call<|role_sep|>\n{\"name\": \"manage_user_memory\", \"arguments\": {\"action\": \"create\", \"content\": \"Prefers short answers\"}}"
}

bsfg

ai-sage org 9 days ago

Hi! Please use this function to extract the function name and arguments, make sure there is no EOS at the end:

import json
import re
REGEX_FUNCTION_CALL_V3 = re.compile(r"function call<\|role_sep\|>\n(.*)$", re.DOTALL)
REGEX_CONTENT_PATTERN = re.compile(r"^(.*?)<\|message_sep\|>", re.DOTALL)
def parse_function_and_content(completion_str: str):
    """
    Using the regexes the user provided, attempt to extract function call and content.
    Returns (function_call_str_or_None, content_str_or_None)
    """

    function_call = None
    content = None

    m_func = REGEX_FUNCTION_CALL_V3.search(completion_str)
    if m_func:
        try:
            function_call = json.loads(m_func.group(1))
            if isinstance(function_call, dict) and "name" in function_call and "arguments" in function_call:
                if not isinstance(function_call["arguments"], dict):
                    function_call = None
            else:
                function_call = None
        except json.JSONDecodeError:
            function_call = None

            # will return raw string in failed attempt of function calling
            return function_call, completion_str

    m_content = REGEX_CONTENT_PATTERN.search(completion_str)
    if m_content:
        content = m_content.group(1)
    else:
        # as a fallback, everything before the first message_sep marker if present
        if "<|message_sep|>" in completion_str:
            content = completion_str.split("<|message_sep|>")[0]
        else:
            content = completion_str

    return function_call, content

bulatovv

1 day ago

•

edited 1 day ago

А с каким --tool-call-parser надо запускать vllm чтобы этот формат парсился?

UPD: как я понял, под такой формат нет парсера в vllm, навайбкодил под нужный формат плагин, используйте на свой страх и риск
https://gist.github.com/bulatovv/b9b5116a0af14fe09164146ed8eabafb

При запуске прокиньте полный путь до плагина с этими параметрами

        --enable-auto-tool-choice \
        --tool-parser-plugin /.../.../gigachat3_tool_parser.py \
        --tool-call-parser gigachat3 \

bsfg

ai-sage org about 15 hours ago

Full support in vLLM, SGLang and llama.cpp It's on the way.
Our temporary solution is available for vLLM in a separate branch - https://github.com/vllm-project/vllm/pull/29905 .
In llama.cpp function calls are also available, but with a number of technical limitations — we have added detailed instructions to the description of the GGUF model.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment