Skip to content

Commit b26a190

Browse files
yuanzhuangyuanzhuangYuan Zhuangshrgupta773
authored
fix wrong para position for the newly added params for AOAI (#30849)
* fix wrong para position for the newly added params * Add parallel_tool_calls to assistant api * bug fix --------- Co-authored-by: Yuan Zhuang <yuzhua@microsoft.com> Co-authored-by: Shruti Gupta <shrgupta@microsoft.com>
1 parent c1b5abd commit b26a190

File tree

2 files changed

+20
-4
lines changed

2 files changed

+20
-4
lines changed

specification/cognitiveservices/data-plane/AzureOpenAI/inference/preview/2024-09-01-preview/inference.json

Lines changed: 14 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -3739,6 +3739,10 @@
37393739
"nullable": true,
37403740
"description": "Number between -2.0 and 2.0. Positive values penalize new tokens based on whether they appear in the text so far, increasing the model's likelihood to talk about new topics.\n"
37413741
},
3742+
"parallel_tool_calls": {
3743+
"$ref": "#/components/schemas/ParallelToolCalls",
3744+
"nullable": true
3745+
},
37423746
"response_format": {
37433747
"description": "An object specifying the format that the model must output. Compatible with [GPT-4o](https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models#gpt-4-and-gpt-4-turbo-models), [GPT-4o mini](https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models#gpt-4-and-gpt-4-turbo-models), [GPT-4 Turbo](https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models#gpt-4-and-gpt-4-turbo-models) and all [GPT-3.5](https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models#gpt-35) Turbo models newer than `gpt-3.5-turbo-1106`.\n\nSetting to `{ \"type\": \"json_schema\", \"json_schema\": {...} }` enables Structured Outputs which guarantees the model will match your supplied JSON schema.\n\nSetting to `{ \"type\": \"json_object\" }` enables JSON mode, which guarantees the message the model generates is valid JSON.\n\n**Important:** when using JSON mode, you **must** also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly \"stuck\" request. Also note that the message content may be partially cut off if `finish_reason=\"length\"`, which indicates the generation exceeded `max_tokens` or the conversation exceeded the max context length.\n",
37443748
"oneOf": [
@@ -3785,6 +3789,10 @@
37853789
"nullable": true,
37863790
"default": false
37873791
},
3792+
"stream_options": {
3793+
"$ref": "#/components/schemas/chatCompletionStreamOptions",
3794+
"nullable": true
3795+
},
37883796
"temperature": {
37893797
"type": "number",
37903798
"minimum": 0,
@@ -8202,10 +8210,6 @@
82028210
"nullable": true,
82038211
"description": "If `true`, returns a stream of events that happen during the Run as server-sent events, terminating when the Run enters a terminal state with a `data: [DONE]` message.\n"
82048212
},
8205-
"stream_options": {
8206-
"$ref": "#/components/schemas/chatCompletionStreamOptions",
8207-
"nullable": true
8208-
},
82098213
"max_prompt_tokens": {
82108214
"type": "integer",
82118215
"nullable": true,
@@ -8226,6 +8230,9 @@
82268230
"$ref": "#/components/schemas/assistantsApiToolChoiceOption",
82278231
"nullable": true
82288232
},
8233+
"parallel_tool_calls": {
8234+
"$ref": "#/components/schemas/ParallelToolCalls"
8235+
},
82298236
"response_format": {
82308237
"$ref": "#/components/schemas/assistantsApiResponseFormatOption",
82318238
"nullable": true
@@ -8486,6 +8493,9 @@
84868493
"$ref": "#/components/schemas/assistantsApiToolChoiceOption",
84878494
"nullable": true
84888495
},
8496+
"parallel_tool_calls": {
8497+
"$ref": "#/components/schemas/ParallelToolCalls"
8498+
},
84898499
"response_format": {
84908500
"$ref": "#/components/schemas/assistantsApiResponseFormatOption",
84918501
"nullable": true

specification/cognitiveservices/data-plane/AzureOpenAI/inference/preview/2024-09-01-preview/inference.yaml

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2488,6 +2488,8 @@ components:
24882488
example: 1
24892489
nullable: true
24902490
description: How many chat completion choices to generate for each input message. Note that you will be charged based on the number of generated tokens across all of the choices. Keep `n` as `1` to minimize costs.
2491+
parallel_tool_calls:
2492+
$ref: "#/components/schemas/ParallelToolCalls"
24912493
presence_penalty:
24922494
type: number
24932495
default: 0
@@ -5723,6 +5725,8 @@ components:
57235725
tool_choice:
57245726
$ref: "#/components/schemas/assistantsApiToolChoiceOption"
57255727
nullable: true
5728+
parallel_tool_calls:
5729+
$ref: "#/components/schemas/ParallelToolCalls"
57265730
response_format:
57275731
$ref: "#/components/schemas/assistantsApiResponseFormatOption"
57285732
nullable: true
@@ -5922,6 +5926,8 @@ components:
59225926
tool_choice:
59235927
$ref: "#/components/schemas/assistantsApiToolChoiceOption"
59245928
nullable: true
5929+
parallel_tool_calls":
5930+
$ref: "#/components/schemas/ParallelToolCalls"
59255931
response_format:
59265932
$ref: "#/components/schemas/assistantsApiResponseFormatOption"
59275933
nullable: true

0 commit comments

Comments
 (0)
pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy