Scripted API Info - Elsa API
IMPORTANT: The output varies depending on the api_plan you select. Below are the properties
that appear in “premium” plans.
{
"api_version": "code indicating the version of the API that is being called.",
"total_time": "Total duration of the audio (for all utterances)",
"utterance":"[] field displaying information about the played stream and the
results obtained by the user."
[
{
"initial_silence": "flag indicating whether there is at least one
frame of silence at the beginning of the audio recording. When returned as “false”, this is
useful to warn the user that they need to wait a bit before speaking.",
"sentence": "String with the word(s) that the user is prompted to
pronounce.",
"sentence_id": "ID of the processed sentence indicating which of the
input sentences is most likely to have been pronounced by the user. Default is 0
(only one sentence is input). This is used in our multi-input mode.",
"total_time": "duration of the utterance in seconds.",
"has_speech": "(deprecated, use attempt_type instead). flag indicating
whether the audio sample has any speech. It is a simple way to inform the user
when the microphone did not pick up anything. It is equivalent to the “UNSPOKEN”
type in attempt_type below.",
"attempt_type": "Characterizes the content of the speech attempt. It
can have the following values: ",
https://api-external-doc.elsanow.co/Scripted-api-info 1/20
12/19/23, 10:24 PM Scripted API info | Elsa API
},
{
"start_index": "position of the first character in the
sentence that is represented in this entry.",
"end_index": "position of the last character in the
sentence that is represented in this entry.",
"text": "set of characters in the word that
corresponds with the phoneme(s) of this entry.",
"trans": "phonetic transcription of the sounds
represented in this entry, in API format.",
"trans_arpabet": "phonetic transcription of the sounds
represented in this entry using ArpaBET format.",
"start_time": "time (in seconds) in which the sound in
this entry starts within the audio recording.",
"end_time": "time (in seconds) in which the sound in
this entry ends within the audio recording.",
"decision": "string indicating whether the user
correctly pronounced the phonemes in this entry. Possible values are “correct”,
“warning” or “error”.",
"nativeness_score": "score achieved by the user for
the sound or sounds represented in this entry, in the range 0-100.",
"phoneme_error": "whenever we find an error (decision
== “incorrect”) and the error is a substitution we detect, this field presents the
correct phonetic transcription and what we detected that the user pronounced. This
is in IPA format. If all is correct, we return “correct”.",
"phoneme_error_arpabet": "when the error is a
substitution, this field presents the correct phonetic transcription and that
pronounced by the user in ArpaBET. If all is correct, we return “correct”."
},
{
"start_index": "position of the first character in the
sentence that is represented in this entry.",
"end_index": "position of the last character in the
sentence that is represented in this entry.",
"text": "set of characters in the word that
corresponds with the phoneme(s) of this entry.",
"trans": "phonetic transcription of the sounds
represented in this entry, in API format.",
"trans_arpabet": "phonetic transcription of the sounds
represented in this entry using ArpaBET format.",
"start_time": "time (in seconds) in which the sound in
this entry starts within the audio recording.",
"end_time": "time (in seconds) in which the sound in
this entry ends within the audio recording.",
"decision": "string indicating whether the user
correctly pronounced the phonemes in this entry. Possible values are “correct”,
“warning” or “error”.",
"nativeness_score": "score achieved by the user for
the sound or sounds represented in this entry, in the range 0-100.",
"phoneme_error": "whenever we find an error (decision
== “incorrect”) and the error is a substitution we detect, this field presents the
correct phonetic transcription and what we detected that the user pronounced. This
is in IPA format. If all is correct, we return “correct”.",
"phoneme_error_arpabet": "when the error is a
substitution, this field presents the correct phonetic transcription and that
pronounced by the user in ArpaBET. If all is correct, we return “correct”."
"feedback":"List with automatically generated hints
for the user to improve their pronunciation of the sounds in this entry."
[
{
"id": "internal numeric identifier for the
feedback sentence.",
"audio_link": "link to an audio recording with
the hint in spoken format.",
"text": "string with the hint in written
format.",
"language": "language in which the feedback is
presented. By default hints are sent in English on the API results."
}
]
},
{
"start_index": "position of the first character in the
sentence that is represented in this entry.",
"end_index": "position of the last character in the
sentence that is represented in this entry.",
"text": "set of characters in the word that
corresponds with the phoneme(s) of this entry.",
"trans": "phonetic transcription of the sounds
represented in this entry, in API format.",
"trans_arpabet": "phonetic transcription of the sounds
represented in this entry using ArpaBET format.",
"start_time": "time (in seconds) in which the sound in
this entry starts within the audio recording.",
"end_time": "time (in seconds) in which the sound in
this entry ends within the audio recording.",
"decision": "string indicating whether the user
correctly pronounced the phonemes in this entry. Possible values are “correct”,
“warning” or “error”.",
"nativeness_score": "score achieved by the user for
the sound or sounds represented in this entry, in the range 0-100.",
"phoneme_error": "whenever we find an error (decision
== “incorrect”) and the error is a substitution we detect, this field presents the
correct phonetic transcription and what we detected that the user pronounced. This
is in IPA format. If all is correct, we return “correct”.",
"phoneme_error_arpabet": "when the error is a
substitution, this field presents the correct phonetic transcription and that
pronounced by the user in ArpaBET. If all is correct, we return “correct”."
}
],
“INCOMPLETE” → The user spoke only part of what was expected. This could also be
returned when the user speaks something unrelated but part of it is close enough to the
target text.
“UNSPOKEN” → There is no speech detected in the submitted audio.
“UNRELATED” → Speech is detected, but it does not seem to correspond to what was
requested.
“RELEVANT” → Text and audio match sufficiently well, the scoring is reliable.
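
The four attempt_type values above suggest a simple gate before showing any scores to the user. The following is a minimal Python sketch of that idea, not part of the API: the function name `check_attempt`, the message texts, and the choice to treat unknown or missing values as unreliable are all assumptions made for illustration. Only the four value names come from the list above.

```python
from typing import Optional, Tuple

# Illustrative messages; only the attempt_type value names come from the documentation.
ATTEMPT_MESSAGES = {
    "INCOMPLETE": "We only heard part of the sentence. Please read all of it.",
    "UNSPOKEN": "We could not hear any speech. Check your microphone and try again.",
    "UNRELATED": "That did not sound like the requested sentence. Please try again.",
}


def check_attempt(utterance: dict) -> Tuple[bool, Optional[str]]:
    """Return (scores_are_reliable, message_for_user) for one utterance entry."""
    attempt_type = utterance.get("attempt_type")
    if attempt_type == "RELEVANT":
        # Text and audio match well enough, so the scoring can be shown.
        return True, None
    # Unknown or missing values are treated as unreliable (an assumption of this sketch).
    return False, ATTEMPT_MESSAGES.get(attempt_type, "Please try again.")
```

A caller would typically display the scores only when the first element of the returned tuple is true, and show the message otherwise.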
{
"api_version": "code indicating the version of the API that is being called.",
"total_time": "Total duration of the audio (for all utterances)",
"utterance":"[] field displaying information about the played stream and the
results obtained by the user."
[
{
"initial_silence": "flag indicating whether there is at least one
frame of silence at the beginning of the audio recording. When returned as “false”, this is
useful to warn the user that they need to wait a bit before speaking.",
"sentence": "String with the word(s) that the user is prompted to
pronounce.",
"sentence_id": "ID of the processed sentence indicating which of the
input sentences is most likely to have been pronounced by the user. Default is 0
(only one sentence is input). This is used in our multi-input mode.",
"total_time": "duration of the utterance in seconds.",
"attempt_type": "Characterizes the content of the speech attempt. It
can have the following values: ",
"snr": " Signal to noise ratio (i.e. percentage of energy of speech
parts versus silent parts in the audio).",
"decision": "string indicating how proficient the user was at
pronouncing the utterance. It is computed from the parameter nativeness_score.
Possible values are “correct”, “almost_correct” or “incorrect”.",
"ipa": "ˈpi.tsə ˈpɑr.ti",
"intonation_score": "evaluates the ability to emphasize the key words
substitution, this field presents the correct phonetic transcription and that
pronounced by the user in ArpaBET. If all is correct, we return “correct”."
},
{
"start_index": "position of the first character in the
sentence that is represented in this entry.",
"end_index": "position of the last character in the
sentence that is represented in this entry.",
"text": "set of characters in the word that
corresponds with the phoneme(s) of this entry.",
"trans": "phonetic transcription of the sounds
represented in this entry, in API format.",
"trans_arpabet": "phonetic transcription of the sounds
represented in this entry using ArpaBET format.",
"start_time": "time (in seconds) in which the sound in
this entry starts within the audio recording.",
"end_time": "time (in seconds) in which the sound in
this entry ends within the audio recording.",
"decision": "string indicating whether the user
correctly pronounced the phonemes in this entry. Possible values are “correct”,
“warning” or “error”.",
"nativeness_score": "score achieved by the user for
the sound or sounds represented in this entry, in the range 0-100.",
"phoneme_error": "whenever we find an error (decision
== “incorrect”) and the error is a substitution we detect, this field presents the
correct phonetic transcription and what we detected that the user pronounced. This
is in IPA format. If all is correct, we return “correct”.",
"phoneme_error_arpabet": "when the error is a
substitution, this field presents the correct phonetic transcription and that
pronounced by the user in ArpaBET. If all is correct, we return “correct”."
},
{
"start_index": "position of the first character in the
sentence that is represented in this entry.",
"end_index": "position of the last character in the
sentence that is represented in this entry.",
"text": "set of characters in the word that
corresponds with the phoneme(s) of this entry.",
"trans": "phonetic transcription of the sounds
represented in this entry, in API format.",
"trans_arpabet": "phonetic transcription of the sounds
represented in this entry using ArpaBET format.",
"start_time": "time (in seconds) in which the sound in
this entry starts within the audio recording.",
"end_time": "time (in seconds) in which the sound in
this entry ends within the audio recording.",
"decision": "string indicating whether the user
correctly pronounced the phonemes in this entry. Possible values are “correct”,
“warning” or “error”.",
"nativeness_score": "score achieved by the user for
produced by the user. Its value is “high” if the user stressed the syllable and
“low” otherwise."
},
{
"start_index": "position of the first character in the
sentence that is represented in this entry.",
"end_index": "position of the last character in the
sentence that is represented in this entry.",
"decision": "string indicating whether the user
correctly stressed the syllable. Possible values are “correct” or “incorrect”.",
"stress_level_measured": "string indicating the stress
produced by the user. Its value is “high” if the user stressed the syllable and
“low” otherwise."
}
]
}
]
}
],
"success": "flag indicating whether the call was successful; it does not say
anything about the content."
}
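
As an illustration of how the per-phoneme fields might be used together, the Python sketch below picks out the parts of a sentence whose entry was not marked “correct”. It is a hedged example under stated assumptions: the function name `mispronounced_spans` is hypothetical, the caller is assumed to have already extracted the list of phoneme entries from the response, and start_index and end_index are assumed to be zero-based with end_index inclusive (the description says "last character", but the base is not stated).

```python
from typing import Dict, List


def mispronounced_spans(sentence: str, entries: List[dict]) -> List[Dict[str, object]]:
    """Collect the parts of `sentence` whose phoneme entry was not marked "correct"."""
    spans = []
    for entry in entries:
        if entry.get("decision") == "correct":
            continue
        start, end = entry["start_index"], entry["end_index"]
        spans.append({
            # Assumes zero-based indices with an inclusive end_index.
            "text": sentence[start:end + 1],
            "decision": entry.get("decision"),
            "nativeness_score": entry.get("nativeness_score"),
        })
    return spans
```

The returned list keeps the decision and nativeness_score next to each text span, so an application can, for example, highlight “warning” and “error” entries differently.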