I am using the node sdk to use the IBM watson speech-to-text module. After sending the audio sample and receiving a response, the confidence factor looks weird.
{
"results": [
{
"word_alternatives": [
{
"start_time": 3.31,
"alternatives": [
{
"confidence": 0.7563,
"word": "you"
},
{
"confidence": 0.0254,
"word": "look"
},
{
"confidence": 0.0142,
"word": "Lou"
},
{
"confidence": 0.0118,
"word": "we"
}
],
"end_time": 3.43
},
...
and
...
],
"alternatives": [
{
"word_confidence": [
[
"you",
0.36485132893469713
],
...
and I am asking for recognition with this config:
var params = {
audio: fs.createReadStream(req.file.path),
content_type: 'audio/wav',
'interim_results': false,
'word_confidence': true,
'timestamps': true,
'max_alternatives': 3,
'continuous': true,
'word_alternatives_threshold': 0.01,
'smart_formatting': true
};
Notice how the confidence factors for the word "you" is different in both places. Is one of these numbers something different? What is going on here?