p 
 smart-turn-v2
Voice Activity Detection • pipecat-aiAn open source, community-driven, native audio turn detection model in 2nd version
| Model Info | |
|---|---|
| Batch | Yes | 
| Real-time | Yes | 
| Unit Pricing | $0.00034 per audio minute | 
Parameters
* indicates a required field
Input
-  
0object-  
audioobject requiredreadable stream with audio data and content-type specified for that data
-  
bodyobject required -  
contentTypestring required 
 -  
 -  
dtypestringtype of data PCM data that's sent to the inference server as raw array
 
 -  
 -  
1object-  
audiostring requiredbase64 encoded audio data
 -  
dtypestringtype of data PCM data that's sent to the inference server as raw array
 
 -  
 
Output
-  
is_completebooleanif true, end-of-turn was detected
 -  
probabilitynumberprobability of the end-of-turn detection
 
API Schemas
The following schemas are based on JSON Schema
{    "type": "object",    "oneOf": [        {            "properties": {                "audio": {                    "type": "object",                    "description": "readable stream with audio data and content-type specified for that data",                    "properties": {                        "body": {                            "type": "object"                        },                        "contentType": {                            "type": "string"                        }                    },                    "required": [                        "body",                        "contentType"                    ]                },                "dtype": {                    "type": "string",                    "description": "type of data PCM data that's sent to the inference server as raw array",                    "enum": [                        "uint8",                        "float32",                        "float64"                    ]                }            },            "required": [                "audio"            ]        },        {            "properties": {                "audio": {                    "type": "string",                    "description": "base64 encoded audio data"                },                "dtype": {                    "type": "string",                    "description": "type of data PCM data that's sent to the inference server as raw array",                    "enum": [                        "uint8",                        "float32",                        "float64"                    ]                }            },            "required": [                "audio"            ]        }    ]}{    "type": "object",    "contentType": "application/json",    "properties": {        "is_complete": {            "type": "boolean",            "description": "if true, end-of-turn was detected"        },        "probability": {            "type": "number",            "description": "probability of the end-of-turn detection"        }    }}Was this helpful?
- Resources
 - API
 - New to Cloudflare?
 - Directory
 - Sponsorships
 - Open Source
 
- Support
 - Help Center
 - System Status
 - Compliance
 - GDPR
 
- Company
 - cloudflare.com
 - Our team
 - Careers
 
- © 2025 Cloudflare, Inc.
 - Privacy Policy
 - Terms of Use
 - Report Security Issues
 - Trademark